Gene NATL1_09561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_09561 
Symbol 
ID4780931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp879852 
End bp881501 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content27% 
IMG OID640084233 
Producthypothetical protein 
Protein accessionYP_001014779 
Protein GI124025663 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.241955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.10067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTC CTTTATCTAA TGTTATTAAC TATATGAAGA TTAAAGCAGT AGTTTTAAAT 
TTTTTCTTAA AGAACAAAAA AAATATTTTA CCATTCTTTG CCGTTCTTCT TTTTATATCA
CTTTATTTCC TTATCGATTT TTCTAGTCAA AGTCTTGTCG CTCATGACGA GGGACTTTAC
GCGAGAAGAG CAAGATTGAT TGAGTCCTCT GCTAATTGGT TTTCCCCCCC ATTCATATCC
CCTCACCATA AGACATTAGG AAGTTATTGG TTTATAGCAT TATCTATAAG GTTGTTTGGA
AACAGTGAAC TAGCTCTAAG GCTTCCAAGT ATTCTTTCTT CTTTCCTTTG CTTAATAACC
TCATACTTAA TAGCATTAAA AACAACAAAT AGTAAGTCTG CATTGATTTC TTTGTTCTCA
CTTTCATCAA TGCCATTATG GATTCAATAT TCAAGATATG CGAGTCCAGA CTTACCATTT
GTACTTTGTA CTCTATTAGC TATATTATTT TTCCTTAAAT CCCTAGATTC TTCAAAATAT
ATAAGTCAGT ATTTTAATTT ATTTTGCTCT GGCCTTTTTA TTTCTTCTTC TTTTTTTATT
AGAAGTTATA TGGCCTTTGT TCCTTTTATA GGACTAGCAC CTTTTATTTA TTATCATTTA
TTTAGAAAAG AATTTATTTT TAAACTTTTC TTTTGTACTG GAATAGCTGT TGGATTTATA
CCTACGTTTT TTAATCTGTA TTTTTCAGTA CAAAAGTTTG GCATCTCCGG AATTACAAGC
CTTTTTGATT TTGCAAAGAA ACAGGCTCTC GGTGAATTTG CTTTTAATAA TTTATTACTC
GTACCTGTTA ATTTTCTTTA TTTAACATTT CCTATTGGAA TATTACTTCT TATTCTTTTT
TTATTCACTG GATCCAATAA TAAGGCTAAC TATCCATTAT TGATTTATTG TTATCCTCTT
TCATCTTTAA TCTTACTATT ATGCATGTCT ACATCATATC CACATTATTA TCTTTTTCTC
TTACCTTCTT TATCTATAAT CTTTGCAAAC TACCTAACAT CTAATTCACC TAGATATTCA
TTTTCAAGCT CTATTATTAG ATATTTAGTC TTTATTGTAC TTTTAATAAT ATCATCTGTT
ATATTATTTT CAATTCTTAG ATTTTCAGAC CAAGTACTTC TTTATTCAAG AGGAACTCCC
GTAATAGTAT ATATTCTTAT CTCATTAATT GTGTTATCCT ATATTACTTC ACTTAGATTT
CTATTTGATT TTAAATATCC AAATTCTAAT ATAATTAATT TCTTTTATAA TATAATCATA
CCGCAATATA TTTCTTTATC ATTATTGTTT AATTTTGGTG TCCTAGGTAA TCCCAATCAT
AATACGAAAG TTTTCCTGAA GGATGCTGAT GTTTCATCAA TTATAAACTC TAATACTATT
TACCTATTTA GTGTTGAAAG TAAGATTCAG ACTTTACTAT CTTATTATTT ACCTTCCTCT
GTGATAGTAG ATGATTTTGA ATTAATTAGT AAATATAAGT ATGTAATTAC TTCTAATGCT
AATTCATTAG AGAAATTAAA ATTAAAACAA ATCTTTGTTC CTGTTAAAAA ATTTGACAAT
CACTTATTAT TAATGAATAT TGACTCATAG
 
Protein sequence
MQIPLSNVIN YMKIKAVVLN FFLKNKKNIL PFFAVLLFIS LYFLIDFSSQ SLVAHDEGLY 
ARRARLIESS ANWFSPPFIS PHHKTLGSYW FIALSIRLFG NSELALRLPS ILSSFLCLIT
SYLIALKTTN SKSALISLFS LSSMPLWIQY SRYASPDLPF VLCTLLAILF FLKSLDSSKY
ISQYFNLFCS GLFISSSFFI RSYMAFVPFI GLAPFIYYHL FRKEFIFKLF FCTGIAVGFI
PTFFNLYFSV QKFGISGITS LFDFAKKQAL GEFAFNNLLL VPVNFLYLTF PIGILLLILF
LFTGSNNKAN YPLLIYCYPL SSLILLLCMS TSYPHYYLFL LPSLSIIFAN YLTSNSPRYS
FSSSIIRYLV FIVLLIISSV ILFSILRFSD QVLLYSRGTP VIVYILISLI VLSYITSLRF
LFDFKYPNSN IINFFYNIII PQYISLSLLF NFGVLGNPNH NTKVFLKDAD VSSIINSNTI
YLFSVESKIQ TLLSYYLPSS VIVDDFELIS KYKYVITSNA NSLEKLKLKQ IFVPVKKFDN
HLLLMNIDS