Gene NATL1_00731 details

Gene Information

Locus tag: NATL1_00731
Symbol:
ID: 4780807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 77837
End bp: 80287
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 31%
IMG OID: 640083336
Product: hypothetical protein
Protein accession: YP_001013902
Protein GI: 124024786
COG category: [N] Cell motility
              [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
        [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
TIGRFAM ID:
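
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(77837, 80287)
print(length)                  # 2451
print(protein_length(length))  # 816
```

Both values agree with the Gene Length and Protein Length fields in the record.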


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.456129
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGGAGGGC AAGAAGAAAA GAAAAAAGTC ACGGAAGTGA AAACATTCCC AGTGCCATTT 
GCTTTAGGAG AAATTAAAGA AAATATTACT ATTAATACCA ATATTTCTTC TAAACTCTCA
GAACTTTCTA AAGGACAAAT AATTAATCAA GCAATTCAGT TTCATGCACA AGGAAATATC
CAAAAAGCAG CAAAATATTA TCAATATTTT ATTGATCAAG GTTTCAAAGA TCCTAGAGTT
CTTGCTAATT ATGGAGTCAT ATTGAAAGGT TTTGGTAATT CACAGGAAGC AGAATTGTTA
TACCGCAAAG CAATTGAACT GAATCCTAAT TTTGCAGACG CGCATTACAA TCTAGGAAAC
ACATTGAGAG ATCTTGGTAA ATTAAAAGAA GCAGAATTGT CATACCGTAA AGCTATTGAA
ATTAGTCCTA ATTACGCAAA TACACTTTAC AACCTTGGAA CTATATTGAG TGATCTTGGT
AAATTGCAAG ATGCAGAATT TTCATACCGC CAAGCTATTA TAATTAATCC TAATTATACA
GAAGCTCATT ACAATCTAGG AAACACATTG AGAGATCTTG GCAAATTAAA AGATGCTGAA
TTATCATACC GTAAAGCTAT AAAAATTAGC CCTAATTACG CAAAAGTACA TTGCAATCTG
GGAACAATAT TGAGAGATCT TGGTAAATTA AAAGATGCTG AATTATATAC CCGTAAAGCA
ATTCAGCTGA ATCCTGATTT CGCAGAAGCG TATTCTAACC TAGGAAATAT ATTGAGTGAT
CTTGGTAATT TAAAAGAAGC AGAAATCTCA CAAAAAAAAA CAATTGAACT TAAACCTGAT
TGCGCAGAAG CGCATTCCAA TCTTGGTAAC ATATTGAGAG ATCTTGGTAA ATTAAAAGAT
GCAGAGTTGT CGTACCGCAA AGCTATTGAA ATTAGTCCTA ATTACGCAAA TGCGCATTCC
AATCTTGGCA ACATATTGAG AGATCTTGGT AAATTAAAAG GTGCAGAGTT GTCGTACCGC
AAAGCTATTG AAATTAGTCC TAATTACGCA AACGCACATT ACAACCTTGG AAATATATTG
AAAGATATTG GTAATTTCGG TGATGCTCTA AAGCAATTCA AGCAGGCACT AAAATTAAAC
AATGAATTAT CATTAGCGAA GTATGCTTTA ATTATAACCA AAGGCAAGAT ATGCGATTGG
AGCGATGAAG TTACTCATAA TATATGGCTT AAATCACTTG GGATCCAAGG GAAATCTATT
GAACCATTGG GATTATTTCC ATTAGAAGAT AATCCGTCAA ACCATTTAAA AAGATCAAAA
AATTATTATA TAGAAAACTT TACTCGACCA AGTAAATGTA TTCAATCTTA CGAGAAAAAT
ATAATACATA TTGGTTATTT CTCTGCTGAT TTTCGGACCC ATCCTGTAAT GCAAATGCTT
GCTCCTCTAA TAGAGATACA TGATAAGTCT AGATTTAAAA TATATTTATA TTCATTAGCA
AAAAAAGAAG ATGAGTATAC TGAAAGAGCA AAAATGTCTG GATGTATATT TAAAAATATA
ACAGAATTAA ATGATATTGA AGCAGTTGAA TTAGCGAGAA GTGACAAATT AGATATTGCC
GTAGATCTAA TGGGATATAC TCGAAATAAT AGAATGCCTA TATTCTCATA TAGGGTTGCA
CCAATACAAA TCAATTATTT AGGTTTTCCT GGCTCAACTG GCTCTGATAC TATTGATTAT
ATTATCGGCG ATAATATAAC TATACCTAGA GAAAATGAGA AATTTTATAC TGAAAAAATA
ATAAGAATGC CAAATTGCTT TCTATGTGAT GATAATAAAA AAGAAATTTC TAAAGAGTCA
ATATGCCGTA AAGATTTTAA TCTTCCTGAT CAAGGCTTTA TATTTACTTG TTTTAATGAA
AATTATAAAA TAACAAAAAA AGAATTTAAC ATATGGATGA ACTTACTTAT AAAAATAGAA
GGGAGTGTTC TTTGGTTATA TAAGTCAAAT CAATGCTCAA TGAATAATTT ATATAAGGAA
GCTAGAAAAC GTAAAGTTAA TCCCGACAGA TTAATATTTG CTGAGAGATT AGCAATGAAC
AAACATCTTC CTAGGCATTC TCTAGGTGAT TTAGCACTTG ATACTTTTAA TTACAATGGA
GGTGCGACAA CTTCTTGTGC ATTATTGGCT GGGCTACCAG TCCTGACCAA GATAGGTCAG
AGTTTTATGG CTAGAGTATC TGCCAGTCTC CTTTCTTCGA TAGGACTTTC CGAATTAATT
ACTTATAGTG AAAGTGAATA CGAAGAAAAA GCATTATATA TTGCTAATAA CCCTAAAGAA
ATTCTTAGAT TGAAATCTAA ATTGAACAAA CTAAAAGAAA CTTCAACACT TTTTAATTCA
GAATTATTTA CAAGAGATTT AGAGAGTAAA TTTATAGAAT TAGTGAAGTA A
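
The GC content reported for this gene (31%) is the share of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name is hypothetical, and it assumes the sequence is pasted as plain text with spaces and line breaks between the 10-base blocks.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Small demonstration on toy input; pass the full 2451 bp gene sequence
# to compare against the GC content field of the record.
print(gc_percent("ATGC ATGC"))  # 50.0
```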
 
Protein sequence
MGGQEEKKKV TEVKTFPVPF ALGEIKENIT INTNISSKLS ELSKGQIINQ AIQFHAQGNI 
QKAAKYYQYF IDQGFKDPRV LANYGVILKG FGNSQEAELL YRKAIELNPN FADAHYNLGN
TLRDLGKLKE AELSYRKAIE ISPNYANTLY NLGTILSDLG KLQDAEFSYR QAIIINPNYT
EAHYNLGNTL RDLGKLKDAE LSYRKAIKIS PNYAKVHCNL GTILRDLGKL KDAELYTRKA
IQLNPDFAEA YSNLGNILSD LGNLKEAEIS QKKTIELKPD CAEAHSNLGN ILRDLGKLKD
AELSYRKAIE ISPNYANAHS NLGNILRDLG KLKGAELSYR KAIEISPNYA NAHYNLGNIL
KDIGNFGDAL KQFKQALKLN NELSLAKYAL IITKGKICDW SDEVTHNIWL KSLGIQGKSI
EPLGLFPLED NPSNHLKRSK NYYIENFTRP SKCIQSYEKN IIHIGYFSAD FRTHPVMQML
APLIEIHDKS RFKIYLYSLA KKEDEYTERA KMSGCIFKNI TELNDIEAVE LARSDKLDIA
VDLMGYTRNN RMPIFSYRVA PIQINYLGFP GSTGSDTIDY IIGDNITIPR ENEKFYTEKI
IRMPNCFLCD DNKKEISKES ICRKDFNLPD QGFIFTCFNE NYKITKKEFN IWMNLLIKIE
GSVLWLYKSN QCSMNNLYKE ARKRKVNPDR LIFAERLAMN KHLPRHSLGD LALDTFNYNG
GATTSCALLA GLPVLTKIGQ SFMARVSASL LSSIGLSELI TYSESEYEEK ALYIANNPKE
ILRLKSKLNK LKETSTLFNS ELFTRDLESK FIELVK
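
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Note that the gene starts with GTG, which is read as methionine (M) when it serves as the start codon. The sketch below illustrates this on the first 60 bases; the names are hypothetical, and the set of initiator codons is a simplifying assumption (only ATG, GTG and TTG are treated as starts here).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The sense-codon
# assignments of table 11 match the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of initiator codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator codon in first position as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGGGAGGGC AAGAAGAAAA GAAAAAAGTC ACGGAAGTGA AAACATTCCC AGTGCCATTT"
print(translate(first_line))  # MGGQEEKKKVTEVKTFPVPF
```

The output matches the first 20 residues of the protein sequence listed above.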