Gene NATL1_18211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18211 
Symbol 
ID4780068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1485867 
End bp1487678 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content33% 
IMG OID640085110 
Producthypothetical protein 
Protein accessionYP_001015641 
Protein GI124026526 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.905605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAC CTCATATAGA AGAGAACGGA AAGAAAAAAG TAACTGAAGG AAAAACATTC 
CCAGTTCCAT TTAATTTAGA AGAGCTTAAT AAAAAAAGAT CTATTACCAC GAAGACTTCT
TATAAACCTT CTAAAGAACA AATCATTAAT CAAGCATTTA AATTTCATTC ACAAGGAAAC
ATTTCAGAAG CAGCAAAATA TTATCAATAT TGCATAAATC AAGCATTCAA GGATTACAGA
GTCTTTACTA ATTATGGAGT CATTTTAAAA AAATTTGGAA AATTGAAAGA AGCAGAAAAA
TGTCAACGCG AAGCAATTCA AATTAATCCT AATTTCGCAG AAGCATATTC CAATCTAGGT
AATATATTGA GTGATCTTGG TCAATTAAAA GAAGCAGAGT TATCCTTTCG CAAAGCCATT
GAAATCAAAT CTGATTACGC AGAAGCACAT TCAAATCTAG GTAATATATT GAGAGATTTT
GGTCAATTAA AAGAAGCAGA GTTATCCTTT CGCAAAGCTA TTGAAATCAA ATCTGATTAC
GCAGAAGCAC ATTCAAATCT AGGTAATATA TTGAATGATC TTGGTCAATT AAAAGAAGCA
GAGTTATCCT TTCGCAAAGC TATTGAAATT AAACCTGATT TCGCAAATAC TCATAATAAT
CTAGGAATCA TATTGAGTGA TCTTGATCAA TTAAAAGAAG CAGAGTTATC CTTTCGCAAA
GCTATTGAAA TCAAACCTGA TTTCATAAAA GCATACTCAA ATCTTGGGAA CATATTGAGA
GATCTTGGTC AATTAAAAGA AGCAGAGTTA TCCTTTCGCA AAGCTATTAA AATCAAACCT
GATTACGCAG AAGCTTATTT CAACCTTGCT TACTTAGAGC TGCTAAAAGG CAATTATAAA
TCTGGACTAA AGAACTATGA GTTTAGGTTC AAAAAAAAGC AACCTACTGT TCCTCACATC
AAAACAAAAT TGAAAAGAGT CACTAATGAA CAATTCCAAA AAGGAGAGAA ACTTTTAGTT
GTTAGTGAGC AAGGTTTAGG AGATACTCTA CAATATATGC GATACATACC ATATCTGCAA
AATCAAGGTT TTGATATTAA TTTTTGTGCC CAGACACAAC TCCATTCATT GATCAAAGCA
TCAGCTATAA GTGCAGATCC ATTAATAACA GAACAAGCAA ATCAACTTTC ACTAGGACAA
TGGATACCAT TATTATCCTT ACCTAAATAT CTACAAGTAA GCCCAAATAA TCCAATCATC
TCAGAACCAT ATATTAACTC CACAGATAAT CTAACAAAGA AGTGGAAAAA GATACTCTCT
AAAGAAAAAA GAGCAGTCAT TGGTATTAAT TGGCAAGGTA ATCCAACAAT GGAAAAAAGT
TTTTATCAAG GGCGGTCAAT ACCTCTCGAA ATATTTTCAA TCCTTCCTGA ACATAATGAA
ATCACAATGC TCTCATTACA AAAAGGATTT GGATCGGAAC AATTAGAGAA TTGCTCTTTC
AAAAATAAAT TTGTTAGCTG CCAATCAGAA ATTGATTCCA CTTGGGATTT TTTAGAAAAT
GCTGCCATTA TTGATAACTG TGATTTAATT ATTACTTGCG ATACTTCGAT TGCTCATTTA
GCTGGAGGGA TGGGAAAGAA AGTTTGGTTA CTTTTAAGAG ATATCCCTTA TTGGACTTGG
GGACTTGACG GGGAAACTAC ATTTTGGTAT CCATCGATGA GATTATTTCG ACAAAATAAA
CGTCACAATT GGCAAGAGGT TATGGAGAGA GTATCAAGCG CACTTGAAAA AGAAATTGAG
GCAAAAGTAT GA
 
Protein sequence
MDGPHIEENG KKKVTEGKTF PVPFNLEELN KKRSITTKTS YKPSKEQIIN QAFKFHSQGN 
ISEAAKYYQY CINQAFKDYR VFTNYGVILK KFGKLKEAEK CQREAIQINP NFAEAYSNLG
NILSDLGQLK EAELSFRKAI EIKSDYAEAH SNLGNILRDF GQLKEAELSF RKAIEIKSDY
AEAHSNLGNI LNDLGQLKEA ELSFRKAIEI KPDFANTHNN LGIILSDLDQ LKEAELSFRK
AIEIKPDFIK AYSNLGNILR DLGQLKEAEL SFRKAIKIKP DYAEAYFNLA YLELLKGNYK
SGLKNYEFRF KKKQPTVPHI KTKLKRVTNE QFQKGEKLLV VSEQGLGDTL QYMRYIPYLQ
NQGFDINFCA QTQLHSLIKA SAISADPLIT EQANQLSLGQ WIPLLSLPKY LQVSPNNPII
SEPYINSTDN LTKKWKKILS KEKRAVIGIN WQGNPTMEKS FYQGRSIPLE IFSILPEHNE
ITMLSLQKGF GSEQLENCSF KNKFVSCQSE IDSTWDFLEN AAIIDNCDLI ITCDTSIAHL
AGGMGKKVWL LLRDIPYWTW GLDGETTFWY PSMRLFRQNK RHNWQEVMER VSSALEKEIE
AKV