Gene NATL1_18201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18201 
Symbol 
ID4780069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1484472 
End bp1485764 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content28% 
IMG OID640085109 
Producthypothetical protein 
Protein accessionYP_001015640 
Protein GI124026525 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID[TIGR01444] methyltransferase, FkbM family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.48017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCTA GTGATGAAGA TCAAGGGGAA AAGAATAAAC CTCAAGTCCA AATATTTACT 
GTTCCATTTA CTTTAGGTGA AATCAAAGAA AATATTAATA TTAATATTAA TACTCCTTCT
AAACCTTCTA AAGAGCAAAT CATTAATCAA GCATTTAAAT TTCATTCACA AGGAAACATT
TCAGAAGCAG CAAAATATTA TCAATATTGC ATAAATCAAG CATTCAAAGA TTACAGAGTC
TTTACTAATT ATGGAGTCAT TTTAAAAAAA TTTGGAAAAT TGCAAGAAGC AGAAAAATTT
CAACGCGAAG CAATTCAAAT TAATCCTAAT TTCGCAGAAG CATATTCCAA TCTAGGTAAT
ATATTGAGAG ATCTTGGCCA ATTAAAAGAA GCAGAGTTAT CCTTTCGCAA AGCTATTGAA
ATCAAATCTG ATTACGCAGA AGCATATTCC AATCTAGGTA ATATATTGAG AGATCTTGGT
CAATTAAAAG AAGCAGAGTT ATCCTTTCGC AAAGCTATTG AAATCAAACC TGATTACGCA
GAAGCACATT CCAATCTAGG TAATATATTG AGTGATCTTG GCATCAAAAA AGAAGCGAAA
TTAGAAAAGC AGAAAAGCAT TGACATCAAT TTTATTCAAA ATATTAACTC TAAATATAGA
GAAGATTGTA AAAGGAAAAT CAGTCTATCA AAATCACAAT TCAGACAAGA TTTATTTGCC
CTGACTGAGT TGGGTTTTAA AAAAAATGGT TTTTTTGTAG AATTTGGTTC TTATGATGGC
CTGACTGGTT CTAATTCTTA TCTATTAGAG AAATTTTTTA ATTGGGATGG GATATTAGCT
GAACCAATGA TTTCCTGTCA TGAAAAGTTA AAAAATAATC GTTCAGTACG TATAGAAACC
AAATGTGTAT GGAAATCATC TGGTCAAGAG ATCTTATTTA ACGAACCTTT TTCATATAAG
CAAATGGCTA CAATAGATAG TTTTTCGGAG GAGAAACCAG ATTATTATAA GGAAGGAAAA
AGATATATTG TTAAATCAAT TTCATTATTA GATTTATTAG ATATAAATAA TGCACCAAAG
TTTATAGATT ATTTATCTAT AGATACAGAA GGAAGTGAAT ATGATATTTT AAATGCTTTT
GATTTTGAGA AATATAAATT TAGGGTAATT ACCTGTGAGC ATAATTTTAC TCCTATGAGA
GAAAAAATTT ATAACCTTTT AATTAATAAC GGCTATAAAA GAAAATTAAC AAATTTATCA
AGAGTTGATG ATTGGTATAT TTATGAATCC TAA
 
Protein sequence
MESSDEDQGE KNKPQVQIFT VPFTLGEIKE NINININTPS KPSKEQIINQ AFKFHSQGNI 
SEAAKYYQYC INQAFKDYRV FTNYGVILKK FGKLQEAEKF QREAIQINPN FAEAYSNLGN
ILRDLGQLKE AELSFRKAIE IKSDYAEAYS NLGNILRDLG QLKEAELSFR KAIEIKPDYA
EAHSNLGNIL SDLGIKKEAK LEKQKSIDIN FIQNINSKYR EDCKRKISLS KSQFRQDLFA
LTELGFKKNG FFVEFGSYDG LTGSNSYLLE KFFNWDGILA EPMISCHEKL KNNRSVRIET
KCVWKSSGQE ILFNEPFSYK QMATIDSFSE EKPDYYKEGK RYIVKSISLL DLLDINNAPK
FIDYLSIDTE GSEYDILNAF DFEKYKFRVI TCEHNFTPMR EKIYNLLINN GYKRKLTNLS
RVDDWYIYES