Gene NATL1_00431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00431 
Symbol 
ID4779707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp45128 
End bp46435 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content32% 
IMG OID640083306 
Producthypothetical protein 
Protein accessionYP_001013872 
Protein GI124024756 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGAGT CGCAAAAAGA AGAGCAAAGA AAAAAGAATA TACCTCAAGT CCAAACATTT 
ACTGTTCCAT TTACCTCAGT AGAAGTCAAA GAAAACATCA CTATTACTAC TAATAATCCT
TCTAAACCTT CTAAAGAACA AATAATTAAT CAAGCAATTC AGTTTCATCT AAAAGGAAAT
ATTCCAAAAG CAACAAAATA TTATCAGCAG TTAATAAATC AAGAATGTAA TGATTACAGA
GTTTTTTCTA ATTATGGAGC AATTTTACAA GGTCTAGGTA AATCAAAAGA AGCAGAAGCC
TCTCTTCGCA AAGCAGTTGA ACTCAATCCT GATTTAGCCG AGTCGCATTC TTATCTGGGA
AACCTATTGA ATGATCTTGG AAAATTCAAA GAAGCAGAAG CCTCTCTTCG TAAAGCCGTT
GAACTCAATC CTAATTTAGC CTTGGCGCAT GCTTATCTGG GAATCCTATT GAATGATCTT
GGACAATTGA AAGAAGCAGA GGCTAGCCTA AAAAAAGCAA TAAAATTAAA ATTTGGCTCT
GTGAAAGCAT ATGATGCTTT ATCAAATGTT CTAAACAAAT TAGGGCGCAA AAAAGAAGCA
GAGGAAAGTT CTAAGAAGAT TTTATATTTA ACACCAGTCA GCGATTCAAA GTTAGAAGAT
AAAAGAAAAG GCCAAAATAA TATGTTTACG AAACCAAGTC CTATAGAGTA TCCAATTTTT
TACAGACCAG GTATGGGCAC AGAGAATGTT GGGAGTTTTC TTAGATCTAT GGTAATGATG
TTAAGACCAA AGAGAGTTTT AGAAATTGGA GCAGGTTATA CAACGCCTTT TTTACTAGAA
GCATTAATAA ATAATGAAAG AGTCTTTGAT GATGGGAATC TAAGTGAGTC TTATTTTCAA
AATTATAATT ATGAAGCAAA ATTAGTCGTT ATAGATAATC AATCTTTAGG TAACTTGACA
AAAGTTCCGG GAATGAAAGA TATAGTAAAT TCAAAGTATA CAGAATTTAT TGAAGGTACT
TTCGAAGGAA AGGCAAAAGA GTTATCTACA AAATATAGTC ATTTTGATTT CGTTTGGTTT
GACTGTGGAG GACCCAAGGA ATATCTAAGC TTTATTCAAG AATACTGGGA ATATTGTTCA
AACTACATTT TATTTCATTT TACTTATCGA GATGGAAAGC CAAATGTGAA TCACAAAATC
ATTCACGATA ATATAAAAGG AAATCCAGTT ATTATAGATA TTGTTGAACC ACACAAGAAA
CGACAAGGGA GTATAACAAT GGTGAAGAAA GAAAATTTTA AAAAATAA
 
Protein sequence
MEESQKEEQR KKNIPQVQTF TVPFTSVEVK ENITITTNNP SKPSKEQIIN QAIQFHLKGN 
IPKATKYYQQ LINQECNDYR VFSNYGAILQ GLGKSKEAEA SLRKAVELNP DLAESHSYLG
NLLNDLGKFK EAEASLRKAV ELNPNLALAH AYLGILLNDL GQLKEAEASL KKAIKLKFGS
VKAYDALSNV LNKLGRKKEA EESSKKILYL TPVSDSKLED KRKGQNNMFT KPSPIEYPIF
YRPGMGTENV GSFLRSMVMM LRPKRVLEIG AGYTTPFLLE ALINNERVFD DGNLSESYFQ
NYNYEAKLVV IDNQSLGNLT KVPGMKDIVN SKYTEFIEGT FEGKAKELST KYSHFDFVWF
DCGGPKEYLS FIQEYWEYCS NYILFHFTYR DGKPNVNHKI IHDNIKGNPV IIDIVEPHKK
RQGSITMVKK ENFKK