Gene NATL1_01471 details

Gene Information

Locus tag: NATL1_01471
Symbol:
ID: 4780016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 144458
End bp: 145699
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 33%
IMG OID: 640083411
Product: putative NADH dehydrogenase, transport associated
Protein accession: YP_001013976
Protein GI: 124024860
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
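
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(144458, 145699)
print(gene_len)                     # 1242
print(protein_length_aa(gene_len))  # 413
```

Under these assumptions the coordinates reproduce the recorded Gene Length (1242 bp) and Protein Length (413 aa).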


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTTTTG AATGCCGAAT TAGTCAAAGT TTGTTTAGGG TCTTTGAAAT CATCTTAGAT 
CCAGTCCCTA TGGACTTGAA CAATTTGAAA TCTGATCCAA TTGTTGTTGT TGGTGGTGGC
TTCGGAGGAC TCTCCACTGT CCAAGCATTA TTAGCTCGGT CGAATGGAAT GCCAATCATT
TTGATTGATC AAAGCCCAAG ATTTCTTTTT AAGCCATTGC TTTATGAATT ATTAAGTGGT
GAGCTTGAAT TATGGGAGGT TGCACCTAAA TATTCTGCTT TAGCTTCAGA ATTAGGATTT
ATTTTTTTAG AAGAGTGTGT TGTTGAAGTT GATGGACTAG AGAGAAAGTT AATTACTTCA
TCGGGGACTA AAGTTAAATA CTCTCAACTT GTCATAAGCA CTGGGGTAAC AACCGATTTT
TCTCTTCTCC GAGATTTGAA AGAATATGCG TACGGTTTCT CTAGTTTGAA TGATCTCGTA
AGAATTCAAG AGTTAATAAT CTCAATCAAT AATTCTTCTA ACCACTCCAA TCCTTTGATT
ATCGCTGGAG CAGGACCAAC TGGTGTTGAG CTCGCATGCA AATTATCTGA TTTAGTTAAT
AATAGAGTAG AAATATACTT GGTTGATAAA GGAAATAAAA TTCTATCTAA ATCCAAATCT
TTTAATAGGG AAAAAGCAAT AGATGCAATA GCTGAGAGAA ATATCAAGAT TTATTTGGAA
CATTATATTG AATCAATAAA TGAAAATACT ATAGAACTTT CTACTGTTGA GACCGAAAGA
AATAATTCTC TAAAAATTAA TTATTCTGGC TTGTTGTGGA CTGCTGGATT AAGCCCTTGT
CGGTTACCAT TTATAGATCA TCTTTTAGAT GAAAATAAGA AGATTAAAGT AAACAAATTT
TTGCAAATAA AAGAATATCA GAATATTTTT TTTGTTGGAG ATATCGTATT TTGTGAGGAC
GTTCCTTTTC CTTCTTCAGC TCAAGTAGCA ATGCAGCAGG GTTCTTTAAC AGCTCAAAAT
ATTATTTCTC TAAGAAAAGG CAACAAACTT AAATCATTTC AATTCGAAGA TCTTGGAGAA
ATGTTGAGTC TAGGTATTGG AAATGCATCA ATAACTGGTT ATGGAGTTAC TTTGGCAGGA
TCTCTTGCTT CCAAAATAAG GCATTTTGCA TATTTAATGC GAATGCCAGG TTTTTCTCTA
TTTTTGAAAT CTGCAGGATC ACGGTTATTA AGTAAAAAAT AA
 
Protein sequence
MSFECRISQS LFRVFEIILD PVPMDLNNLK SDPIVVVGGG FGGLSTVQAL LARSNGMPII 
LIDQSPRFLF KPLLYELLSG ELELWEVAPK YSALASELGF IFLEECVVEV DGLERKLITS
SGTKVKYSQL VISTGVTTDF SLLRDLKEYA YGFSSLNDLV RIQELIISIN NSSNHSNPLI
IAGAGPTGVE LACKLSDLVN NRVEIYLVDK GNKILSKSKS FNREKAIDAI AERNIKIYLE
HYIESINENT IELSTVETER NNSLKINYSG LLWTAGLSPC RLPFIDHLLD ENKKIKVNKF
LQIKEYQNIF FVGDIVFCED VPFPSSAQVA MQQGSLTAQN IISLRKGNKL KSFQFEDLGE
MLSLGIGNAS ITGYGVTLAG SLASKIRHFA YLMRMPGFSL FLKSAGSRLL SKK
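
The gene and protein sequences above are related through translation table 11, and the gene begins with TTG rather than ATG. The following is a minimal sketch of that relationship, not the database's own pipeline. It assumes that a TTG, GTG, or ATG codon in the first position is read as methionine (a simplification covering only some of the table 11 initiators) and that table 11 uses the standard codon-to-amino-acid assignments elsewhere. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to share
# these for elongation and to differ only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified set of initiators (assumption, see the note above).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS, reading an initiator as M and stopping at a stop codon."""
    seq = clean(cds)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("TTGAGTTTTG AATGCCGAAT TAGTCAAAGT"))  # MSFECRISQS
```

Passing the full gene sequence to `translate` and `gc_percent` gives a way to compare against the listed protein sequence and the GC content field.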