Gene NATL1_07121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07121 
Symbol 
ID4780964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp654667 
End bp656310 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content36% 
IMG OID640083986 
Productglucose-methanol-choline (GMC) oxidoreductase:NAD binding site 
Protein accessionYP_001014535 
Protein GI124025419 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.569172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.139328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGACA AGCCCTATGA AGTCATAATT ATCGGTTCAG GTGCAACCGG GGGAATGGCA 
GCCTTAACAA TGGCTAAGGC TGGAGTAAGA GTACTAGTAA TAGAAAGAGG TCCTGAACTA
GAGATCAAGC AGGCAAACGG AACAGAGCCT TGCAATATGA TTCGGAGACT TTTAGGGGTA
ACAACTGGAA ATTATCAAAA TCAACCTCAA CATCCAGGGT TCTGGAAATC AAATCCCTTA
CTGTACGCAA ACAAAAAAGC AAATCCTTAT ACACACCCGC CAAAAGCCCC CTTCATGTGG
ACGCAAGGCA ATCAAGTTGG TGGAAGGAGC CTTACTTGGG GAGGGATAAC CTTAAGATTA
TCTGAGGAAG ATTTTGAAGC CTCAAAAGAA AAAGAATACA ATCTCGAATG GCCTATTAGT
TACAAAGACC TTGAGTCACA TTATTCGGAA ATAGAGAGTT TTCTAAAAAT ATACGGCAAC
AAAGACGATC TCAATCAACT ACCTAATGGT GAATATATTG GCAAACTTCC ATTTACAGAA
AGTGAATCAA GATTTGCCTC CAATATAAAA GAAAATTTAA ATCTTCCCTT TATACATTCC
AGAGGGTTTG GACCAAATGA AGATAAAACG AAATGGCCAA GATATAGCAG TTTAGGCAGC
ACATTAAAAG AAGCCGCTAG GCTAGGTAAA ATAGAAATAC TTTCAAATCA TATTGTAGAT
AAATTAGTTT TGAATAAGGA TAGGAAGTCT GCAAAAAGTA TTATTGTAGT AAACCAAAAA
AATGGAGAAA GAAGTGAATT AGAAAGTAAA TTAATAATAC TATGTTCATC AACAATCCAA
ACAATTAGAA TTCTATTAAG TTCCGAAGAA AGTAATAATT CAAATGGTCT AATTGACCCC
TCTAATTCAT TAGGAATGAA CTTAATGGAT CATATATCAA CCTGTAGGTT TTTCACTGTC
CCTATCGATA AGAATTTCAA TGATTATTCA GATAAAAATA ATAATCATCT TTTAACAGGA
GCAGGTAGCT TCTTTATTCC CATCGGTAGA GAGAGCTCAA CTAAAAAAAA CTTTGTTGGT
GGATATGGCA TCTGGGGAGG AATAGATAGA TTTGAACCAC CAGAAGTTTT TAAGAAATAT
AAAAACACAA AAACTGGTTT CCTTATTGGT CATGGAGAAG TACTCCCAAA TAAAAAAAAC
ACTGTTTCTC TCTCAAATAC CAATGATCTA TATGGTATTT CCATACCGCA CATATCAATA
GTTTGGCGAG AAAATGAGAA ACGAATGGTT TCAGAAATGA ACAGAATGAT TGAACTTATT
ATTAATTCTG GTAATGGCAA AATTATTCCA GTGAATGAGA TTCTCAATAT TCCATTTACC
AAACAAATTT TAAGCAAATC AGTTGCTATT AAAAGCGATG CTCCACCCCC TGGTTACTAC
ATACACGAGG TAGGAGGAGC ACCAATGGGA AATGACAAAG GAAATAGTGT TTTAGACAAC
TGGAATCGTC TATGGGAATG CAACAATGTA TTAGTAGTTG ATGGAGCTTG CTGGCCTACT
TCATCTTGGC AAAGCCCAAC ATTAACAATG ATGGCAATAA CTAAGAGAGC TTGCGAAAAG
GCAATTAGAG ACTTTAAAGG TTAA
 
Protein sequence
MIDKPYEVII IGSGATGGMA ALTMAKAGVR VLVIERGPEL EIKQANGTEP CNMIRRLLGV 
TTGNYQNQPQ HPGFWKSNPL LYANKKANPY THPPKAPFMW TQGNQVGGRS LTWGGITLRL
SEEDFEASKE KEYNLEWPIS YKDLESHYSE IESFLKIYGN KDDLNQLPNG EYIGKLPFTE
SESRFASNIK ENLNLPFIHS RGFGPNEDKT KWPRYSSLGS TLKEAARLGK IEILSNHIVD
KLVLNKDRKS AKSIIVVNQK NGERSELESK LIILCSSTIQ TIRILLSSEE SNNSNGLIDP
SNSLGMNLMD HISTCRFFTV PIDKNFNDYS DKNNNHLLTG AGSFFIPIGR ESSTKKNFVG
GYGIWGGIDR FEPPEVFKKY KNTKTGFLIG HGEVLPNKKN TVSLSNTNDL YGISIPHISI
VWRENEKRMV SEMNRMIELI INSGNGKIIP VNEILNIPFT KQILSKSVAI KSDAPPPGYY
IHEVGGAPMG NDKGNSVLDN WNRLWECNNV LVVDGACWPT SSWQSPTLTM MAITKRACEK
AIRDFKG