Gene NATL1_20471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20471 
SymbollldD 
ID4779909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1690320 
End bp1691516 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content41% 
IMG OID640085341 
ProductL-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
Protein accessionYP_001015867 
Protein GI124026752 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.819941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGGAG CAGATGTTTC CTCTCCTGGG GTACTTAATA TTGATGACCT CAGGTCTAGA 
GCTAAAAATC GTCTGCCGGC GATGGTTTTT AACTATATAG ATAGTGGTGC AGATAGAGAA
CAAACACTTT CGCAAAATTG CAACGCATAT AATGAAATTT TATTTAGACC TAGATGTGCA
GTTTCTGTTC CATCGTGTGA GCTTGGAATA TCTGTTTTAG ATCAGCAATT TCAACTTCCT
TTCTTGTTGG GACCAGTAGG GAGTAGCAGA ATGTTCTATC CTCAAGGAGA AGTTGTTGCA
GCTAGAGAGG CAGGAAAAGC TGGAACTGGA TATACCTTGT CGATTCTCTC AGGTTGTTTA
TTAGAAGACG TTAAAGCTGC TACAAACGGA CCAGCTTGGT ATCAGCTTTA TTTACTTGGT
GGTAAAGAAG TCGCTTTAAA AACAATTGCT AGAGCTAAAG AAGCTGGATT CTCAGCAATA
GTTGTAACTA TTGATACACC CGTATCTGGT TTGAGGGAAA GAGATATGCG ATCAGGAACC
CAACAGCTTT TATCAATGAA TCCTTTGGAG ATGCTTCCTT ATATTCCTCA AATATTAGTT
AAACCATGCT GGATGACTCA ATGGTTAAGT GATGGAGGCT TAATGAGTTT TCCTAATGTT
CAACTAGATG ATGGCCCTAT GGGATACACG GCAATTGGTC CTGCTTTAGA GCAATCAGTG
GTTACTTGGG ATGATCTTCA ATGGATAAGA GAAGCGTGGG GTGGAAAAAT TATTGTTAAG
GGTATACATA TTGGCGATGA CGCAAAAAAA GCGGTAGAGC TAGGGGCTGA TGCGATCGTT
ATTTCTAATC ATGGAGCCAG GCAACTTGAT AGCGTTGCTC CCACGATCCG TGTTTTGCCC
GAAATTTTAG CTGCAGTTGA TGGGAAAATA GATGTGTTGC TAGATGGAGG TATTCGCAGG
GGTAGTGATG TTGTTAAAGC ATTATGTCTT GGAGCGAAAG GAGTTTTGAT CGGTAGAGCA
TATGCGTATG GACTTGCTGC TGCAGGAGGG AAAGGCGTTG CCAGAGCTAT AGAAATTCTT
CAAACAGATA TAGTGAGAAC TATGAAACTA TTGGGATGTG GGTCTGTTGC CGATTTAAAT
AAATCTTATA TTCAAGTTCC TGAAAGTTGG GAGAGATTCG AAAAAATCTT TGATTGA
 
Protein sequence
MLGADVSSPG VLNIDDLRSR AKNRLPAMVF NYIDSGADRE QTLSQNCNAY NEILFRPRCA 
VSVPSCELGI SVLDQQFQLP FLLGPVGSSR MFYPQGEVVA AREAGKAGTG YTLSILSGCL
LEDVKAATNG PAWYQLYLLG GKEVALKTIA RAKEAGFSAI VVTIDTPVSG LRERDMRSGT
QQLLSMNPLE MLPYIPQILV KPCWMTQWLS DGGLMSFPNV QLDDGPMGYT AIGPALEQSV
VTWDDLQWIR EAWGGKIIVK GIHIGDDAKK AVELGADAIV ISNHGARQLD SVAPTIRVLP
EILAAVDGKI DVLLDGGIRR GSDVVKALCL GAKGVLIGRA YAYGLAAAGG KGVARAIEIL
QTDIVRTMKL LGCGSVADLN KSYIQVPESW ERFEKIFD