Gene Lcho_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2003 
Symbol 
ID6162364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2170440 
End bp2171930 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content69% 
IMG OID641664772 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_001791035 
Protein GI171058686 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0000861925 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACATGA CCACCAACCC GCTGGCACTG CTGAACGACC CCGGCCTGCT CAAGACCGAC 
GCGCTGATCG ACGGCGAGTG GATCGCCGGC ACCAGCCGCT TCGACGTCGC CGACCCCGCC
ACCGGCGCCA GGCTGGCGGA CGTGGCGCTG CTGGGCGCGA GCGAAACCGC AGCCGCGATC
GCCGCCGCCA ACGCCGCCTG GCCGGCCTGG CGCGCCAAGA CCGCCAAGGA GCGCGCCGGC
ATCCTGATGA AGTGGTACCA GCTGCTGATC CAGCACGCCG ACGACCTGGC CCGCATCATG
ACCGCCGAGC AGGGCAAGCC GCTGGCCGAG GCGCGCGGCG AGGTGGTCTA CGGCGCCAGC
TTCATCGAGT GGTTTGCCGA AGAAGCCAAG CGCGTTTACG GCGAGACGAT TCCGAGCACC
GACGCCAACA AGCGCTTCAT CGTGCTCAAG CAGCCGATCG GCGTCTGCGC GGCGATCACG
CCGTGGAATT TCCCGATCGC GATGATCACG CGCAAGGTCG CGCCGGCGCT GGCCGCGGGC
TGCCCGGTGA TCATCAAGCC GGCCGAGCAG ACGCCGCTGT CGGCGCTGGC CTGCGCCGAG
CTGGCCCAGC GCGCCGGCAT GCCGCCGGGC GTGCTCAACA TCCTGACCGG TGATGCCGAG
AGCTCGATCG AGATTGGCGG CGTGCTGTGT GCCTCCGACG TGGTGCGCCA CCTGAGCTTC
ACCGGCTCGA CCGAAGTGGG CCGCATCCTG ATGCGCCAGT GCGCGCCGAC GATCAAGAAG
CTTAGCCTCG AACTCGGCGG CAACGCACCC TTCATCGTCT TCGACGACGC CGACATCGAC
AGCGCGGTCG AGGGCGCGAT GGTCAGCAAG TACCGCAACG CCGGCCAGAC CTGCGTCTGC
GCCAACCGGC TGTACGTGCA GGACAGCGTC TACGACGCGT TCGTCGAGAA GCTCGCCGCC
AAGGCCGCGG CGATCAAGGT CGGCAACGGT TTCGAGGCCG GTGTCAACCA GGGGCCGATG
ATCGACGCCC AGGCGCTGGC CAAGGTCGAA TCCCACGTGG CCGATGCCGT CGCCAAGGGT
GCCCGGGTGG TGGTGGGTGG CAGCCGAGGT GCTGGCGCGC TGGGCCAGCG TTTCTACACG
CCGACGGTGC TGTCGGACGT GACCGCCGAG ATGCTCTGCG CGCGCGAGGA AACCTTCGGC
CCGGTGGCGC CGGTGATGCG TTTCAAGACC GAGGCCGAGG CGATCGCGCT GGCCAATGCC
ACCGAGTTCG GTCTGGCGGC TTATTTCTAC AGCCGCGACA TCGGCCGCAT CTTCCGCGTC
GGCGAGGCGA TCGAGGCCGG CATGGTGGGC GTCAACACCG GGCTGATCTC GGTGGCCGAA
GTGCCGTTCG GCGGCGTCAA GCAGTCGGGG CTCGGCCGCG AGGGCTCGCG GCACGGCATC
GAGGATTACG TCGAGATGAA GTACCTGTGC CTGGGCGACA TCCTGCGCTG A
 
Protein sequence
MDMTTNPLAL LNDPGLLKTD ALIDGEWIAG TSRFDVADPA TGARLADVAL LGASETAAAI 
AAANAAWPAW RAKTAKERAG ILMKWYQLLI QHADDLARIM TAEQGKPLAE ARGEVVYGAS
FIEWFAEEAK RVYGETIPST DANKRFIVLK QPIGVCAAIT PWNFPIAMIT RKVAPALAAG
CPVIIKPAEQ TPLSALACAE LAQRAGMPPG VLNILTGDAE SSIEIGGVLC ASDVVRHLSF
TGSTEVGRIL MRQCAPTIKK LSLELGGNAP FIVFDDADID SAVEGAMVSK YRNAGQTCVC
ANRLYVQDSV YDAFVEKLAA KAAAIKVGNG FEAGVNQGPM IDAQALAKVE SHVADAVAKG
ARVVVGGSRG AGALGQRFYT PTVLSDVTAE MLCAREETFG PVAPVMRFKT EAEAIALANA
TEFGLAAYFY SRDIGRIFRV GEAIEAGMVG VNTGLISVAE VPFGGVKQSG LGREGSRHGI
EDYVEMKYLC LGDILR