Gene Lcho_2146 details


Gene Information

Locus tag: Lcho_2146
Symbol:
ID: 6163888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 2346240
End bp: 2347457
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 71%
IMG OID: 641664914
Product: aldo/keto reductase
Protein accession: YP_001791177
Protein GI: 171058828
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.000567174
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCATTC GCCCGACACC CATGAACGCC GACCTTGAAG GTCAAAACCA GCAACGCCGC 
CACCTGATGC TCACCGCCGC CACGCTGGGC GTGGCTCCGT GGCTTCTCTC CGCCTGCGCC
AGTACCGCCG GCGGTGCAGA GGCAGGCCGC TCGCCGGGCC GCCCACAAGC CGCCGGGGCG
CGTCGCCGGC TCGGCCCGCT CGAAGTCTTT CCCGTAGGGC TGGGCTGCCA ATGGCGACCG
GGCGCCACGC CCGGCGTGGT GGTCGATTCG TACAGCAGCC GCTTTGACCG CCCGGCCGCC
ATCCGCCTCA TCCGCCAGGC CGTGGACCAG GGTGTCACGT TGATCGACAC GGCCGAAGCC
TACGGCCCCT TCCTGTCGGA AGACATCGTC GGCGAGGCGC TGCAGGGCAT CCGCGACAAG
GTGGTGCTGG AGACCAAGTT CGGCTTCGAC ATCGATCAGG TCACAGGGCA ACGCCTGCCC
GGTGGCCGCA ACAGCCGGCC CGAGCACATC CGCCGGGTGG TCGACGCCCA GCTGCGGCGC
CTGCGCACCG ACCGCATCGA CGTGCTGATC CAGCACCGGG TGGACCCGAA CGTGCCCATC
GAGGACGTGG CCGGCACGGT CAAGGACCTG ATCGGCGCCG GCAAGGTGCG GCACTTCGGC
CTGTCCGAGC CCGGCCTGCA GAGCGTGCGC CGCGCCCATG CGGTGCAACC GCTGGCGGTG
ATCCAGAACG AATACTCGAT GCTGTGGCGT GGCCCCGAGG CCCAGGTGCT GCCGCTGTGC
GAGGAACTGG GCATCGGCTT CGTCTGCTGG AGTCCGCTGG GCATGGGTTT CCTGGCCGGC
GGCGTGCGGG CGGATTCGCG CTTCGCGACC GCGCCGATCA CCGACTTCCG CGCCATCTCG
CCGCGCTTCG CCCCCGAGGT GCTGCCCGCC AACATGGCGC TGGCCGACCT GGTGCGCAAC
TGGGCGCAAC GCAAGAACGC CACGCCCGGC CAGTTGTCGC TGGCCTGGCT GCTGGCGCAA
AAGCCCTGGA TCGTGCCGAT TCCGGGCACC ACCAACGCGG CCCACATGAC CGAGAACCTG
GGGGCGGCCT CGATCTCGTT CACCGCGCAA GAGCTGCAGC AGCTCAACAC CGCGGTGGCC
GCCATCCGCA TCCAGGGGGA TCGCCTGCCG CCGGCCGTGG CGGTGATGTC GGGCGTCGAG
GCTGCGCCCA AGCGCTGA
 
Protein sequence
MSIRPTPMNA DLEGQNQQRR HLMLTAATLG VAPWLLSACA STAGGAEAGR SPGRPQAAGA 
RRRLGPLEVF PVGLGCQWRP GATPGVVVDS YSSRFDRPAA IRLIRQAVDQ GVTLIDTAEA
YGPFLSEDIV GEALQGIRDK VVLETKFGFD IDQVTGQRLP GGRNSRPEHI RRVVDAQLRR
LRTDRIDVLI QHRVDPNVPI EDVAGTVKDL IGAGKVRHFG LSEPGLQSVR RAHAVQPLAV
IQNEYSMLWR GPEAQVLPLC EELGIGFVCW SPLGMGFLAG GVRADSRFAT APITDFRAIS
PRFAPEVLPA NMALADLVRN WAQRKNATPG QLSLAWLLAQ KPWIVPIPGT TNAAHMTENL
GAASISFTAQ ELQQLNTAVA AIRIQGDRLP PAVAVMSGVE AAPKR
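
Consistency check

The coordinates and lengths reported for this gene can be cross-checked against one another. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon (the gene sequence above ends in TGA). The function names gene_length, protein_length and gc_percent are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2346240
END_BP = 2347457


def gene_length(start: int, end: int) -> int:
    """Length in bp of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, stop_codons: int = 1) -> int:
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - stop_codons


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1218, matching "Gene Length"
print(protein_length(length_bp))  # 405, matching "Protein Length"
```

Passing the gene sequence listed above to gc_percent gives a value that can be compared with the reported GC content; the 10-base spacing and line breaks in the listing do not need to be removed first.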