Gene Lcho_3239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3239 
Symbol 
ID6161672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3591795 
End bp3593219 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content68% 
IMG OID641666013 
Productglycoside hydrolase family protein 
Protein accessionYP_001792262 
Protein GI171059913 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGATG CCTCCCCGCG CCGACTGAAG ATCACCCTGA TCGGTGCCGG CAGCACCGTC 
TTCACCCGCA ACCTGCTGGG TGACATGCTC AGCCATCCCG AGCTCGCGGG CGCCGAGATC
GCGCTGCACG ACATCGACGA ACACCGCCTG CGGCTGTCCG AAAAGGTGGC CTACCGCATC
GCCGACGCGG TGGGCGCCAA GCCGGTGATC ACCGCCAGCA CCGACCGCCG CCGCGCGCTC
GACGGCGCGC GCTTCGTGCT CAACACCATC CAGGTCGGCG GCTACAAGCC GGCCACCGTG
ACCGACTTCG ACATCCCCAA GAAATACGGC CTCGAACAGA CCATCGGCGA CACGCTGGGC
ATCGGCGGCA TCATGCGCGG CCTGCGCACG ATCCCCGTGC AGTTGGCGAT GCTGCGCGAC
ATGGACGAGC TGTGCGCGCC CGGCGCCGTG CACCTCAACT ACGTCAACCC GATGGCGATG
ATCACGTGGG CGCTCAACCG CGCCTCGACG CGCGTGCCCA CCGTCGGCCT GTGCCACAGC
GTGCAGCACA CCGCGCAGGA GCTGGCCAAC GACTTGGGGC TGCCGGTCGA CGAGATCCAG
TACCACTGCG CCGGCATCAA CCACATGTCG TTCTACCTGC GCTTCGAGCG CAACGGCGAG
GACCTGTACC CCAAGCTGCG CGACATCCAG CGCGAGGGCC GCATGCCGGC CTGGAACCGC
GTGCGCTACG AGATGCTGCG TCAGCTGGGG CACTTTCCGA CCGAGTCGAG CGAGCACTTT
GCCGAGTACG TGCCGTGGTT CATCAAGCAG GGCCGCGAGG AGCTGCTGCG CAAGTTCAAC
ATCCCGCTCG ACGAGTACCC GGGCCGCTGC CAGGTGTTCG AACACGCCTG GCCCTACATC
GAGCGTGAGC TCGAAGTGCC CGGCTCGCAG GATCCGGCCG CGCTGCTGGC GCAGCTGCGC
GCCGCCGACA TCCACGTCAT GCAGCGCGAG ATCGAAGGCG CGCCGCAGAT GCTCGAAGGC
CTGCGCTCGG TCAAGCGCAG CGTCGAATAC GGCAGCGCGA TCATCCATTC GATCGTGACC
GGCCAGCCGC GCGTGATCAA CGGCAACGTG CTCAACCACC AGCTGATCGA CAACCTGCCG
CAAGGCTGTG CGATCGAGGT GCCGTGCCTG GTCGACCACA ACGGCGTGCA GCCCACGCGC
GTGGGCAAAC TGCCCGTGCA CTTGGCGGCG CTGATGCGCA CCAACGTCAA CGTGCAGGAG
ATGGTGGTGG AGTCGGTGCT CGAACAGCGG CGCGACCACG TCTACCATGC GGCGATGCTC
GACCCGCACA CCGCCGCCAC GCTCGATCTG GATCAGATCC GCGCGATGGT CGACGAGCTG
CTGGCGGCGC ACCGGCCCTT CCTGCCCGAG TACCTGCACG GCTGA
 
Protein sequence
MRDASPRRLK ITLIGAGSTV FTRNLLGDML SHPELAGAEI ALHDIDEHRL RLSEKVAYRI 
ADAVGAKPVI TASTDRRRAL DGARFVLNTI QVGGYKPATV TDFDIPKKYG LEQTIGDTLG
IGGIMRGLRT IPVQLAMLRD MDELCAPGAV HLNYVNPMAM ITWALNRAST RVPTVGLCHS
VQHTAQELAN DLGLPVDEIQ YHCAGINHMS FYLRFERNGE DLYPKLRDIQ REGRMPAWNR
VRYEMLRQLG HFPTESSEHF AEYVPWFIKQ GREELLRKFN IPLDEYPGRC QVFEHAWPYI
ERELEVPGSQ DPAALLAQLR AADIHVMQRE IEGAPQMLEG LRSVKRSVEY GSAIIHSIVT
GQPRVINGNV LNHQLIDNLP QGCAIEVPCL VDHNGVQPTR VGKLPVHLAA LMRTNVNVQE
MVVESVLEQR RDHVYHAAML DPHTAATLDL DQIRAMVDEL LAAHRPFLPE YLHG