Gene Lcho_3329 details


Gene Information

Locus tag: Lcho_3329
Symbol:
ID: 6163716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3705631
End bp: 3706902
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 66%
IMG OID: 641666104
Product: extracellular solute-binding protein
Protein accession: YP_001792352
Protein GI: 171060003
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
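
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3705631, 3706902)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both values agree with the Gene Length and Protein Length fields of this record.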


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTCA ATTTCAAGCA CATCGCCGCT GGGGTCGCCC TGGCTTTCGG TGGCATCGCC 
GGCGCCCAGG CCGCCACGAT CACCATCTCG TGCGGCTCCA ACGCTGCCGA CGTCGAGTTC
TGCGGCAAGT ACGCCGAGGA CTGGGGCAAG GCCAACGGCC ACACCGTCAA GATGTATTCG
TCCCCGGCCA GCACGACCGA CAACCTGGCC CTGCTGCGCC AGCAGTTCGC GGCCAAGTCG
TCCGACCTCG ACGTGATCAT GATCGACGTG GTCTGGCCCG GCGTGATCAA GGATCACCTG
GTCGACCTGA AGAAGTACAG CAAGGGCGCC GAGGCCAAGC ACTTCCCGGC CATCGTGGCC
AACAACACGG TCGACGGCAA GCTGCTGGGC ATGCCCTGGT TCACCGACGC CGGCCTGCTG
TTCTATCGCA AGGACCTGGT CGAGAAGTAC GGCCTGAAGG CGCCGGGCAC CTGGGAAGAG
ATGGCCACCG CGGCCAAGAA GATCCAGGAC GGCGAGCGCG CGGCCGGCAA GGCCGACTTC
CAGGGTTTCG TCTTCCAGGC CAAGGCCTAT GAAGGCCTGA CCTGCGACGC GCTCGAGTGG
GTGGCGAGCT TCGGCGGCGG CGAGATCGTC GACAAGGCCG GCAACATCAC CATCAACAAC
CCCGGTGCGG CCAAGGCGCT CGACACCGCG GCTTCGTGGA TCGGCACCAT CGCCCCGGCC
GGCGTGCTGA ACTACGGCGA GGAAGACTCG CGCGGCGTGT GGCAGAACGG CAATGCCGCC
TTCATGCGCA ACTGGCCTTA CGCCTGGTCG CTGGGTCAGG CCGCCGACAG CCCGATCAAG
GGCAAGATCG GCGTCGCAGC CCTGCCGGCC GGTTCGGGCG CCGGTGCCAA GAAGGCGGCC
ACGCTGGGCG GCTGGCAGCT GGCGGTGTCG AAGTACTCCA AGAACGTCGA CGCGGCCGCC
GCGCTGGCCA TGTACATGAC CAGCCCGGAG ATCCAGAAGA AGCGCGCCGT CGGTGGTTCG
TACAACCCGA CCATCCCCGA CCTCTACAAG GACGCCGACA TCGCCAAGGC GAACCCGTTC
ATGGTCGAGC TGCTCGACGT CTTCACCAAC GCCGTGGCCC GTCCGGCCAC CGCCACCGGC
CTGAAGTACC CGGAAGTCTC CAACGCGTTC TGGGACGCCA CCCACGAGGT GCTCGAGAAG
AAGACCACCG GCGCGGCCGC GGTCAAGAAG CTCGAAGGCA AGCTCAAGCA GATCAAGCGC
ACCAAGTGGT GA
 
Protein sequence
MSFNFKHIAA GVALAFGGIA GAQAATITIS CGSNAADVEF CGKYAEDWGK ANGHTVKMYS 
SPASTTDNLA LLRQQFAAKS SDLDVIMIDV VWPGVIKDHL VDLKKYSKGA EAKHFPAIVA
NNTVDGKLLG MPWFTDAGLL FYRKDLVEKY GLKAPGTWEE MATAAKKIQD GERAAGKADF
QGFVFQAKAY EGLTCDALEW VASFGGGEIV DKAGNITINN PGAAKALDTA ASWIGTIAPA
GVLNYGEEDS RGVWQNGNAA FMRNWPYAWS LGQAADSPIK GKIGVAALPA GSGAGAKKAA
TLGGWQLAVS KYSKNVDAAA ALAMYMTSPE IQKKRAVGGS YNPTIPDLYK DADIAKANPF
MVELLDVFTN AVARPATATG LKYPEVSNAF WDATHEVLEK KTTGAAAVKK LEGKLKQIKR
TKW
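
The protein sequence can be derived from the gene sequence with translation table 11, and the GC content field can be recomputed from the nucleotides. The following is a minimal sketch, not the method used by the source database. It assumes the sequence is read in frame from its first base, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence from this record.
first_line = "ATGTCGTTCA ATTTCAAGCA CATCGCCGCT GGGGTCGCCC TGGCTTTCGG TGGCATCGCC"
last_line = "ACCAAGTGGT GA"
print(translate(first_line))  # MSFNFKHIAAGVALAFGGIA
print(translate(last_line))   # TKW
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields of this record.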