Gene Lcho_3299 details

Gene Information

Locus tag: Lcho_3299
Symbol:
ID: 6162210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3666209
End bp: 3667474
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 68%
IMG OID: 641666074
Product: extracellular solute-binding protein
Protein accession: YP_001792322
Protein GI: 171059973
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCT CGTCCCCCCT TCGCCGCAGC CTCGTCGTCC TGGGCCTGGC TGCCCTCGGT 
GCCAGCGCCG CCCAAGCCCA GACCTCGCTG TCGATGTGGT ATCACGGTGC CGGCAACCCG
AAGGAGAAGG AGCTGATGAC GGGGATCATC AGCGACTTCA ACAAGAGCCA GAAGGAGTGG
AAGGTCGAGC TGCAGCAGTT CCCACAGGAG GCCTACAACA CCTCGGTGGT GGCCGCCGCG
GTGGCCGGCA AGCTGCCTGA CATCCTCGAC GTCGACGGCC CGGTGATGCC CAACTGGGCC
TGGTCGAAGT ACCTGCAGCC GCTGGCCCTG CCGGCCGGCG CGACCGACAA GTTCCTGCCC
GGCACGATCG GCACCTACAA CGGCAAGGTC TACTCGGTCG GCCTGTGGGA CGCGGCCTGT
GCGATGTTTG CCCGCAAGTC GGTGCTGCAG GCCCACAACA TCCGCATCCC GACGCTCGAC
AAGCCCTGGA CCAAGGCCGA GTTCGACGCC GCACTCGTGA CGCTGCAAAA GAGCGGCAAG
TTCCAGTACC CGATCGACCT GGGCCTGGCC TGGAAGGGCG AGTGGTACTC GTACGCCTTC
GGCCCCTTCC TGCAGAGCCA CGGCGGTGAC CTGCTGAACG CGGCTGCGCC CAAGGCCAAC
GGCACGCTCA ACGGCCGTGC CGGCGTCGAG TTCGGCACCT GGTGGCAGAG CCTGTTCACG
CGCAAGCTGA CCCCGGGCAC CTCGCAGAGC GGCGCCGACC GCGAGACCGG CTTCCTCGAC
GGCAAGTACG CGCTGCAGTG GAACGGCAAC TGGGCCGCGC TGCCGGCGCT GAAGAAGTTC
GGCGACGACC TGGTCTTCCT GCCGGCGCCC GACTTCGGCA AGGGCCCGAA GATCGGCGCC
GCGTCGTGGC AGTTCGGCGT CTCGGCCACC AGCAAGAACG CCAAGGGCGC GAACGCCTTC
ATCGCCTTTG CGCTCAAGGA CAAGTACCTG GCGGCCTTCT CCGACGGTAT CGGCCTGATC
CCGTCGACCC CGGCGGCCGC GGCGCTGACG CAGAACTACA AGAAGGGCGG CCCGCTGGAG
GTGTTCTTCG CGCTGTCGGC CAAGCAGGCC ACGCTGCGCG CGTCGACGCC GGGTTATGCC
GGCGCGTCGG GCGAGTTCGA GAAGGCGCTG GCCGACATCG CCAACGGCGG CAAGGTCGCC
GATGCACTCG ACAACGCCGC CGACGCGATC GACGCCGACC TGAAGAAGAA CGGCAACTAC
CGCTGA
 
Protein sequence
MTVSSPLRRS LVVLGLAALG ASAAQAQTSL SMWYHGAGNP KEKELMTGII SDFNKSQKEW 
KVELQQFPQE AYNTSVVAAA VAGKLPDILD VDGPVMPNWA WSKYLQPLAL PAGATDKFLP
GTIGTYNGKV YSVGLWDAAC AMFARKSVLQ AHNIRIPTLD KPWTKAEFDA ALVTLQKSGK
FQYPIDLGLA WKGEWYSYAF GPFLQSHGGD LLNAAAPKAN GTLNGRAGVE FGTWWQSLFT
RKLTPGTSQS GADRETGFLD GKYALQWNGN WAALPALKKF GDDLVFLPAP DFGKGPKIGA
ASWQFGVSAT SKNAKGANAF IAFALKDKYL AAFSDGIGLI PSTPAAAALT QNYKKGGPLE
VFFALSAKQA TLRASTPGYA GASGEFEKAL ADIANGGKVA DALDNAADAI DADLKKNGNY
R
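
Consistency check (illustrative)

The coordinates, gene length, and protein length listed in this record are related by simple arithmetic. The sketch below is one way to check that relationship. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3666209
end_bp = 3667474
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == gene_length_bp)
print("protein length matches record:", computed_protein_length == protein_length_aa)
```

Under those assumptions both comparisons hold for the values in this record: the span covers 1266 bp, which is 422 codons, or 421 residues once the stop codon is excluded.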