Gene Lcho_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2033 
Symbol 
ID6162261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2207915 
End bp2210140 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content69% 
IMG OID641664802 
Productextracellular solute-binding protein 
Protein accessionYP_001791065 
Protein GI171058716 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.240451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATC ACGCGATGGC CTGGATGCAA CGCGTGATCC GCTTCGGCGC TGGCCTGGTG 
CTGGCAGGCC TCACCTTGCT GGCCGGCTGC GACAACAGTC CGCACGGCCC TGGCTCGGCG
GCCTCGAACA CGCTCTACAC CGCGTTCACC GAGCGCTCGC CACGACACCT CGACCCAGTG
GCGTCGTACT GGAACCAGGA CACGCCCTAC ACCTACGGCA TCTACGAGCC GCCGTACACC
TATCACTACC TCAAGCGCCC GTTCACGCTG ATCCCGAAGG TGGCGACCGA GGTGGCGCCG
CCGGTCTATC TGGACAAGAA CGGCCGGCGC CTGGCCGCTG ATGCGCCGGG CGAACAGGTG
GCCGAGAGCG TGTACGAGGT GCGCATCCGC CCCGGCATCC GCTACCAGCC GCACCCGGCG
TTTGCGCAGG ACGCGCAAGG CCGCTACCTG TATCACCAGC TGAGCGCGGC GCAGGTGGGC
GAGCGCACGT CGCCGTGGCA GTTCGAGGTG ACGGGTACGC GCGAACTGGT GGCCGAGGAC
TTCGTCTACG GCCTCAAGCG CCACGCCACC ACGCGGGTGA CCACGCCGAT CTTCGGCATC
TTTGCCGAGT ACATCGTCGG GCTGAAGGAG TACGGCGCGC TGGTCAAGCA GGAGGACGCC
AAGCTGCGCG CCGGGCTCGA TCCGGCCGCA CTCGACAAGC CGTTTCTCGA CTTCCGCCGC
TGGCCACTGG CGGGGGTCAG CGCGCCGGAG AAGCACCTGC TGCGCCTGCG CATCCGCGGC
AAGTATCCGC AGTGGAAATA CTGGATGGCG ATGCCGTTTG CCGCGCCGAT GCCGTGGGAG
GCCGACGCCT TCTACGCCCA GCCGGGCATG GCGCGCAACG GGCTGTCGCT GGACCGCTGG
CCGGTCGGCA CCGGGCCGTT CATGATGGCC GAGTTCGAAC AGGATCGCCG CCACGTGCTG
GTACGCAACC CCAACTACCG CGGCGACCCG TACCCCTGCG AAGGCATGCC CGGCGACCGC
GAGCGCGGCC TGCTCGACGA CTGCGGCAAG CCGACGCCCT TCGTCGACCG GATCGTGTTC
CAGATGGAGC GCGAACAGGT GCCGCTGAAA GCCAAGTTCC GCCAGGGTTT CCTGGACGTG
CCCGAGATCG AGCGCACCGA CCGCGGCGCC GACTATCTGA TCGACATGGA AGACTCGCCC
GAAGCGCGCG CCGAGTACAC CGAGCGCGGC TACCAGCTGC CGCGCGCCGG CGACCTGAGC
ATCTGGTTCA TGGGCTTCAA CATGCTCGAC CCGGTGGTCG GCCGCGGCGA CACGCCCGAG
CAGCAGGCGC GCAATCGCCG GCTGCGCCAG GCGATCTCGA TCGCGCTCGA CTGGGCCGAC
TACAGCAACA TCTTCCCGAA CAAGGGCGGC GTCGAGGCGA TGGGGCCGCT GCCGGCAGGC
ATCTTCGGCT CGCGACAAGG TGCGCTCGAC GGCGTCAATC CGGTCACGCA CCGGGTGGTC
GACGGCAAGA TCGTGCGGCG GCCGATCGAG GACGCGCGCA AGCTGATGGT CGAGGCCGGC
TACCCCGACG GGCGCGACGC CCGCACGGGC CGCCCGCTGG TCATCAACTA CGATTTCTAC
CGCACGCTGA CGCCCGAATT CAAGGCCGAG ATCGATTGGA TGAGCCGCCA GTTCGGCCAG
CTCGGCATCC AGCTCGAGGT GCGCGCCACC GACAACAACC AGTTCCAGGA CAAGGTGCGC
AAGGGCCGGC ATCAGCTGTT CTTCTCGGGC TGGCTGGCCG ATTACCCGGA CGCGGAGAAC
TTCCTGTTCC TGCTCTACGG CCCGAACGGC AAGACCCGCT CGGAGGGCGA GAACACCGCC
AACTACGACA ACCCGGCCTT CGACCGGCTG TTCCAGCAGC TCAAGGATCT GGACGACGGG
CCGCCCAAGC AGGCGCTGAT CGACCGCATG GTGGCGCTGC TGCAGGACGA CGCGCCGTGG
ATCTGGGGCT ACATCCCCGA CGCCACCGGC GCCTTCCAGC CCTGGGTGCG CAACGCGGTG
GTGCCGGTGC TGATCAAGGA TCACCTGCGT TTCTACCGCG TCGACACGGC GCTGCGCACG
CGGCTGCAGC GGGCCTGGAA CCGGCCGGTC GGCTGGCCGC TGATGCTCGC GGGCGTGGCG
CTGCTGGCGC TGGTCTGGTT CGGCTGGCGC AGCTGGCGAG CGCGTGAACG AGCGACGGCG
CGATGA
 
Protein sequence
MADHAMAWMQ RVIRFGAGLV LAGLTLLAGC DNSPHGPGSA ASNTLYTAFT ERSPRHLDPV 
ASYWNQDTPY TYGIYEPPYT YHYLKRPFTL IPKVATEVAP PVYLDKNGRR LAADAPGEQV
AESVYEVRIR PGIRYQPHPA FAQDAQGRYL YHQLSAAQVG ERTSPWQFEV TGTRELVAED
FVYGLKRHAT TRVTTPIFGI FAEYIVGLKE YGALVKQEDA KLRAGLDPAA LDKPFLDFRR
WPLAGVSAPE KHLLRLRIRG KYPQWKYWMA MPFAAPMPWE ADAFYAQPGM ARNGLSLDRW
PVGTGPFMMA EFEQDRRHVL VRNPNYRGDP YPCEGMPGDR ERGLLDDCGK PTPFVDRIVF
QMEREQVPLK AKFRQGFLDV PEIERTDRGA DYLIDMEDSP EARAEYTERG YQLPRAGDLS
IWFMGFNMLD PVVGRGDTPE QQARNRRLRQ AISIALDWAD YSNIFPNKGG VEAMGPLPAG
IFGSRQGALD GVNPVTHRVV DGKIVRRPIE DARKLMVEAG YPDGRDARTG RPLVINYDFY
RTLTPEFKAE IDWMSRQFGQ LGIQLEVRAT DNNQFQDKVR KGRHQLFFSG WLADYPDAEN
FLFLLYGPNG KTRSEGENTA NYDNPAFDRL FQQLKDLDDG PPKQALIDRM VALLQDDAPW
IWGYIPDATG AFQPWVRNAV VPVLIKDHLR FYRVDTALRT RLQRAWNRPV GWPLMLAGVA
LLALVWFGWR SWRARERATA R