Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2033 |
Symbol | |
ID | 6162261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 2207915 |
End bp | 2210140 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641664802 |
Product | extracellular solute-binding protein |
Protein accession | YP_001791065 |
Protein GI | 171058716 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.240451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATC ACGCGATGGC CTGGATGCAA CGCGTGATCC GCTTCGGCGC TGGCCTGGTG CTGGCAGGCC TCACCTTGCT GGCCGGCTGC GACAACAGTC CGCACGGCCC TGGCTCGGCG GCCTCGAACA CGCTCTACAC CGCGTTCACC GAGCGCTCGC CACGACACCT CGACCCAGTG GCGTCGTACT GGAACCAGGA CACGCCCTAC ACCTACGGCA TCTACGAGCC GCCGTACACC TATCACTACC TCAAGCGCCC GTTCACGCTG ATCCCGAAGG TGGCGACCGA GGTGGCGCCG CCGGTCTATC TGGACAAGAA CGGCCGGCGC CTGGCCGCTG ATGCGCCGGG CGAACAGGTG GCCGAGAGCG TGTACGAGGT GCGCATCCGC CCCGGCATCC GCTACCAGCC GCACCCGGCG TTTGCGCAGG ACGCGCAAGG CCGCTACCTG TATCACCAGC TGAGCGCGGC GCAGGTGGGC GAGCGCACGT CGCCGTGGCA GTTCGAGGTG ACGGGTACGC GCGAACTGGT GGCCGAGGAC TTCGTCTACG GCCTCAAGCG CCACGCCACC ACGCGGGTGA CCACGCCGAT CTTCGGCATC TTTGCCGAGT ACATCGTCGG GCTGAAGGAG TACGGCGCGC TGGTCAAGCA GGAGGACGCC AAGCTGCGCG CCGGGCTCGA TCCGGCCGCA CTCGACAAGC CGTTTCTCGA CTTCCGCCGC TGGCCACTGG CGGGGGTCAG CGCGCCGGAG AAGCACCTGC TGCGCCTGCG CATCCGCGGC AAGTATCCGC AGTGGAAATA CTGGATGGCG ATGCCGTTTG CCGCGCCGAT GCCGTGGGAG GCCGACGCCT TCTACGCCCA GCCGGGCATG GCGCGCAACG GGCTGTCGCT GGACCGCTGG CCGGTCGGCA CCGGGCCGTT CATGATGGCC GAGTTCGAAC AGGATCGCCG CCACGTGCTG GTACGCAACC CCAACTACCG CGGCGACCCG TACCCCTGCG AAGGCATGCC CGGCGACCGC GAGCGCGGCC TGCTCGACGA CTGCGGCAAG CCGACGCCCT TCGTCGACCG GATCGTGTTC CAGATGGAGC GCGAACAGGT GCCGCTGAAA GCCAAGTTCC GCCAGGGTTT CCTGGACGTG CCCGAGATCG AGCGCACCGA CCGCGGCGCC GACTATCTGA TCGACATGGA AGACTCGCCC GAAGCGCGCG CCGAGTACAC CGAGCGCGGC TACCAGCTGC CGCGCGCCGG CGACCTGAGC ATCTGGTTCA TGGGCTTCAA CATGCTCGAC CCGGTGGTCG GCCGCGGCGA CACGCCCGAG CAGCAGGCGC GCAATCGCCG GCTGCGCCAG GCGATCTCGA TCGCGCTCGA CTGGGCCGAC TACAGCAACA TCTTCCCGAA CAAGGGCGGC GTCGAGGCGA TGGGGCCGCT GCCGGCAGGC ATCTTCGGCT CGCGACAAGG TGCGCTCGAC GGCGTCAATC CGGTCACGCA CCGGGTGGTC GACGGCAAGA TCGTGCGGCG GCCGATCGAG GACGCGCGCA AGCTGATGGT CGAGGCCGGC TACCCCGACG GGCGCGACGC CCGCACGGGC CGCCCGCTGG TCATCAACTA CGATTTCTAC CGCACGCTGA CGCCCGAATT CAAGGCCGAG ATCGATTGGA TGAGCCGCCA GTTCGGCCAG CTCGGCATCC AGCTCGAGGT GCGCGCCACC GACAACAACC AGTTCCAGGA CAAGGTGCGC AAGGGCCGGC ATCAGCTGTT CTTCTCGGGC TGGCTGGCCG ATTACCCGGA CGCGGAGAAC TTCCTGTTCC TGCTCTACGG CCCGAACGGC AAGACCCGCT CGGAGGGCGA GAACACCGCC AACTACGACA ACCCGGCCTT CGACCGGCTG TTCCAGCAGC TCAAGGATCT GGACGACGGG CCGCCCAAGC AGGCGCTGAT CGACCGCATG GTGGCGCTGC TGCAGGACGA CGCGCCGTGG ATCTGGGGCT ACATCCCCGA CGCCACCGGC GCCTTCCAGC CCTGGGTGCG CAACGCGGTG GTGCCGGTGC TGATCAAGGA TCACCTGCGT TTCTACCGCG TCGACACGGC GCTGCGCACG CGGCTGCAGC GGGCCTGGAA CCGGCCGGTC GGCTGGCCGC TGATGCTCGC GGGCGTGGCG CTGCTGGCGC TGGTCTGGTT CGGCTGGCGC AGCTGGCGAG CGCGTGAACG AGCGACGGCG CGATGA
|
Protein sequence | MADHAMAWMQ RVIRFGAGLV LAGLTLLAGC DNSPHGPGSA ASNTLYTAFT ERSPRHLDPV ASYWNQDTPY TYGIYEPPYT YHYLKRPFTL IPKVATEVAP PVYLDKNGRR LAADAPGEQV AESVYEVRIR PGIRYQPHPA FAQDAQGRYL YHQLSAAQVG ERTSPWQFEV TGTRELVAED FVYGLKRHAT TRVTTPIFGI FAEYIVGLKE YGALVKQEDA KLRAGLDPAA LDKPFLDFRR WPLAGVSAPE KHLLRLRIRG KYPQWKYWMA MPFAAPMPWE ADAFYAQPGM ARNGLSLDRW PVGTGPFMMA EFEQDRRHVL VRNPNYRGDP YPCEGMPGDR ERGLLDDCGK PTPFVDRIVF QMEREQVPLK AKFRQGFLDV PEIERTDRGA DYLIDMEDSP EARAEYTERG YQLPRAGDLS IWFMGFNMLD PVVGRGDTPE QQARNRRLRQ AISIALDWAD YSNIFPNKGG VEAMGPLPAG IFGSRQGALD GVNPVTHRVV DGKIVRRPIE DARKLMVEAG YPDGRDARTG RPLVINYDFY RTLTPEFKAE IDWMSRQFGQ LGIQLEVRAT DNNQFQDKVR KGRHQLFFSG WLADYPDAEN FLFLLYGPNG KTRSEGENTA NYDNPAFDRL FQQLKDLDDG PPKQALIDRM VALLQDDAPW IWGYIPDATG AFQPWVRNAV VPVLIKDHLR FYRVDTALRT RLQRAWNRPV GWPLMLAGVA LLALVWFGWR SWRARERATA R
|
| |