Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1823 |
Symbol | |
ID | 4809807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2157057 |
End bp | 2158316 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107237 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001038237 |
Protein GI | 125974327 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03407] urea ABC transporter, urea binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.158541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAGA AGTTTTGCAT ATTTGGCAAA AATACAATTA AGGTTTTGGC AATTGCACTG TCGGCACTGC TGGTATTCGC AGGCTGTTCC GGCAAAGTGG AGGAGCCGGT TGATAACAAA CCCGGAACTG ATACTTCCGC AGAAGACACC ATAAAGGTGG GAATTCTTCA CTCCTTAAGC GGAACCATGG CTATTAGCGA GGTATCCCTC AAAGATGCGG AATTGATGGC AATAGAAGAA ATTAACCAGG CCGGAGGTCT GCTGGGCAAA AAAATTGAAC CGGTGATTGA AGACGGAGCT TCCGATTGGC CTACTTTTGC AGAAAAGGCA AAGAAACTGC TCCAAAATGA CAAGGTTGCA ACCGTTTTCG GATGCTGGAC TTCAGCCAGC CGTAAAGCCG TATTGCCGGT GTTTGAAGAA AATAACGGAC TTTTGTGGTA TCCGGTGCAG TATGAGGGCA TGGAGTCATC ACCAAATATC TTCTATACCG GTGCGGCACC CAATCAGCAG ATTGTTCCCG CAGTCGAATG GCTTTTGGAA AACAAGGGAA AAAGATTTTT CCTCCTTGGC TCCGATTATG TATTTCCCAG AACCGCAAAC AAAATTATCA AAGCTCAGCT AAGCGCCATA GGTGGGGAAC TTATTGCAGA GGAGTATACT CCTTTGGGTC ATACCGATTA CAGTACCATT GTAAATAAAA TTAAAACCGC AAAACCGGAT GTAGTGTTTA ACACCCTGAA CGGGGACAGC AATGTTGCCT TCTTCAAACA GCTCAAGGAT GCGGGAATCA CGTCTGAAGA CATTACCGTT TGTTCTGTAA GTGTTGCAGA AGAAGAAATA AGGGGTATAG GCGCTGAAAA TATAAAAGGT CACCTGGTTT CATGGAACTA TTACCAGACT ACGGATACCC CGGAAAACAA AGAGTTTGTG GAAAAGTACA AATCTAAATA CGGAAGCGAC AGGGTTACCG ATGATCCCAT AGAAGCGGCA TATATAGCAG TTCATTTGTG GGCTGAGGCA GTTAAAAAGG CCGGTTCCTT TGAGGTGGAA AAGGTTAAGG AGGCAGCCAA AGGACTTGAA TTTAAAGCTC CTGAAGGGCT TGTGAAAATT GAAGGAGAGA ACCAGCACCT GTGGAAGCCG GTGAGGATTG GTGAGGTACA GGAAGACGGA CTTATCAAGG AAATCTGGAG TACAAGTGAA GCCGTAAGGC CCGACCCATA CTTAAAAACC TACGACTGGG CAAAAGGCTT AAGCGATTAG
|
Protein sequence | MIKKFCIFGK NTIKVLAIAL SALLVFAGCS GKVEEPVDNK PGTDTSAEDT IKVGILHSLS GTMAISEVSL KDAELMAIEE INQAGGLLGK KIEPVIEDGA SDWPTFAEKA KKLLQNDKVA TVFGCWTSAS RKAVLPVFEE NNGLLWYPVQ YEGMESSPNI FYTGAAPNQQ IVPAVEWLLE NKGKRFFLLG SDYVFPRTAN KIIKAQLSAI GGELIAEEYT PLGHTDYSTI VNKIKTAKPD VVFNTLNGDS NVAFFKQLKD AGITSEDITV CSVSVAEEEI RGIGAENIKG HLVSWNYYQT TDTPENKEFV EKYKSKYGSD RVTDDPIEAA YIAVHLWAEA VKKAGSFEVE KVKEAAKGLE FKAPEGLVKI EGENQHLWKP VRIGEVQEDG LIKEIWSTSE AVRPDPYLKT YDWAKGLSD
|
| |