Gene Information |
Locus tag | Cthe_2964 |
Symbol | |
ID | 4810852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3482158 |
End bp | 3483243 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108386 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001039354 |
Protein GI | 125975444 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
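The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table; names are hypothetical.
start_bp = 3482158
end_bp = 3483243

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.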
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATA TGATTAAATC GGAAATTTTA GACGAGGAAA TTTTAGACGA GAAAGTTTTA GACGAGGAAG TTTTAGACGA GGAAGTTTTA GACGAGGAAG TTTTAGACGA GGAAGTTTTA GACGAAAAGC TTTTTCTGCC TGCATCGGAC AGCGAAAAGG CAACAGCCGA AGTCATGCGT CCGTCGGTAG GATATTGGAA AGATGCCTGG AGAAGGCTTA AAGCAAATAA AGTCGCCATG GGTTCTCTGG TGGTTATACT GCTGGTCGTT CTGGCTGCAA TAATCGGGCC CATGCTCTCA CCTTACGAAT ATGATCAGAT AAACAAAGGC AGTGAAAATC TGCCTCCAAA CGCACAGCAT ATATTCGGTA CTGACAGTCT GGGAAGGGAT TTGTTTACCA GAACAATGAT AGGTGCAAGA ATCTCTCTTT CCGTCGGTAT TGTTGCAGCC ATCATGATAT CAATAATCGG TATTCTGTAC GGTGCAATCT CGGGATATTT CGGCGGCTGG GTTGATATTG TCATGATGAG AATCATAGAC ATTGTTTATT CAGTTCCGAC CATACTTATA GTTATTTTGC TGCAGGTTGC CTTAAAGACA CCGATAGACA ATTTTTTAAA TTCCGCCAGT GCTCCCAAAT TTCTAAAAAA CCTTGGTGTG GGGCTAATAA GTATATTCTT CGTTCTGGCC CTTCTTTACT GGGTGGATAT GGCAAGGATA GTGCGCGGTC AGATCCTGGC ACTAAAGGAG CAGGAGTTTG TTTTAGCTGC CAAGGTGCTT GGCGCAAACA ACAGAAGAAT TATTTTCAAG CATCTTATAC CAAACTGCAT TGGTCAGATT ATAGTGGTAG CCACCTTAAA AATTCCTGAG GCGATTTTCG TGGAATCTTT CCTCAGCTTC ATAGGGCTGG GTGTTTCAAT ACCCATGGCA TCCCTTGGTT CTCTGGCACA GACTGCCCTG AAAGGTATAT ATTCATACCC GTACATGCTG TTTTTCCCTG CGGCTACCAT CAGTATTATC ATTCTGTCCA TAAATCTTTT TGGTGACGGT CTGAGAGATG CTCTGGACCC AAGGATGAAA AAGTAG
|
Protein sequence | MSDMIKSEIL DEEILDEKVL DEEVLDEEVL DEEVLDEEVL DEKLFLPASD SEKATAEVMR PSVGYWKDAW RRLKANKVAM GSLVVILLVV LAAIIGPMLS PYEYDQINKG SENLPPNAQH IFGTDSLGRD LFTRTMIGAR ISLSVGIVAA IMISIIGILY GAISGYFGGW VDIVMMRIID IVYSVPTILI VILLQVALKT PIDNFLNSAS APKFLKNLGV GLISIFFVLA LLYWVDMARI VRGQILALKE QEFVLAAKVL GANNRRIIFK HLIPNCIGQI IVVATLKIPE AIFVESFLSF IGLGVSIPMA SLGSLAQTAL KGIYSYPYML FFPAATISII ILSINLFGDG LRDALDPRMK K
|
| |
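The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative sketch, not the database's own pipeline: the function names `translate` and `gc_percent` are hypothetical, and the codon table is the standard amino acid assignment that translation table 11 shares with table 1. Table 11 differs in its set of permitted start codons, which this sketch does not handle; that does not matter for this gene because it begins with ATG. The gene sequence is listed in coding orientation (it starts with ATG and ends with TAG), so no reverse complement is applied even though the gene lies on the minus strand.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCAGATA TGATTAAATC GGAAATTTTA"))
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_percent` on the same input can be compared with the GC content field.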