Gene Information |
Locus tag | Cthe_2963 |
Symbol | |
ID | 4810851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3481071 |
End bp | 3482087 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108385 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001039353 |
Protein GI | 125975443 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
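The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3481071
END_BP = 3482087
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the assumptions stated above.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3482087 - 3481071 + 1 gives 1017 bp, and 1017 / 3 - 1 gives 338 aa, which agrees with the listed values.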
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AATTGCTTAA AATAAATGAT TTAAAAGTTT CCTTCTTCAC ACCGGCCGGA GAAGTTAAAG CCGTAAACGG AATCAGCTAC ACCCTTGAAC CGGGAAAAGT CCTGGGAATA GTCGGTGAGT CGGGCTCCGG CAAAAGTGTG TCTTCCTATT CTATTATGGG ATTAATTGAC AACCCCGGCA AAATTGTCGG AGGAAGTATT ATATTTGACG GCAAAGATGT TTCCACTATG ACTAAATCAG AAAGGCAGAA TCTTGCGGGA AATGAGATAG CAATGATATT TCAGGACCCT ATGACCTGTT TAAATCCCGT TTTTACAATA GGAAACCAAA TTGCGGAATC TTTAATCCAC AAGTACGGCA GAAAAATTTC AAAAAAGGAA ATAAAAGAAC GTTCGATTGA CTTGTTGAAA TTGGTTGGCA TAAACGAGCC TGAAAAACGC TTGGCTCAGT ATCCTCATGA ATTTTCAGGA GGTATGCGCC AAAGGGTAAT GATTGCCATG GCTCTTGCCG GTTCGCCCAA ACTTTTGATC GCCGATGAGC CGACAACTGC CCTTGATGTT ACAATACAGG CTCAGATTTT AGAGCTTCTC AAAGATATTC AAAAAAAGAC GGGAATGGCC ATAATCCTCA TAACCCATGA CCTTGGTATA GTTGCCGACA TGGCTGATGA TATTATCGTT ATGTACGCCG GAAAAATTGT CGAGCAGGGC TCTGTTTACA GTATATTTAA TAACCCCCGT CATCCGTATA CAAAAGGCTT GCTTCGTTCC CTGCCCGACC TCAATAAAAA AGGCGAAAAA CTAATTCCTA TTCAGGGAAA TCCTATAGAT CTGTTAAATC TGCCTCAAGG CTGTGCCTTT GCGCCAAGGT GCGAAAACTG CATGAAGGTC TGTTTAAAAT ATGCACCAAA AGAGTATTCA ATTGAGGACG GACACACAGT CAGCTGTTGG CTGTACGATG GCATGGCCAA TAATAGCACG GAGGTAAAGA ATAATGACAA ACATTGA
|
Protein sequence | MSKELLKIND LKVSFFTPAG EVKAVNGISY TLEPGKVLGI VGESGSGKSV SSYSIMGLID NPGKIVGGSI IFDGKDVSTM TKSERQNLAG NEIAMIFQDP MTCLNPVFTI GNQIAESLIH KYGRKISKKE IKERSIDLLK LVGINEPEKR LAQYPHEFSG GMRQRVMIAM ALAGSPKLLI ADEPTTALDV TIQAQILELL KDIQKKTGMA IILITHDLGI VADMADDIIV MYAGKIVEQG SVYSIFNNPR HPYTKGLLRS LPDLNKKGEK LIPIQGNPID LLNLPQGCAF APRCENCMKV CLKYAPKEYS IEDGHTVSCW LYDGMANNST EVKNNDKH
|
| |
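The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation (it begins with ATG although the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The function names `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI "TCAG" order; amino-acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    first_30 = "ATGAGTAAAG AATTGCTTAA AATAAATGAT"
    print(translate(first_30))  # MSKELLKIND
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the reported GC content.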