Gene Cthe_2963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2963 
Symbol 
ID4810851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3481071 
End bp3482087 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content42% 
IMG OID640108385 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001039353 
Protein GI125975443 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAG AATTGCTTAA AATAAATGAT TTAAAAGTTT CCTTCTTCAC ACCGGCCGGA 
GAAGTTAAAG CCGTAAACGG AATCAGCTAC ACCCTTGAAC CGGGAAAAGT CCTGGGAATA
GTCGGTGAGT CGGGCTCCGG CAAAAGTGTG TCTTCCTATT CTATTATGGG ATTAATTGAC
AACCCCGGCA AAATTGTCGG AGGAAGTATT ATATTTGACG GCAAAGATGT TTCCACTATG
ACTAAATCAG AAAGGCAGAA TCTTGCGGGA AATGAGATAG CAATGATATT TCAGGACCCT
ATGACCTGTT TAAATCCCGT TTTTACAATA GGAAACCAAA TTGCGGAATC TTTAATCCAC
AAGTACGGCA GAAAAATTTC AAAAAAGGAA ATAAAAGAAC GTTCGATTGA CTTGTTGAAA
TTGGTTGGCA TAAACGAGCC TGAAAAACGC TTGGCTCAGT ATCCTCATGA ATTTTCAGGA
GGTATGCGCC AAAGGGTAAT GATTGCCATG GCTCTTGCCG GTTCGCCCAA ACTTTTGATC
GCCGATGAGC CGACAACTGC CCTTGATGTT ACAATACAGG CTCAGATTTT AGAGCTTCTC
AAAGATATTC AAAAAAAGAC GGGAATGGCC ATAATCCTCA TAACCCATGA CCTTGGTATA
GTTGCCGACA TGGCTGATGA TATTATCGTT ATGTACGCCG GAAAAATTGT CGAGCAGGGC
TCTGTTTACA GTATATTTAA TAACCCCCGT CATCCGTATA CAAAAGGCTT GCTTCGTTCC
CTGCCCGACC TCAATAAAAA AGGCGAAAAA CTAATTCCTA TTCAGGGAAA TCCTATAGAT
CTGTTAAATC TGCCTCAAGG CTGTGCCTTT GCGCCAAGGT GCGAAAACTG CATGAAGGTC
TGTTTAAAAT ATGCACCAAA AGAGTATTCA ATTGAGGACG GACACACAGT CAGCTGTTGG
CTGTACGATG GCATGGCCAA TAATAGCACG GAGGTAAAGA ATAATGACAA ACATTGA
 
Protein sequence
MSKELLKIND LKVSFFTPAG EVKAVNGISY TLEPGKVLGI VGESGSGKSV SSYSIMGLID 
NPGKIVGGSI IFDGKDVSTM TKSERQNLAG NEIAMIFQDP MTCLNPVFTI GNQIAESLIH
KYGRKISKKE IKERSIDLLK LVGINEPEKR LAQYPHEFSG GMRQRVMIAM ALAGSPKLLI
ADEPTTALDV TIQAQILELL KDIQKKTGMA IILITHDLGI VADMADDIIV MYAGKIVEQG
SVYSIFNNPR HPYTKGLLRS LPDLNKKGEK LIPIQGNPID LLNLPQGCAF APRCENCMKV
CLKYAPKEYS IEDGHTVSCW LYDGMANNST EVKNNDKH