Gene Ccel_3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3334 
Symbol 
ID7312477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3878597 
End bp3879727 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content36% 
IMG OID643610237 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_002507603 
Protein GI220930694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0347475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTAAAT TTGAAAATAT ATTCAAAAAA TACAAAGATA CAACTGTGTT GAAAAACATT 
TCGCTGGAAG TTGAAAAAGG GCAATTGGTT TCACTTATCG GAGAAAGTGG CTGCGGTAAG
ACAACTACAT TAAAAATGAT TAATCGCCTG ATAAAGCCTT CGTCCGGCAA AATATTTATT
AATGGTAAGG ACATAGAAAA AAGAGATATT ATAAAGCTTA GAAGAAATAT GGGGTATGTA
ATTCAGCAGA CAGGGTTATT TCCACATATG ACAATAAAGG AAAATATTGA GCTAATCCCA
AAGGTCCAGA AAAAGGATTC TGAAGAAATA AGAAAAAAAA CTTATGAATT ATTGGAAATG
GTTGGGCTGG AAGCTGACGA GTTTCTTGAT AGGTACCCTT CTGAGATAAG TGGTGGGCAA
CAGCAGAGAG TTGGGGTAGC AAGAGCATTT GCAACTGACC CGGAGATTAT CCTGATGGAT
GAGCCGTTTT CAGCACTTGA TCCAATTACC AGGATAAGCC TGCAGGATGA GCTTATAAAT
ATACAGGCAA TTTATAAAAA GACAATAGTC TTTGTTACAC ATGACATGGA TGAGGCAATA
AAGATATCGG ATAAGATATG TATAATGAAA GATGGAGAAA TTCTTCAGTA TGATACACCT
GAAAATATAT TGAAAAATCC TCAGAATGAA TTTGTATCAG AGTTTGTAGG TAGAAATAGA
ATCTGGACTT CCCCCGAATT TATAAAGGCA AAGGATATTA TGATTGATAC CCCGGTAACC
TGTCAGAGCA GTACGACGCT TCTTGGGTGT ATTGAAAGAA TGCGTGTGGA AAAGGTAGAT
AGCCTTATGG TAGTTGAAGA AAAAACAAAA AGACTGTTAG GTATAGTGAA TGCAAAGCAA
ATACAAAACC AGAGAGACCG TACAATAAAA GTTGGCGATA TTATGACCAC TAACTTCCTG
AGTGTACTTG AGGACGACTC AATTATAGAT ATTTTAAAAA TTGTAGACGA AAAGCATGTA
TCAGCAATTC CTGTTTTGAA CGAAAGTGAC AGACTTTTAG GCTTGATAAC AAAGAGCAGT
CTTGTTACTA CTCTAAGCCA GCAATATCTT GATTTGGAAA ATCTGGAGTA A
 
Protein sequence
MIKFENIFKK YKDTTVLKNI SLEVEKGQLV SLIGESGCGK TTTLKMINRL IKPSSGKIFI 
NGKDIEKRDI IKLRRNMGYV IQQTGLFPHM TIKENIELIP KVQKKDSEEI RKKTYELLEM
VGLEADEFLD RYPSEISGGQ QQRVGVARAF ATDPEIILMD EPFSALDPIT RISLQDELIN
IQAIYKKTIV FVTHDMDEAI KISDKICIMK DGEILQYDTP ENILKNPQNE FVSEFVGRNR
IWTSPEFIKA KDIMIDTPVT CQSSTTLLGC IERMRVEKVD SLMVVEEKTK RLLGIVNAKQ
IQNQRDRTIK VGDIMTTNFL SVLEDDSIID ILKIVDEKHV SAIPVLNESD RLLGLITKSS
LVTTLSQQYL DLENLE