Gene Cthe_1363 details


Gene Information

Locus tag: Cthe_1363
Symbol:
ID: 4809358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1655927
End bp: 1657321
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 37%
IMG OID: 640106787
Product: lipopolysaccharide biosynthesis
Protein accession: YP_001037788
Protein GI: 125973878
COG category: [D] Cell cycle control, cell division, chromosome partitioning
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning
        [COG3944] Capsular polysaccharide biosynthesis protein
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
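
The coordinates and lengths listed above are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustration: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed in this record
print(cds_lengths(1655927, 1657321))  # (1395, 464)
```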


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000148306
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGG ATGATATCTT GGAAATTGAT TTGCGGGAGA TAGTATATAT TCTTCTTAAG 
AAGTGGTATC TCATTGTGTT GTGCTTTGTT CTATCAGCTG GCACAGCCTT TGTAGTTACT
CAATTTTATA TAAAACCGGT CTATAAAGCT GAAACAACAC TGTTTCTTGG TAAGGAGAAG
GATCAGGTTT CACTGAGCTT TTCCGATATA CAGGTAAATA ATCAACTTGT TGTGGACTAT
AGAGAGATAC TTCAATCAAG GCGTGTGGCA GAGATTATTA ATCAAAAGTT TGGAGTTGGA
ATACAGGAAT TTCAAAAAAA TGTAAACGTA AAAACAGTAA AAGATTCAAG ATTGTTCAGT
ATAAGCTATG AGGATACTGA TCCAAAACGT GCTGCTGACA TAGTAAATGA ACTGGCAACG
GTTATTCAGA AGATGGCAGA CGAGATTATA CAGGTCAAAA ATATTAAAGT TATAGATACT
GCGAAAATAC CGGAAAACCC TATAAAGCCT AATAAGAAAA TGAATATATG TGTGGCAGGA
TTGTTTGGTT TGGTATTGGG GATCGGATTG ATATTCTTAC TTGAATTGAT TGACCATACA
TTTAAGAAGC CGGAAGAAGT TGAAAAGATG CTTGGAATTA ATGTTATTGG TACAATCCCG
GCATTTGATG GCGGCAAACG AGGAAAGAAA AAAGCCAAGG ATGAAAAAGA ACTTCAAGAA
CAATACCTTA AAAATCTTAT TGTACATAAT AATCCAAAGT CAGCCACTGC CGAGGCGTTT
AGAGAGCTTC GCACAAATTT GTATTATTCA AGTGTAGACA GCGAAGTAAA GACTATAGTA
GTAACAAGTC CAACTCTCGG AGACGGCAAA ACGGTTACTG CTGTGAATCT TGCAATAACT
CTTGCACGTT CCGGCAAGAA AGTTCTTGTT ATAGATGCTG ACTTAAGAAA GCCAAAGGTT
CATCATTATT TTGGTGTGAA AAATAAAGAG GGTTTAACTA ATCTTTTAAC TGATTCAAAG
GAAGAAGTAA AAATTAAAAC AACAGAGAGA AGCGATATTT CAAATCTGTA TATAATTACA
AGTGGTCCGA TTCCGCCAAA TCCGGCAGAA ATGCTTAATT CAAACAGGAT GAAAAGCCTT
TTGGAAAAAG TACGGGAGGA ATATGATATT GTTATCATAG ACACTCCTCC GGTGGGGCAG
GTTACAGATG CAGCAATTCT TGCCGGAATT ACGGATGGAG TTATCCTTGT ATTGGCAAGT
GGCCAAACAA GAATAGAAAT GGCGAAGCGA GCTTTTAAAT CCCTTGAGAG TGTAAAAGCA
AGGTTTATTG GTGCAGTTCT TACGAAATTG GATACTGAAA GAACTGGTTA TTATTACAGT
TACAAGTATG AGTGA
 
Protein sequence
MEKDDILEID LREIVYILLK KWYLIVLCFV LSAGTAFVVT QFYIKPVYKA ETTLFLGKEK 
DQVSLSFSDI QVNNQLVVDY REILQSRRVA EIINQKFGVG IQEFQKNVNV KTVKDSRLFS
ISYEDTDPKR AADIVNELAT VIQKMADEII QVKNIKVIDT AKIPENPIKP NKKMNICVAG
LFGLVLGIGL IFLLELIDHT FKKPEEVEKM LGINVIGTIP AFDGGKRGKK KAKDEKELQE
QYLKNLIVHN NPKSATAEAF RELRTNLYYS SVDSEVKTIV VTSPTLGDGK TVTAVNLAIT
LARSGKKVLV IDADLRKPKV HHYFGVKNKE GLTNLLTDSK EEVKIKTTER SDISNLYIIT
SGPIPPNPAE MLNSNRMKSL LEKVREEYDI VIIDTPPVGQ VTDAAILAGI TDGVILVLAS
GQTRIEMAKR AFKSLESVKA RFIGAVLTKL DTERTGYYYS YKYE
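
The protein sequence can be re-derived from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the tool that produced this record. The names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the input is given 5' to 3' on the coding strand and starts in frame.

```python
from itertools import product

# NCBI-style codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("ATGGAAAAGG ATGATATCTT GGAAATTGAT"))  # MEKDDILEID
```

Passing the full gene sequence in place of the 30-base prefix should, under the same assumptions, yield the protein sequence listed in this record.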