Gene Cthe_0876 details

Gene Information

Locus tag: Cthe_0876
Symbol:
ID: 4810494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1050392
End bp: 1051684
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 38%
IMG OID: 640106292
Product: glycosyl transferase family protein
Protein accession: YP_001037303
Protein GI: 125973393
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
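
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1050392, 1051684)
print(length)                           # 1293
print(expected_protein_length(length))  # 430
```

Under these assumptions the computed values agree with the listed gene length of 1293 bp and protein length of 430 aa.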


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000359561
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCAA GGTTTATAGA CTACATCAAA AAAAACAAAT ATCTTGTCAT AATATTTATA 
GGGGTAATAC TCAGGCTCGT ATGGATTTTT GCCATGCCCA CGTATCCCGA AACCGACTTT
ATGTGGTATC ATGTAAAGGG AAAGGAAATT GCTGAAGGAA AAGGATTTTT AAACGGAATT
TATCCCTATT ACACCGGAAG GCCGGGATTT CCCACAGCTT TCAGACCCAT AGGCTATCCC
GGGACTTTGG CAATCCTCTA TTTCATATTC GGTGCCCACT TTATAGTGGC AAAACTGTTT
AATATTGTGC TTTCCACTCT TATCATGTTT TTGACTTACA AGCTTGCCGA CAAGTTTTTC
GGTAACAAAA TTGCTCTTTT GACTTTGCTC CTTTATGCCT TGTCGCCCTT AGCAATAGCA
TACAACAGTA TCATATGCTC GGAAATTCTG TTTTCTGCGC TGTTAATGTT GTCTGTTTAT
CTGTTTTTCA ATAAAAACAA TCCCCTTCTC ATTGGGCTTC TAATTGGTTA TTTAACTCTT
GTAAGACCCA TAGGAGTGTT TATTCCGTCA ATATTTGTAC TGTATGAATT TATCCGAAAA
GACGTGGGAC TTAAGCATAA AATCAAATAT GTTGCAGTTT TTGCAGTAGC AGTGGGATTG
GTAATTGCTC CATGGATAAT AAGAAATTAC ATTGTTTTCG GCGAACCAAT CTTCTCCACC
AACGGCGGCT ATGTTTTCTA CGTAAATAAC AACGACTATG CAACCGGTTC GTGGAGTGAC
CCCTTCAAGT ACCCCGACAG TCCGATGCTG AAGTACAAAA CGGAAGACGG ATTTGATGAA
TTGGCAATTC ACAAACTTGG CAAACAACTT GCTAGAGAAT GGATAAAGAA AAATCCCAAA
AGATTCATTG AGCTGGCATT TCTCCGTATT GCCAATTCAT ACTGGTTCAA AACCGAAGAT
ATAATGTGGG CGTTTACCAT AGGCATCAAC CAGTGGCACC CTGTAACTTC TAAGGCTGTC
AAGCTTCAAA AACTTTTATA CCGGCCTTTT TACATCGTAC TTTTCATATT CATTATATAT
GCTTTGATAA GGTTTATACG ACAGAGAAAA ATTGACTTTA CCACATTCAT CCTTCTTATA
TTTCTCTACT TTAATGCAAT GATGTTTGTG CTGGAGGGAA ACTCAAGATA TGTTTTTCCT
CTTCACCCGA TTTACACGAT AGGCGTATCC TTTGTTATAT ACAATGTACT TAAAAAGCTG
CTGCCTGAAC GTTTTTCAGC AGTTTTGTCT TAA
 
Protein sequence
MISRFIDYIK KNKYLVIIFI GVILRLVWIF AMPTYPETDF MWYHVKGKEI AEGKGFLNGI 
YPYYTGRPGF PTAFRPIGYP GTLAILYFIF GAHFIVAKLF NIVLSTLIMF LTYKLADKFF
GNKIALLTLL LYALSPLAIA YNSIICSEIL FSALLMLSVY LFFNKNNPLL IGLLIGYLTL
VRPIGVFIPS IFVLYEFIRK DVGLKHKIKY VAVFAVAVGL VIAPWIIRNY IVFGEPIFST
NGGYVFYVNN NDYATGSWSD PFKYPDSPML KYKTEDGFDE LAIHKLGKQL AREWIKKNPK
RFIELAFLRI ANSYWFKTED IMWAFTIGIN QWHPVTSKAV KLQKLLYRPF YIVLFIFIIY
ALIRFIRQRK IDFTTFILLI FLYFNAMMFV LEGNSRYVFP LHPIYTIGVS FVIYNVLKKL
LPERFSAVLS
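
The sequences above are printed in space-separated blocks of ten, so they need cleaning before analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. It uses plain Python with the standard amino acid assignments, which translation table 11 shares with the standard code. It ignores the alternative start codons of table 11, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_percent` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGATTTCAA GGTTTATAGA CTACATCAAA AAAAACAAAT ATCTTGTCAT AATATTTATA"
print(translate(clean(first_line)))  # MISRFIDYIKKNKYLVIIFI
```

Passing the full gene sequence to `clean` and then to `gc_percent` and `translate` would allow a comparison with the listed GC content and protein sequence.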