Gene Cthe_1918 details


Gene Information

Locus tag: Cthe_1918
Symbol:
ID: 4810776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2287981
End bp: 2289441
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 44%
IMG OID: 640107335
Product: arginine decarboxylase
Protein accession: YP_001038330
Protein GI: 125974420
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1982] Arginine/lysine/ornithine decarboxylases
TIGRFAM ID:
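
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2287981, 2289441)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Under these assumptions both values agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.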


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000463631
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCTAA GAATGAACCA GCTTAAGACA CCGGTTTTTG ATGCTGTGAA AAGATACGTT 
GAAAGCAATG TGATACAGTT TCATGTACCG GGTCACAAGC AGGGAGCTGG TCTTGAGGAG
CTTAGAGAGT ATATAGGCGA AACAGCGCTT AAAATGGATG CAAACGGTAT GGAGGATTTG
GATTTTGCCA ACAACCCCAC GGGGGTTATT TATGAGTCGG AGATGCTGGC AGCTCAGGCA
TTTGGGGCTC AGCATGCTTA TTTTTTGGTA AACGGGACTA CGTCCGGTGT TCAGGCAATG
ATTATGAGTG CTTGTGAACC GGGAGACAAA ATAATACTTC CGAGAAATGC GCACAAGTCC
ACGATAGGTG GAATTATATT AAGCGGTGCC GTGCCGGTGT ATGTGCAGCC CGAAATAAAT
GAAAAACTGG GCATTGCCAT GGGAATTACG GTGGAAAGCC TGAAAAAGGC AATAAAGGAA
AACCCCCATG CAAAAGCTGT GTTCATTATC AATCCGACTT ATTATGGAAT AGCCTCGGAT
TTAAAATCCC TCGTAAGAAT TGCCCACAGG TACGAAATGG CTGTTTTGGT GGATGAGGCT
CACGGTGCGC ATATGTCTTT TCATGATGAT TTTCCTCTGA CGGCAATGGA AGTCGGTGCC
GATATGAGTG CCGTAAGTAC TCACAAAACG GGAGGTTCAC TGACACAAAG TTCTTTGCTG
CTTTTAAGGG GAAACATGAT CAGCCCTGAA AGAGTAAAAC AGGTTTTAAA CCTTACATAT
ACTTCCAGCG CATCATATCT TTTGATGTGT TCCCTTGACA TTGCAAGAAA ACAGCTTGCC
ACAAAAGGAA GCGACATGTT GGAGGAAACT TTGAGGCTTG CCAGAATGGC AAGAGAGGAG
ATAAACAAAA TTGAAGGTTT GTATGCCTTC GGGAAAGAAC TTATAGGAAC ACCGGGATGT
CATGATTTTG ATGAAACAAA ACTGGGTATA TGTGTATCAG GCTTGGGATA TACCGGATAT
GAGATGGAAG CGAAACTCAG AAAAGAATAC AATATACAGA TTGAAATGTC GGATCTTTCA
AATATACTGG CAATAGTAAG CATAGGTGAC AGGGAAGAAA ACTTGGTGGC CCTTATAAAC
GCACTGAAAG ATATTGCGGC CAAAACAGAG AAAAAGGAAT ATCCCAAACC GCCGATTATT
CCTCCGACTC CCAAAATGAT AGTTTCACCC AGGGACGCTT TTTACAGCCC CAAGAAAATA
GTCCCGTTGG ACAAATCTGT GGGAGAGATA TCGGGAGAAA TGGTTATGGC TTACCCTCCG
GGCATACCGG TTGTATGCAT GGGTGAGAGA ATTACTCAGG ACATAGTGGA TTACATAAAG
ATACTGAAGG AGCAGAAAAC CCAGCTGCAA GGCACGGCGG ATCCATACAT TGATCATATA
ATGGTTCTTG GTGTTGATTG A
 
Protein sequence
MVLRMNQLKT PVFDAVKRYV ESNVIQFHVP GHKQGAGLEE LREYIGETAL KMDANGMEDL 
DFANNPTGVI YESEMLAAQA FGAQHAYFLV NGTTSGVQAM IMSACEPGDK IILPRNAHKS
TIGGIILSGA VPVYVQPEIN EKLGIAMGIT VESLKKAIKE NPHAKAVFII NPTYYGIASD
LKSLVRIAHR YEMAVLVDEA HGAHMSFHDD FPLTAMEVGA DMSAVSTHKT GGSLTQSSLL
LLRGNMISPE RVKQVLNLTY TSSASYLLMC SLDIARKQLA TKGSDMLEET LRLARMAREE
INKIEGLYAF GKELIGTPGC HDFDETKLGI CVSGLGYTGY EMEAKLRKEY NIQIEMSDLS
NILAIVSIGD REENLVALIN ALKDIAAKTE KKEYPKPPII PPTPKMIVSP RDAFYSPKKI
VPLDKSVGEI SGEMVMAYPP GIPVVCMGER ITQDIVDYIK ILKEQKTQLQ GTADPYIDHI
MVLGVD
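
The relationship between the gene and protein sequences above can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline. It assumes that the sense codons and stop codons of translation table 11 match the standard genetic code (table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listed above.
print(translate("ATGGTGCTAA GAATGAACCA GCTTAAGACA CCGGTTTTTG ATGCTGTGAA AAGATACGTT"))
# MVLRMNQLKTPVFDAVKRYV
print(translate("ATGGTTCTTG GTGTTGATTG A"))
# MVLGVD
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the listed protein sequence and the GC content field.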