Gene Cthe_1461 details

Gene Information

Locus tag: Cthe_1461
Symbol:
ID: 4810611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1780355
End bp: 1781920
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 43%
IMG OID: 640106882
Product: FAD dependent oxidoreductase
Protein accession: YP_001037883
Protein GI: 125973973
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID:
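
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(1780355, 1781920)
print(gene_len)                  # 1566
print(protein_length(gene_len))  # 521
```

Both results match the Gene Length and Protein Length fields of this record.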


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATAG AGCAACAAAA AATTTTTGCC AAACCGCCCC AGTCATATTG GATGGCTTCT 
ACCCCTAAAG CCAATTATCC AACCCTGGAA GAAGACATAA AAGTTGATGT TGCAATTATC
GGAGGGGGTA TCACCGGTAT CGCCACTTCC TACATGCTTG GCAAAGCCGG CGTAAAAGTG
GCTGTTATTG AAGCCGACCG CATTTTACAA GGCACAACCG GCCATACCAC GGCAAAAATA
ACATCCCAGC ATGACCTGAT ATACAGTAAA ATATACAGCC AAATGGGCAG GGAATTGGCA
CAGCAATATG CCGATGCAAA CGAATCTGCC ATTCGGATGA TTGAAAAAAT AGCAACTGAA
AATGGCATCG AATGCGATTT CGTTCCCCAA TCCGCATATG TGTATACAAT GCAAGACAAG
TATATCGACA AAATAAAAGA TGAAGCCGTG ATTGCCGAAT TCCTCGGTAT AAAAGCCACA
TACCTTGAAG AAATACCTTT GCCCTTCCCA ATTAAAGCCG CGGTCCGCTT TGACAACCAG
GCCCAGTTCC ATCCCCGAAA ATTTCTGCTG CGCCTAGCAG AGGAAATTGT TAAAAGCGGC
AATCAAATAT TCGAGCAAAG CAGAATTGTG GACATTGAAG ATGATAACAA CTATGTTTTA
ATTACAAATC AAGGCAAAAA GGTAACTGCG GAAAAGCTTA TTATCGCTTC CCATTACCCA
TGTTACAATA AAGCCGGGCT ATACTTTACA AGACTATATC CGGAACGGTC ATATGTTGTT
GCCATAAAAG CAAAAGAAAG TTATCCCGGC GGAATGTATA TAAACATGGA AGAGCCAAAG
CGCTCACTCC GCAGCCAAAG GTCAGATGAC GGCGAACTGA TACTGGTCGG CGGTGAAAGT
CACAAAACCG GACAAGGTGA GGATACAATC AAGCATTATG AAGCGTTGAT AGATTATGCC
ACTAAAACTT TTACGGTAGA AGATATTCCT TACCGGTGGT CCACCCAGGA TTGCATGACC
TTGGACGGAC TGCCTTATGT GGGGCATTTC ACATCAAACA CTCCAAATAT GTACATCGCA
ACCGGTTACG GCAAGTGGGG AATGACCAAC AGCATAGCTT CCGCGATGAT ATTAAGGGAT
TTGATAATTG AGGGAAAAAG TCCCTGGCAG GATGTTTACA ACCCGTCACG CAAAACAGTG
TTGGCCTCCG CTAAAAACTT TATTGTTGAA AATCTTAACG TCGCAGAAAA ACTAATTGAA
GGAAAAATCT TGCCCATGGC GGACAATACC GATATTAAAG CCGGAGAAGG AAAAATTATC
AACGTGAACG GTCAGAGACT CGGAGCATAC AGAGACCAAC AAGGTACTCT GCACGTCGTA
GACACAACAT GTACGCATAT GGGTTGTGAA TTATACTGGA ACTCTGCCGA AAAATCCTGG
GATTGTCCCT GCCATGGCTC AAGGTTTACC TATGAGGGCG ATATAATTGA AGGACCGGCA
GTTACGCCTT TAAATGTACA CCGTGATGTG AACACAATTG AAAAACTTTT TAAAGACAAT
TTTTAA
 
Protein sequence
MSIEQQKIFA KPPQSYWMAS TPKANYPTLE EDIKVDVAII GGGITGIATS YMLGKAGVKV 
AVIEADRILQ GTTGHTTAKI TSQHDLIYSK IYSQMGRELA QQYADANESA IRMIEKIATE
NGIECDFVPQ SAYVYTMQDK YIDKIKDEAV IAEFLGIKAT YLEEIPLPFP IKAAVRFDNQ
AQFHPRKFLL RLAEEIVKSG NQIFEQSRIV DIEDDNNYVL ITNQGKKVTA EKLIIASHYP
CYNKAGLYFT RLYPERSYVV AIKAKESYPG GMYINMEEPK RSLRSQRSDD GELILVGGES
HKTGQGEDTI KHYEALIDYA TKTFTVEDIP YRWSTQDCMT LDGLPYVGHF TSNTPNMYIA
TGYGKWGMTN SIASAMILRD LIIEGKSPWQ DVYNPSRKTV LASAKNFIVE NLNVAEKLIE
GKILPMADNT DIKAGEGKII NVNGQRLGAY RDQQGTLHVV DTTCTHMGCE LYWNSAEKSW
DCPCHGSRFT YEGDIIEGPA VTPLNVHRDV NTIEKLFKDN F
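
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how the two relate, the Python below strips the spacing, computes GC content, and translates the DNA. It is an illustration, not the database's own tooling, and all names are hypothetical. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons, which does not matter here because this gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses
# these same codon-to-amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGCATAG AGCAACAAAA AATTTTTGCC"
print(translate(clean_sequence(first_blocks)))  # MSIEQQKIFA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `clean_sequence` and then to `gc_content` or `translate` would allow the same comparison against the GC content and Protein Length fields.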