Gene Athe_1376 details

Gene Information

Locus tag: Athe_1376
Symbol:
ID: 7409119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1460196
End bp: 1461488
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 36%
IMG OID: 643715741
Product: dihydroorotase, multifunctional complex type
Protein accession: YP_002573249
Protein GI: 222529367
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
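
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1460196
END_BP = 1461488
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1293, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 430, matches Protein Length
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.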


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATTGA TAAAAAACGC ACAGATTGTA AATAGTATTG ACAACAAACT TGAAAAGGCT 
GATATACTGA TAGTTGATGA CAAGATAGAA AAGATTGGTA AAAACATAGA AGAAAATCCA
GATAAGATGA TTATAATAGA TACAAGCGGC AAGTATGTAA TGCCAAGCTT TACCGATATT
CACTGTCATT TGCGGGAACC TGGTTTTGAG TACAAAGAAG ACATAAAGAG CGGAAGTAGA
GCTGCTTTAG CAGGAGGATT TACAACCATC TGTTGTATGC CAAACACAAA CCCTCCTGTA
GACAACAGAG CAATGATTGC GTATATAAAA TACCGTGCAA AAGAGGTCTC ACCAATTGAG
GTTTTACCTG TTGGGGCTAT AACAAAAGGA CTTTCAGGAG AAGAGCTTGC AGAGATAGGA
TTTATTAAAG AAGAAGGGGC CATTGCTATA TCAGACGATG GAAAGTGTGT TATGAACGCA
AACATTATGA GAAATGCTCT TTTGTACTCA AAAGATTTTT CAATACCTGT CATTTCACAC
TGTGAGGATA CAAACTTATC TGAAGGAGGA CAGATAAATT TAGGATATGT GTCAACAATC
ACGGGACTTA GAGGAATTCC ACGCGAGGCA GAATCAATTA TTGTTGCAAG AGATATTCTT
CTTGCAAAAG AGACAAAAGC ACATCTTCAT ATAACCCATG TGTCCACCAA AGAATCTGTT
AGACTTATAA AAATGGCAAA AGAGTGGGGT GTAAATGTCA CGGCTGACAC ATGCCCGCAT
TATATAAGTC TTACAGAAGA AGAGGTACTT GGATTTAACA CAAATGCAAA AGTAAACCCT
CCTTTGAGAA CACAAGAGGA TATTGAAGCT TTAATTGAAG GATTAAAAGA AGGTGTAATT
GACTGTATAT CAACAGACCA TGCCCCGCAT CATAAAGATG AAAAGAATGT CGAATTTAAC
CTTGCTGCAA GCGGTACAAT TGGGTTTGAG ACTGCATTTT CTGTGCTGTT CACATATCTT
GTCGAGAAAA ATGGGTTTGA TATTGGGAAA ATAGTAGAAC TTTTGAATTA CAATCCCAGA
AAAATAATTG GACTTTCTCC AAATATTATA AAAGAAGGTG AAAAAGCCAA CCTTGTAATT
GTGGATTTAA AGAAAAAGTG GGAAGTAAAA GAGGAAAACA TTGTGTCAAA ATCAAAAAAT
AGTGTGTTTT TGGGAAAACT TTTGACTTCT TATGTTGAGA CAGTAATATA CAATGGGAAG
ATATTAAAAA AGGACGGTGT TTTAAGTTGT TGA
 
Protein sequence
MILIKNAQIV NSIDNKLEKA DILIVDDKIE KIGKNIEENP DKMIIIDTSG KYVMPSFTDI 
HCHLREPGFE YKEDIKSGSR AALAGGFTTI CCMPNTNPPV DNRAMIAYIK YRAKEVSPIE
VLPVGAITKG LSGEELAEIG FIKEEGAIAI SDDGKCVMNA NIMRNALLYS KDFSIPVISH
CEDTNLSEGG QINLGYVSTI TGLRGIPREA ESIIVARDIL LAKETKAHLH ITHVSTKESV
RLIKMAKEWG VNVTADTCPH YISLTEEEVL GFNTNAKVNP PLRTQEDIEA LIEGLKEGVI
DCISTDHAPH HKDEKNVEFN LAASGTIGFE TAFSVLFTYL VEKNGFDIGK IVELLNYNPR
KIIGLSPNII KEGEKANLVI VDLKKKWEVK EENIVSKSKN SVFLGKLLTS YVETVIYNGK
ILKKDGVLSC
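
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the standard compact encoding (bases in T, C, A, G order). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. Because this gene starts with ATG, no special start-codon handling is included. The helper names (`clean`, `translate`) are hypothetical, and only the first 30 and last 33 bases of the record are used.

```python
# Codon table in the compact T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 33 bases of the gene sequence in this record.
FIRST_30 = "ATGATATTGA TAAAAAACGC ACAGATTGTA"
LAST_33 = "ATATTAAAAA AGGACGGTGT TTTAAGTTGT TGA"

print(translate(FIRST_30))  # MILIKNAQIV
print(translate(LAST_33))   # ILKKDGVLSC
```

The last fragment is in frame because the 1260 bases before it are a multiple of three. Both outputs match the first and last ten residues of the protein sequence listed above.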