Gene Elen_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1401 
Symbol 
ID8415699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1674744 
End bp1675787 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content68% 
IMG OID645024370 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003181759 
Protein GI257791153 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000105022 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCA TCCATCGCGA GGACCAAGAG CAGCGCGAGC ACGTGGAGCT TTCCTGCGAC 
GCGGCGTTCG CCGACGAGAG CGACGGCCGC GACCGATCCG CCGAGCCCGA CATCCTGCGC
ACCGACTACC AGCGCGACCG CGATAAGATC CTCCACACGA AGTCCTTCCG CCGCCTGTCG
CACAAGACAC AGGTGTTCCT GGCCGCCGAG GGCGACCACT TCCGCACCCG TCTCACGCAC
ACGCTGGAGG TGGCGCAGAT CGCCCGCACC ATCGCCCGCG CGCTGGGGCT GAACGAGGAT
CTCGCCGAGG CCATCTCGCT CGGCCACGAC CTGGGGCACA CGCCCTTCGG GCATACGGGG
GAGGAGGCGC TCGCGCGCTG CCTGGCGCGC CACAAGGGGA TCGACCCGGC ATCGCCCGAG
GCGGAGGCGC TCTACCGCCA CAACGAGCAG AGCCTGCGCG TGGTCGAGCG CATCGAGAAC
GGCGGCAAGG GACTGAATCT CACGTCCGAG GTGCGCGACG GCATCCTCAA CCACACCGGC
GACCTGCGCG CCGAGACGCT GGAGGGGCGC ATCGTGGGCA CGGCCGACCG CATCGCGTAC
GTCAACCACG ACATCGACGA CGCCATCCGC GCGGGCATCC TGCGCGAGGT CGACCTGCCG
GCGTCGACGC ACGCCATGCT GGGCCCCGAC CATTCGTCGC GCATCGAGAC GCTCGTGCTC
GACATGGTGG AGACGTCGGC CGCCGTCGAC GACATCCGCA TGAGCGACGC GGTGTGGAAC
GCCATGATGG AGCTGCGGTC GTTCCTGTTC GAGCGCGTGT ACAGCGCCCC TGCCGTCACC
GACGAGGTGG CGAAGGCGAC GCACCTCGTG GACGACCTGT TCGACTACTA CGTGGCGCAC
ACGGGCGAAG TTCCGCAGGA GTACCGCGCC ATCTCCGAGG GCGACGACCT GCGCGCCGTC
ACCGACTTCA TCGCCGGCAT GACCGACCGC TACGCCAAGA ACCTCTACCA AAGGCTGTTC
ATCCCCAACG CGCTGCATTA CTAG
 
Protein sequence
MRIIHREDQE QREHVELSCD AAFADESDGR DRSAEPDILR TDYQRDRDKI LHTKSFRRLS 
HKTQVFLAAE GDHFRTRLTH TLEVAQIART IARALGLNED LAEAISLGHD LGHTPFGHTG
EEALARCLAR HKGIDPASPE AEALYRHNEQ SLRVVERIEN GGKGLNLTSE VRDGILNHTG
DLRAETLEGR IVGTADRIAY VNHDIDDAIR AGILREVDLP ASTHAMLGPD HSSRIETLVL
DMVETSAAVD DIRMSDAVWN AMMELRSFLF ERVYSAPAVT DEVAKATHLV DDLFDYYVAH
TGEVPQEYRA ISEGDDLRAV TDFIAGMTDR YAKNLYQRLF IPNALHY