Gene Haur_4685 details

Gene Information

Locus tag: Haur_4685
Symbol:
ID: 5736532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5983366
End bp: 5984433
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 52%
IMG OID: 641281849
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001547444
Protein GI: 159901197
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
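
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5983366, 5984433)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Both values agree with the Gene Length and Protein Length fields listed in the record.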


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCATC AATCTGTGCG GCAATTTTTC GAAGCAACTG AACGTGATAC GCTCTCGCCA 
TTGGCCGCCT TGAGTGCTCA GGCCAAACGC GATCAGCCAG AGCCAGCTTC GCCAGTGCGC
ACCGAATTTC AGCGCGATCG CGACCGCATT TTACATTCCA AGGCTTTTCG CCGACTCAAA
CATAAAACTC AAGTCTTTAT TGCGCCGATT GGCGATCATT ATCGTACTCG TTTGACCCAC
ACGCTTGAGG TGACCCAAAT TGCCCGTACC ATTGGTCGCG CTTTGCGGCT AAACGAAGAT
TTAATCGAGG CGATTGGCTT GGGCCACGAT TTGGGGCACA CGCCATTTGG CCATGCTGGT
GAAGCCGCGC TCGCCAAAGC AATTGGCCGC AAGTTTCGGC ATAACGAGCA AAGTGTGCGG
GTGGTTGAGC TGTTGGAGAA ACATGGCGAG GGGCTGAATT TAACCCAACA AGTGCGCGAG
GGTATCTATT CGCACTCCAA ATCGCGCAAA GATATTACCA CCGCAACATG GGGCACAGCC
TCAACCCTCG AAGGCCAAAT TATCAAATTG GCTGATAGTG TGGCCTACAT TAATCATGAT
ATTGATGATG CGATGCGGGC TGGCATTTTA CAGCTGGGCG ATTTGCCAAG CGCCTATGTG
GCAGTGCTTG GCACAACCCA CGCCGAGCGG ATTAATACCA TGGTTTGCGA TATGATCGAC
CATAATTGGT GGGCACGCGG TGAGCAGCCA GCCCCCGGCG AATTGAGCAT CAGCATGAGT
CCGCAAATTC TAGAGGCAAC CAACGGCGTG CGCGAATATA TGTATGCCAA TGTTTATTTG
CGCGGCCCCG CCAAAACCGA GGATGGCAAG GTCGAGTATG TGATCAACAC ACTCTACGAA
TATTATTGTC AGCATCCCGA AGCCTTGCCA AGCGATCTGT TGGCAATTTG CGAGCAGCGC
GGCGAGCCAA CCGAACGCGC CGTGATCGAT TATATTGCTG GCATGACTGA TCGCTATGCC
CTGAAAAAAT TCAACGATTT GTTTATTCCT AAAACGTGGG ATATGTAG
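
A GC content figure such as the one in the Gene Information section is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation; the helper name `gc_content` is hypothetical, and how the database itself computes or rounds its value is not stated in the record. Whitespace is ignored so that the blocked sequence can be pasted in directly.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 2 G + 3 C out of 10 bases.
print(gc_content("ATGCAGCATC"))  # 50.0
```

Passing the full gene sequence to the same function gives the gene-wide percentage.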
 
Protein sequence
MQHQSVRQFF EATERDTLSP LAALSAQAKR DQPEPASPVR TEFQRDRDRI LHSKAFRRLK 
HKTQVFIAPI GDHYRTRLTH TLEVTQIART IGRALRLNED LIEAIGLGHD LGHTPFGHAG
EAALAKAIGR KFRHNEQSVR VVELLEKHGE GLNLTQQVRE GIYSHSKSRK DITTATWGTA
STLEGQIIKL ADSVAYINHD IDDAMRAGIL QLGDLPSAYV AVLGTTHAER INTMVCDMID
HNWWARGEQP APGELSISMS PQILEATNGV REYMYANVYL RGPAKTEDGK VEYVINTLYE
YYCQHPEALP SDLLAICEQR GEPTERAVID YIAGMTDRYA LKKFNDLFIP KTWDM
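
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs only in which codons may act as start codons), and translates in frame until the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCAGCATC AATCTGTGCG GCAATTTTTC"))  # MQHQSVRQFF

# Last 48 bases of the gene sequence, which start in frame and end in TAG.
print(translate("CTGAAAAAAT TCAACGATTT GTTTATTCCT AAAACGTGGG ATATGTAG"))  # LKKFNDLFIPKTWDM
```

The two outputs match the first ten and last fifteen residues of the protein sequence above.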