Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4685 |
Symbol | |
ID | 5736532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5983366 |
End bp | 5984433 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281849 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_001547444 |
Protein GI | 159901197 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.103338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCATC AATCTGTGCG GCAATTTTTC GAAGCAACTG AACGTGATAC GCTCTCGCCA TTGGCCGCCT TGAGTGCTCA GGCCAAACGC GATCAGCCAG AGCCAGCTTC GCCAGTGCGC ACCGAATTTC AGCGCGATCG CGACCGCATT TTACATTCCA AGGCTTTTCG CCGACTCAAA CATAAAACTC AAGTCTTTAT TGCGCCGATT GGCGATCATT ATCGTACTCG TTTGACCCAC ACGCTTGAGG TGACCCAAAT TGCCCGTACC ATTGGTCGCG CTTTGCGGCT AAACGAAGAT TTAATCGAGG CGATTGGCTT GGGCCACGAT TTGGGGCACA CGCCATTTGG CCATGCTGGT GAAGCCGCGC TCGCCAAAGC AATTGGCCGC AAGTTTCGGC ATAACGAGCA AAGTGTGCGG GTGGTTGAGC TGTTGGAGAA ACATGGCGAG GGGCTGAATT TAACCCAACA AGTGCGCGAG GGTATCTATT CGCACTCCAA ATCGCGCAAA GATATTACCA CCGCAACATG GGGCACAGCC TCAACCCTCG AAGGCCAAAT TATCAAATTG GCTGATAGTG TGGCCTACAT TAATCATGAT ATTGATGATG CGATGCGGGC TGGCATTTTA CAGCTGGGCG ATTTGCCAAG CGCCTATGTG GCAGTGCTTG GCACAACCCA CGCCGAGCGG ATTAATACCA TGGTTTGCGA TATGATCGAC CATAATTGGT GGGCACGCGG TGAGCAGCCA GCCCCCGGCG AATTGAGCAT CAGCATGAGT CCGCAAATTC TAGAGGCAAC CAACGGCGTG CGCGAATATA TGTATGCCAA TGTTTATTTG CGCGGCCCCG CCAAAACCGA GGATGGCAAG GTCGAGTATG TGATCAACAC ACTCTACGAA TATTATTGTC AGCATCCCGA AGCCTTGCCA AGCGATCTGT TGGCAATTTG CGAGCAGCGC GGCGAGCCAA CCGAACGCGC CGTGATCGAT TATATTGCTG GCATGACTGA TCGCTATGCC CTGAAAAAAT TCAACGATTT GTTTATTCCT AAAACGTGGG ATATGTAG
|
Protein sequence | MQHQSVRQFF EATERDTLSP LAALSAQAKR DQPEPASPVR TEFQRDRDRI LHSKAFRRLK HKTQVFIAPI GDHYRTRLTH TLEVTQIART IGRALRLNED LIEAIGLGHD LGHTPFGHAG EAALAKAIGR KFRHNEQSVR VVELLEKHGE GLNLTQQVRE GIYSHSKSRK DITTATWGTA STLEGQIIKL ADSVAYINHD IDDAMRAGIL QLGDLPSAYV AVLGTTHAER INTMVCDMID HNWWARGEQP APGELSISMS PQILEATNGV REYMYANVYL RGPAKTEDGK VEYVINTLYE YYCQHPEALP SDLLAICEQR GEPTERAVID YIAGMTDRYA LKKFNDLFIP KTWDM
|
| |