Gene Haur_2661 details

Gene Information

Locus tag: Haur_2661
Symbol:
ID: 5734556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3414559
End bp: 3416712
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 50%
IMG OID: 641279803
Product: metal dependent phosphohydrolase
Protein accession: YP_001545427
Protein GI: 159899180
COG category: [R] General function prediction only
COG ID: [COG1480] Predicted membrane-associated HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
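
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced here for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 3414559, 3416712

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2154
print(protein_length(length_bp))  # 717
```

Both results agree with the Gene Length and Protein Length fields listed in the record.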


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00304076
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGTTC GTTCATATCG CTTAAAAACC TTGCTACGTT CGTTCATCGA GCATCACCAC 
CATTTGGTGT TGGTGCTCTT TGGGGCCGTG CTCACGCTAA TTTTGACCTT GATTTTTACG
TGGCGCTCGG CGATCAACCA AGATATTATG GTTGGTCGTC CCAGCCCACG CACAATCAAC
GCCGACCGCG ATTTGACCTT TGAAAGCCCC TTGCTGACTG AGGCCAAACG CCGTGAAGCC
GCCAACGATC CCCGCAACTT GGTCTATAAC GAAGATACTC AAATTCATGG TCAACAGCGT
GAACAGCTGC AAGCAACCTA CAGCGTGATT AACTCGGTTC GCGAAAATCC CAGCCTCAAC
CTCGATCAAC AACGCGGGCA ACTGACTGAA TTGCCTTCGC TGCCGCTCTC CGATACCCTC
GCCATCACCA TTCTTGAAGC TGATGACGAT ACCTGGCAAC GGATCAAAGA TCAAACCAAT
GCCTTGTATG ATCGAACGCT GCGCGAAAAT AATTATTCAA TTGATGAAAC CACCCTCGCC
GAAATTAAAG TGCGCTATTT GCCTTACAAC TTGCCCAGCA GTTTAAAGCC CGATCAACGA
GCGGTTGTGC TGTATTTGGT CGAACAAACC CTGCATGTTA ATCGCACCCT GAATCAAGAG
GAAACTGAGC GCCGCCAACA AACAGCCCGC AATGCTGTCC AATCGGTCTC AAAAGATGTC
GTCAATGGCC AAAATATTGT GCGCCAAGGC GATACGGTTT CAGCAGAACA ATACGAAACC
TTGATCAAAA TGGGTCTGAT CACGCCTGAA TTAGGCTTTG ATGGCTTTAT GGGGCGCTTC
TTGCTAGCGC TCTTAGTGGC GTTGGCCTTA TGTACAGCAC TTTATATCGA TCAACATAAT
CTTTTGACAT GGCCACGGGC ATTGCTGGTC ATCTTGATTT TGATGGTTAT TCCGATTTTG
TCTGGGCGCA TTTTCCTCAA CACATGGCTG AATTTCCCTG AAACGTTTGC TTTGGCGGTG
ATTGCGATTC CGTTGGCAGC GCTATTTAAC AACAATTTGG CCTTAGTTAT TTCAGCCTTA
GTTTCGATTG TGATGATGTT TTTGGGCGAA GGCGCACTCC AAGTTGGCAT GATCAGCTTT
GCCGGAGCCT TGTGTGGCAT CTACGCAATT CGTCGCGCCG ACCGGGCCAT GGCCTTTATT
ATGGCTGGCG TTTGGATTGC GCTGGGAGTA TTCGCCACCG CCATGATTTG GCGTTTGATT
CAGCCCCAAG GCGTAACCTG GCAACAAACT ATGTTCACCT TGATTTTTAG CATGCTCAAC
GGTGGCATCA CGGCCATGAT GTCGTTGACC TTGCATAACG TGCTTGGGCG GATCGCTGGC
ATCGTTACGC CAATGCAATT ATTGGAGTTG GCCCACCCCA ACCAGCCGCT GCTGCGCCGC
TTGATGCAAG AAGCTCCAGG CACTTATCAT CACTCAGTCG TTGTCAGCAA TTTGGCCGAA
CAGGCTGCTG AACGGATCGG CGCTGATACC TTGCTAACCC GCGTGGGAGC CTACTATCAC
GATATTGGCA AAATGCTGCG GCCATTCTTC TTCACCGATA ATCAATACGA TCGCTCAAAT
GTCCACGATA ACCTTGATCC GCAAACCAGC GCCAAATTAA TCGCCGATCA CGTGATTGAG
GGAGCTAAAA TTGCGCGGCA GCATAAGCTG CCTGAGCAAA TTGTTAATTT CATCGTTGAG
CATCACGGCA CCGATGTGAT TCGCTATTTC TATCAGCAAG CCTTACAAGC CCAAGATAGC
GTTGATATCA ACGATTATCG CTACCCTGGA CCCAAGCCAC AATCCAAGGA AACAGCGATT
TTGATGCTAG CCGATGGAGT TGAGGCCACT GTGCGCTCCA AGGAGCAAGC GGGCATGCTC
GTGGCTGAGC GTCACGATGA CGATGATCAA CAAGCACCCA AAGGTTGCCA AAGCATTGCC
CAAGTGGTCA ACCAAAGCAT CGATATGCGC CTTGCCAGCG GCCAGCTTGA TCAATGCCCG
CTCACCCAAA AAGATCTCAA CACAATTCGC CAATCGTTTG TCAAAACGCT CCAAGGGATC
TATCATCCAC GGGTTGAGTA TCCCAAATTG ATGCGGGAAC CGCAAAATAA ATAA
 
Protein sequence
MIVRSYRLKT LLRSFIEHHH HLVLVLFGAV LTLILTLIFT WRSAINQDIM VGRPSPRTIN 
ADRDLTFESP LLTEAKRREA ANDPRNLVYN EDTQIHGQQR EQLQATYSVI NSVRENPSLN
LDQQRGQLTE LPSLPLSDTL AITILEADDD TWQRIKDQTN ALYDRTLREN NYSIDETTLA
EIKVRYLPYN LPSSLKPDQR AVVLYLVEQT LHVNRTLNQE ETERRQQTAR NAVQSVSKDV
VNGQNIVRQG DTVSAEQYET LIKMGLITPE LGFDGFMGRF LLALLVALAL CTALYIDQHN
LLTWPRALLV ILILMVIPIL SGRIFLNTWL NFPETFALAV IAIPLAALFN NNLALVISAL
VSIVMMFLGE GALQVGMISF AGALCGIYAI RRADRAMAFI MAGVWIALGV FATAMIWRLI
QPQGVTWQQT MFTLIFSMLN GGITAMMSLT LHNVLGRIAG IVTPMQLLEL AHPNQPLLRR
LMQEAPGTYH HSVVVSNLAE QAAERIGADT LLTRVGAYYH DIGKMLRPFF FTDNQYDRSN
VHDNLDPQTS AKLIADHVIE GAKIARQHKL PEQIVNFIVE HHGTDVIRYF YQQALQAQDS
VDINDYRYPG PKPQSKETAI LMLADGVEAT VRSKEQAGML VAERHDDDDQ QAPKGCQSIA
QVVNQSIDMR LASGQLDQCP LTQKDLNTIR QSFVKTLQGI YHPRVEYPKL MREPQNK
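
The GC content field in the record can be recomputed from the gene sequence block. The sketch below is one possible way to do this in Python, assuming the sequence is pasted as printed (groups of ten bases separated by spaces and line breaks). The helper names clean_sequence and gc_percent are hypothetical and are not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to format the sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two groups of the gene sequence above, as a short demonstration.
# Paste the full block in place of this string to check the whole gene.
demo_block = "ATGATAGTTC GTTCATATCG"

demo_seq = clean_sequence(demo_block)
print(len(demo_seq), round(gc_percent(demo_seq), 1))
```

Running the same functions on the complete gene sequence should give a value that rounds to the GC content reported in the record, if the record's figure was computed over the full CDS.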