Gene Haur_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1002 
Symbol 
ID5732905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1147161 
End bp1148369 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content54% 
IMG OID641278136 
Productmetallophosphoesterase 
Protein accessionYP_001543778 
Protein GI159897531 
COG category[R] General function prediction only 
COG ID[COG1408] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAT CAGAAGCAGA TAGTTTGCCG AAAGCTCGTC CATCGCAACG GCTCAATCGA 
CGCAAAATTG CCAGCATTAT TATGGCTTGG ATGGCGCTGT GCTGGCTGAT TATTGGTAGC
GTGTTTTATA GTGTCGTGCC TGGTGGCTGG CTCAGCATTC TAGGCTTGAT GCTCTTGAGT
TACATTCCAT TGCTTTTTGT AGCACGTTCG TTTGGTGGGC CAGTTGCACC ATCGGCGCAT
ATCCGCTTGT GGGGCTTTCG GCCATTTTGG TATAGCCAAT TGCTCTTGCC ATTGATGGCA
ATCGGTGGCC TAATTGGCTT GATTATTGGC TTGCCTTTCC ACAGCAGTGG GTTGTTGGGT
CGTGGCTTGG CCGGTGGCAT CGGGTTACTC TATTTGACGG GCATTGGTTT GGCTTATTTT
GGCTCACGCC GCTTGGCTGT GCGCGAATTG ACCGCTAATT TGCCCCAATT GCCCAACGAG
CTAGCTGGCT TGAAAATTGT CCAGATCTCG GATACCCACG TTGGGCCGCA TACCTCGCGC
CGCCACTTGC GCAACGTCGT CGCGGCAATC GAAGCCGCCA AGCCTGACCT GATTGTCATG
ACTGGCGATC AAGTTGATGA TTATGTTGAT GACGTTGAAC CATTTGCCGC AGCCTTCGGC
CAACTCTCAG CCCCCTTAGG CGTGGTTGCC ATCGCTGGCA ATCACGATGT CTATGCTGGT
TGGGATGGCG TGCGGGCTGG ATTAGAAGCC ATGGGCATCA AGGTTTTGGT CAATCAAGCG
ACGGCATTTA ATTATCGTGG CGTGCGTTGG TGGCTGGCAG GCACTGGCGA TCCGGCAGGA
ACCTACGTGG CCCAAGGTCG GGAAATTGTG GCCCCTGATA TTCCCAAAAC CTTGGCTGAT
GTTCCAGCCA ATGAGTTTCA TGTGGTTTTA GCCCACAACC CAGCGCTCTG GCCCGCTTTG
GCCCAACGCA ACGTGCCACT AACCTTGAGC GGCCATACCC ACTACGGCCA ATTTGCCATT
CCCAAACTTG GCTGGAGCAT GGCTTCGGCC TTTTTGGAGC ATGCCATGGG TCACTATCAG
CTTGAGCAAT CGCTACTCTA CATCAACCCC GGCACGAACT ATTGGGGCAT TCCCTTCCGG
CTCGGCACCA AGCCCGAAGT CACAGTAATT ACATTACAAC CCAGCCAAAC CGCATCCATC
GTTGGGTAA
 
Protein sequence
MAASEADSLP KARPSQRLNR RKIASIIMAW MALCWLIIGS VFYSVVPGGW LSILGLMLLS 
YIPLLFVARS FGGPVAPSAH IRLWGFRPFW YSQLLLPLMA IGGLIGLIIG LPFHSSGLLG
RGLAGGIGLL YLTGIGLAYF GSRRLAVREL TANLPQLPNE LAGLKIVQIS DTHVGPHTSR
RHLRNVVAAI EAAKPDLIVM TGDQVDDYVD DVEPFAAAFG QLSAPLGVVA IAGNHDVYAG
WDGVRAGLEA MGIKVLVNQA TAFNYRGVRW WLAGTGDPAG TYVAQGREIV APDIPKTLAD
VPANEFHVVL AHNPALWPAL AQRNVPLTLS GHTHYGQFAI PKLGWSMASA FLEHAMGHYQ
LEQSLLYINP GTNYWGIPFR LGTKPEVTVI TLQPSQTASI VG