Gene Haur_4212 details


Gene Information

Locus tag: Haur_4212
Symbol:
ID: 5736924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5366647
End bp: 5368293
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 54%
IMG OID: 641281367
Product: protein serine/threonine phosphatase
Protein accession: YP_001546972
Protein GI: 159900725
COG category: [T] Signal transduction mechanisms
COG ID: [COG0631] Serine/threonine protein phosphatase
TIGRFAM ID:
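
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5366647
end_bp = 5368293

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```

Under these assumptions the computed values agree with the reported Gene Length (1647 bp) and Protein Length (548 aa).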


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.685502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTGAGC CGACTCAACC GATGACTGGT GGAACCCAAC CTCTCAACCC CGTCAACCAG 
CCAGAGGAAC ACGAACCCTT TGCGATTGGC ACTGTGTTGA AGGATATTTA TCGGGTAACC
GCCTTGCTGA CCGATACTCC AACCTTGCGC GTCTATCGCG TGGCGCTGTT GGAGCCATGG
GATCATTGCG CTCGTTGTGG CGCAGCCTTA CAAGCGAGCG ATCAGTTTTG CGAAGAGTGC
GGGGCGCAGG TCGAAGAACA AACTGCTCTG TTACAAGAAA CTCCGGCGGC CCAGCCAATT
GGCGCAGCTT TGCTTGATGA TTTACCCGAT GACCCCGCCC GCGCTGCCTT GCCCACCGTG
CGCGAGGTCT TTGTGGGCGA AGATTCACGG TTTGCGGTGC TGCCCGATGG TACGAGCTTA
GTGCGTTTCG ACACGTTGCT GAGCGAACCA AATACCTTTG TTGATCAAAC TGATGCTGTT
GATATTGGAA TTCAAGTAGC CCGCGCCTTA GCCTATTTGC ATCGCCACGG GTTGGCGCTA
GGCCAATTAA CCTTAGCTGA TTTGGCTTTG ACCAACAAGC GCGAAATTAA ACTGGCTGAT
GCTGGCGCGA TTCGCCGTTC GTTGGGCAAA GAAGATCAAC TTGATGATGT TGAGCATTTA
GGTTTGGTGC TAGAAAAAAT GGCGGGAATT CAGCGCCAAA CTCGCCGCCT TGATGATTCG
AATAATCCTT CGCCGCTCGA TAGCGCTTTT GCCACAATTT TGAGTGATCT CCGCGCCAAG
CGCATCACCG ATGCTAGCAT TTTGGCCCAA ACCCTCGAAA CCCTGCTGGC CGAACAAGCT
ACGCCGATCA GTTTGCGAGT ACGGACTGGC TATGCTACCG ATGTTGGCAT GATTCGCGAT
CATAACGAAG ATAGCGTGCT GACCTGGGAT TTACGCCTGA ACTGGGATGC CAAGCCAGTC
AACGTTGGTC TGTATGTAGT GGCTGATGGT ATGGGTGGTC ACGAAGGCGG CGAGGTTGCT
AGCGGTTTGG CGATCACGAC TACTGCTCAA ACCCTCGTGC CAACCTTGCT TGATCCGCAG
TTACATGCTG GGCCAGTTTC GAGCAAACAC CTCGCCGAAT TGGTCAAGCA AGCAGCATTT
CAAGCCAACC AAGCGGTTTA CGAAGAAAGC GTGCGCCGCA AAAACGATAT GGGTACGACC
CTGACCATGG CGGTGGTTAT CGGCGATCGG GCGATTGTTG GCAACGTTGG CGATAGTCGG
ACTTACCTTT ATCGCGATGG CAAATTGCAG CGCATCAGCA AAGATCACTC GTTAGTCCAG
CGCCTAATCG ATATTGGCCA ACTTGATCCT GATGATATTT ACACCCACCC CCAACGCAAC
GCCATTCTCA AATCGCTTGG CGATAGCGGC GACCCTGGCA CCGACACGTT CGAGGTGCAA
TTACAGCCTA ACGATGCGCT ATTTCTCTGC TCTGACGGCA TGTGGGAAAT GGTGCGCGAC
CCCAAAATGG CGGCACTTTT CGCTGAACAT GCCAACCCCG CCGATCTCTG CGATGCCTTG
ATTGAGGCTG GTAATGCTGG TGGCGGCGAA GATAATATCA GCGTGGTGGT GGTGCGTTTT
GATGCCCTTC CAATAGTTCA ACACTAA
 
Protein sequence
MSEPTQPMTG GTQPLNPVNQ PEEHEPFAIG TVLKDIYRVT ALLTDTPTLR VYRVALLEPW 
DHCARCGAAL QASDQFCEEC GAQVEEQTAL LQETPAAQPI GAALLDDLPD DPARAALPTV
REVFVGEDSR FAVLPDGTSL VRFDTLLSEP NTFVDQTDAV DIGIQVARAL AYLHRHGLAL
GQLTLADLAL TNKREIKLAD AGAIRRSLGK EDQLDDVEHL GLVLEKMAGI QRQTRRLDDS
NNPSPLDSAF ATILSDLRAK RITDASILAQ TLETLLAEQA TPISLRVRTG YATDVGMIRD
HNEDSVLTWD LRLNWDAKPV NVGLYVVADG MGGHEGGEVA SGLAITTTAQ TLVPTLLDPQ
LHAGPVSSKH LAELVKQAAF QANQAVYEES VRRKNDMGTT LTMAVVIGDR AIVGNVGDSR
TYLYRDGKLQ RISKDHSLVQ RLIDIGQLDP DDIYTHPQRN AILKSLGDSG DPGTDTFEVQ
LQPNDALFLC SDGMWEMVRD PKMAALFAEH ANPADLCDAL IEAGNAGGGE DNISVVVVRF
DALPIVQH
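
The gene and protein sequences above are related through translation table 11, whose codon assignments match the standard genetic code but which allows alternative start codons such as GTG. The sketch below illustrates that relationship; it is not the database's own tooling. The function name `translate_cds` is hypothetical, and the code assumes the input is a complete coding sequence whose first codon is a valid start codon (and is therefore read as methionine).

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in NCBI order with bases T, C, A, G and the third position
# varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Assumption: the first codon is a valid start codon, so it is
    emitted as 'M' even when it is GTG or TTG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCTGAGC CGACTCAACC GATGACTGGT"))
```

Applied to the first 30 bases of the gene, this yields the first ten residues of the listed protein (MSEPTQPMTG); note that the initial GTG is read as M only because it is the start codon.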