Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4042 |
Symbol | |
ID | 5735904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5160867 |
End bp | 5162036 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281193 |
Product | protein serine/threonine phosphatase |
Protein accession | YP_001546802 |
Protein GI | 159900555 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTTT TTCGTAAATT ATTCGGGCGT ACTTCAACTG AGCCAAGCAC GCTTGATCCA GCTTCTACTA CAACCGATCA GTTACCTGTT GCTGTCGCTA GCCCAGTTGC TAGCGAGGCA TTAGCCGAAA CGCCAATCGA GGTTGTTGCG GTTGCTGCAA CTGAGGTACC AACCGAGCCA TTGATCAGCG CCGAGCCACT GGCCGAATTG CCAGCAGCCC AAACTGCTGG CGCGGTGGCA ACCGAGCGTT CGCAAAAAAC GACCGCGCCA CTTGACCCTG AGCAACTGCC CGAGCGCGAT CCTGATGGCA CGAATTATCT CGGAACCCGC GATATTAGTG CTGCGCCGAT CGTTGCTAAA GCGGTCTCAA CCCGTGGGCT AGCCAGTTGG GCCGCCCGCG ATATTGGCCG CATTCGCCGT AATAATCAAG ATAGTGTTTA CACAAGCTTG ATGAGTTTGC CCGATGGCGA GCACGATATC AGCGTGGGCC TGTTTGTGGT TGCCGACGGC ATGGGTGGCC ACGAAGGCGG CGAAATCGCC TCGCGCCGTG CGATCGAAAC CGTGATGATC GCGGTGTTGG AGCAAATGGC CTTGCCAGCA ATGGCCGATG AAGATCCTGG TAACCCACTG CCCTTGCTGA TGATGAGCGC CGTGCAAGAT GCCAACACCC GCATCTGGAA CGAAGCTCAA TCGCGTGGCA CCGATATGGG CACGACCTGT ACTGCTGCCT TATTGGTTGG CGATGGCTTG TATATTGCCC ATGTTGGCGA TAGTCGCCTA TATGCCATGA GCGATGGCAA ACTCCGCCTG ATCACCGCTG ATCATTCCAC CGTGGGGCGC TTGATTGCGA TGGGTCAATT GACCGAAGAA GAAACCCGCA ATCACCCGCT GCGCAACCAA CTCTATCGCA CCGTTGGCCA ACATCCGGAG ATCCAAGTCG ATTCAATCTA CCAAAGCCTT GAGGGTATCA GCCATTTGTT ATTGTGTAGC GATGGCTTAT GGAGTATGGT TGACGACGAT GAGATGGCTG CAATCATCAA CGAAACGCCA TGGCCGCAAG ATGCCTGCCA GCGTTTGATT GCCCGCGCCA ACTTAGCCGG TGGCGAAGAT AACATTAGCG CGGTGGTTGT TTCATTGCCA CCCTTGCAAG GCCAAGGAGC GCTCCGATGA
|
Protein sequence | MGFFRKLFGR TSTEPSTLDP ASTTTDQLPV AVASPVASEA LAETPIEVVA VAATEVPTEP LISAEPLAEL PAAQTAGAVA TERSQKTTAP LDPEQLPERD PDGTNYLGTR DISAAPIVAK AVSTRGLASW AARDIGRIRR NNQDSVYTSL MSLPDGEHDI SVGLFVVADG MGGHEGGEIA SRRAIETVMI AVLEQMALPA MADEDPGNPL PLLMMSAVQD ANTRIWNEAQ SRGTDMGTTC TAALLVGDGL YIAHVGDSRL YAMSDGKLRL ITADHSTVGR LIAMGQLTEE ETRNHPLRNQ LYRTVGQHPE IQVDSIYQSL EGISHLLLCS DGLWSMVDDD EMAAIINETP WPQDACQRLI ARANLAGGED NISAVVVSLP PLQGQGALR
|
| |