Gene Haur_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3720 
Symbol 
ID5735584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4679863 
End bp4681773 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content52% 
IMG OID641280872 
Productserine/threonine protein kinase 
Protein accessionYP_001546484 
Protein GI159900237 
COG category[K] Transcription
[L] Replication, recombination and repair
[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain
[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00169428 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCCC CTGCATCTCG GGCTAAAACA ACTGCGACAC CGCGCCGCGA CGATGATCAT 
GAACTGCGCA ATTATCGCTT GGAAGAGCGG ATAGGCCAGG AAGAGTTAAC GACCATCTAT
CGCGCTCGCC ACCTAACCCT TGATCGCCCG GTTGAGGTGC ATGTGTTACG CCGTTCAGAT
TGGGTTTCGG TCAGCCGATT TGGGCAGGCC GCCCGTTTGG CTGCTCGGCT CAAACATCCA
ACAATTGTGC CAGTGATTGA TGCTGGCCAC GATGATCGGT TTGGTTACTA TTTGGTTACG
CCGCCGCTCG GTGGGCGCTT GTTGCAAGAG GTGCTTGAAG ATGGGGCGCT CTCAATTGGC
GATGCTGTGC GTATTTTCAG CGATATTGCC AAAGCCCTCG ACACAATTCA CGCCGAAAAA
ATCGTGCATC GCGATATTCA GCCAGGTACA ATTGTCGTTA CCGAGGCGAT CCATGATCAA
GCAGTTACTC GCCATGGCTT TTTGACCAAC TTTAGCTTGG CTTGGAGCAG CGATGGGCCA
GATCTTTCGC AACTTGAAGA GGCTGATTAT TTGACGGCTT ACGCTCCACC AGAGCAAGAT
TTTAAGCAAA ATGCCACCGA ACCAAGCCTT GATATTTATG CCTTGGGGGC AGTGCTTTAC
CATATGTTAA CGGGTGAAGT GCCACCAGCC CCCGGCCAAA CCCCCAAATC GTTAGGCGAT
TTCAATGTAG CCTTAGCTCC AGCTGATCGA GTGTTACGGC GTTTGCTCTC ACCTCAAGCC
TCAGTGCGTT ATAGTTCGGC TAATCAGGCT GCGGCAGCCT TACGTCAAGC CTTGCGCGAA
GCCTTACCCG CTGAAACCAG CACGACCCTT GCGCCAATCG CCGCCAGCAG CGTGGAAAGC
GAATGGCTTG AAAATCCAGT TGAAACAGTG TTGGTAGGGA TTCTCGATGG CGATTTTGTG
CAGCGTGGCC GCGAGTGGGG TCGCCAATTG CATGAGCCAA CTAATTTGCG CAGTGTACTC
AATCGCTGGA GCGGCCAAAA TATGCTGCGG CGGCGCGATT TGGGCAACGC GCTTGTGCCC
GAACGGGTGG TCAGCTATAA CTTTTATACC TACGAACTAC GGGCCTATTA TGAAACCCGC
ACCACGCCAG AGCCACGCGA AAAGCCCTAT CAAGGCAGTC GCATCAGCAC TAGCCAAAAT
CCACCTGGAG TTTGGCAAGT GATTTTGCCT GATCCTCAGC CATTTGACGA AATTCGCCCA
ACTGAGATGA TTATTCCCAA CTCCGAGCGG GTTGAGATGT GTATTTATTG TGGTGGCAAG
GGCGATTTAC ACTGTACCAA ATGTCATGGC CGTGGCTTGC TCGAAACCAA ACGGGTGCAA
ACCAATCCCG ATGGAACCAA AGAGCGCCGC ACGGTCACGC TCGATTGCCC TGAATGCGAA
GGTGAGGGCC AAGCCGATTG TGGGCGTTGC CAAGGTTCGG GCCAAGTTTT GACCGAAGAT
GTCTTTTATT GGTCGCGTTG GGGCAAACTT TGGGAAAACA CCGATGACGA AGCAGGCTTG
CCGCTAGCCG ATATTCGCGA AAAATCGCAG CAAGTCTATA CTGCCCAAAT TGATGTGCGC
GATACCAAAT GGCATGCGAT TGCGCCATTG CATGAATTAT TGCAAGCTGC CGAAAATGTC
GATGCTGATC AGCAAACCCG CTTGTTGCAT GCCGAATTAA CCATTCATGG CACGCCCGTA
ACCGAAGTCG ATTATACCGA GCGCAATCAA GCCCATACCT TATATATGAT TGGCTTTGAG
CCAACGATTA TTCGTGGCAA CTTTACCTTG TTTGATCGCG AACGCATTGT GTTATATAGC
GTGATTGGGG TTTTAGTGTT GATTGCAGTT ATTGGATTTT TGATCTTTTA A
 
Protein sequence
MSAPASRAKT TATPRRDDDH ELRNYRLEER IGQEELTTIY RARHLTLDRP VEVHVLRRSD 
WVSVSRFGQA ARLAARLKHP TIVPVIDAGH DDRFGYYLVT PPLGGRLLQE VLEDGALSIG
DAVRIFSDIA KALDTIHAEK IVHRDIQPGT IVVTEAIHDQ AVTRHGFLTN FSLAWSSDGP
DLSQLEEADY LTAYAPPEQD FKQNATEPSL DIYALGAVLY HMLTGEVPPA PGQTPKSLGD
FNVALAPADR VLRRLLSPQA SVRYSSANQA AAALRQALRE ALPAETSTTL APIAASSVES
EWLENPVETV LVGILDGDFV QRGREWGRQL HEPTNLRSVL NRWSGQNMLR RRDLGNALVP
ERVVSYNFYT YELRAYYETR TTPEPREKPY QGSRISTSQN PPGVWQVILP DPQPFDEIRP
TEMIIPNSER VEMCIYCGGK GDLHCTKCHG RGLLETKRVQ TNPDGTKERR TVTLDCPECE
GEGQADCGRC QGSGQVLTED VFYWSRWGKL WENTDDEAGL PLADIREKSQ QVYTAQIDVR
DTKWHAIAPL HELLQAAENV DADQQTRLLH AELTIHGTPV TEVDYTERNQ AHTLYMIGFE
PTIIRGNFTL FDRERIVLYS VIGVLVLIAV IGFLIF