Gene Haur_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0194 
Symbol 
ID5732040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp226580 
End bp228307 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content52% 
IMG OID641277318 
Productexopolysaccharide tyrosine-protein kinase 
Protein accessionYP_001542974 
Protein GI159896727 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0489] ATPases involved in chromosome partitioning 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.27748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCA TCCGTCGTTA TATAACTGGT CTCCGTCGCT GGCTCTGGCT TTTAGTGCTA 
GGGCCAGTTG TCGCTGCTGG CGCGGCTTAT GGCATTAGCA GCCAACAAAC ACCGCGTTAT
GCCAGCAGCA CTCGTGTGAT TGTCGGCCAA ACCCTCAAAA ATAGTAATCC CGATTATGGC
AGTTTGGTGG CAAGTGAGCG CTTGGTAGCA ACCTATGCCC AAATCGCCCA AAGCCGCACA
ACCATGCAGG CAGTTGAGCA GCGCTTAAAT TTGAGCGATA TGGCAAGTTC AGCGATCATC
ACTACCCGCC CAGTCCAAGA AACTGAGTTT TTGGATATTG CGGTTGAGGC CAATGATCCG
CAACAGGCTG CCGATATTGC CAATGCAATT GCTGACCAAT TGATTTTGAC TAGTCCGGCA
GGGCCACAGA GCGCTGAAGC CAAATTGCTT GATGAAGTCA ATCGCCAAAT TGCTACGCTC
AACGAGGAAA TTACCCGTAC CGATGAGGAA ATTAAAACCC TCAAGGCCGA AATTGAGCAA
ATTGGGGCCG ATAAACCTGC TGCTGAATCG TTGATTGCGA ACTTGCAACT CAAACAACAG
AGCCAAAATC AAAATCGCCA AACCCTCAGC ACGCTCTACA GCACGGCGCT GGGCAATCGC
GCCAACTCGA TCAGCGTGGT CGAGGCTGCC ACGGTCAATC CAACCCCAAT TGCACCACGG
CCAATTCGCA GCGCAATTTT GGCGGGGATT TTGGGCTTTG CCTTGGTATT TGGTTTGGCC
TTGTTGATCG AATATTTTGA TGATAGCGTG CAAACGCCCG ATGAAGGCGT TGATCTCGTT
AATGCGCCGT TGCTGGCAGC GATTGTCAAG CAAGAAACTA AGGTGACCAA AGCCTCGCAA
CGCTTGGTTT CGCGACTTGA TCCGCGTTCA CCAACCGCTG AAACCTTTCG CACCTTGCGC
ACGAATTTGC AGTTTTCGAA TGTTGATACC AAAGCTCGCA CATTGATTGT CACCAGCAGC
CAGCCTGAAG AAGGCAAAAG CACAGTCGCT GCTAACTTGG CATGGGTGCT GGCGCAAGCA
GGCCAAAAAG TCGTCTTGAT CGATGCTGAT TTGCGCAAGC CCATGATGCA CCGCGTGTTT
GAGGTGAGCA GCGAATATGG CCTGACCAAT TTGCTGACCA ATAATGAAGA CCCAACGATC
CGTGAGCGCA CGGTGCTATC GGTTGCCGAA AATTTGTGGC TCATTCCTAG CGGGCCTTTG
CCTCCTAACC CCTCGGAATT GCTCAGCAGC AAACGCATGG AAATGCTGAT TTGGCTGTTG
CAGCAAGAAT ACGATTGGAT TTTGTTCGAT ACACCGCCAA TTTTGACCGT AACCGACCCA
ATCGCACTGA TTCCACGGGT TGATGGTGTA GTGTTGGTGG CCGAGGCCAA GCGCACCCGC
CGCGATATGC TGGCAAAATG TCGGGCTGCG GTGCAAACCG TCGGCGGGCG GGTGATTGGC
TTGGTCTTTA ACAAGCTTGA TCCGCGCTCC GAGGGCTATG GCGTTTACTA TACCTACTAC
TACGATCAAC ACCATACTTC CAATCGTGGT CGCCGCTTTT GGAATCGCAA AGATGATCAT
CAGCCAGTGC CGAGTATGAG CGAGCCAGCC CCGTTGGATC TGCATGATCC TGCGCTTGAT
CGTTCGGAAG CCGCCTATGA GATGGCCAGC CATGAGCGCA GCAAGTAA
 
Protein sequence
MNIIRRYITG LRRWLWLLVL GPVVAAGAAY GISSQQTPRY ASSTRVIVGQ TLKNSNPDYG 
SLVASERLVA TYAQIAQSRT TMQAVEQRLN LSDMASSAII TTRPVQETEF LDIAVEANDP
QQAADIANAI ADQLILTSPA GPQSAEAKLL DEVNRQIATL NEEITRTDEE IKTLKAEIEQ
IGADKPAAES LIANLQLKQQ SQNQNRQTLS TLYSTALGNR ANSISVVEAA TVNPTPIAPR
PIRSAILAGI LGFALVFGLA LLIEYFDDSV QTPDEGVDLV NAPLLAAIVK QETKVTKASQ
RLVSRLDPRS PTAETFRTLR TNLQFSNVDT KARTLIVTSS QPEEGKSTVA ANLAWVLAQA
GQKVVLIDAD LRKPMMHRVF EVSSEYGLTN LLTNNEDPTI RERTVLSVAE NLWLIPSGPL
PPNPSELLSS KRMEMLIWLL QQEYDWILFD TPPILTVTDP IALIPRVDGV VLVAEAKRTR
RDMLAKCRAA VQTVGGRVIG LVFNKLDPRS EGYGVYYTYY YDQHHTSNRG RRFWNRKDDH
QPVPSMSEPA PLDLHDPALD RSEAAYEMAS HERSK