Gene Haur_4721 details


Gene Information

Locus tag: Haur_4721
Symbol:
ID: 5736565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6030293
End bp: 6031981
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 50%
IMG OID: 641281886
Product: hypothetical protein
Protein accession: YP_001547480
Protein GI: 159901233
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
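
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6030293
end_bp = 6031981
gene_length_bp = 1689
protein_length_aa = 562

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a stop codon, so the
# number of translated residues is one less than the number of codons.
codons = span // 3
coding_codons = codons - 1

print("span (bp):", span)
print("leftover bases after codon division:", span % 3)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.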


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC CTGAGTTGGT GTCGTTGCCA AATGTTCCAC GCCGCCGTCG TTTCTTTCGC 
GTCGCGTGGT TTTTTGTGCG CTTAGTTGCC CATATTATCG TCTGGGATTT GTTGCTCGGA
CGGCAGTTTT GGTTTCGCTG GTACGCGGAT CGCACGCGGA TCGAGCGCTA TCGGCGTTTT
TCGCGGCGCT TCCGCGATTT GGCGATCGAT CTTGGCGGCG TGATGATTAA GCTTGGGCAA
TTTGCTTCAA CTCGCGTCGA TGTGTTGCCG CCAGCCGTGG TTGAAGATTT GATCGGCTTG
CAAGATGAAG TTTCGCCCGT ACCATTTCGC TTGATTCAAG CCACAATCGA ACACGAATTA
GGCCAACCAC TCGATCATAT TTTTAAAACT TTTGAGCGTG AACCAATCGC TGCTGCGTCG
TTTGGCCAAG TGCATTTCGC CACGCTGCAC AACGATCAGC CGATTGCCAT CAAAATTCAA
CGCCCACAAA TTGAGCAATT TGTTGAAATC GATATTGCTG CACTGCGCTG GGTTGCTAGT
TGGATGCAAT ATTATGGCCC AATTCGCCGT CGCACCGATT TACCAGCGCT GATCGAAGAG
TTTTCGCGAA TTACTCTGCG CGAACTCGAT TACCTAAGCG AAGCTGATCA CGCCGAGCGC
TTTCAGCGTA ATTTTGCTGG CAACGATCAT ATTTATGTGC CAAAAATTCA GCGCGATTAT
TCAACTGAAC GAATTTTGGT GATGGAGCGA ATCGAGGGCA TTAAAATTTC GGAATATGCG
GCGCTCGATG CAGCTGGAAT AGATCGGCTG GATTTGGCCG AAAAGCTTTA TTTGGCCTAT
TTGCAACAAT GCTTTACCGA TGGCTTTTTT CATGCCGACC CACACCCAGG CAATTTGTTT
GTGCGGCCTG TCGGCGAGCG CTTGGCGAAT GGCAAACAAC CCTTTGTAAT CACCTTCCTC
GATTTTGGCA TGGTCGATTC AATTCCCCAA AGCGTGATGG ATGGCCTCGC CACGATTGCA
GCTGGCGTGG TAATGCGCGA ACCACAACGT ATGATCGATG GCGCACGCTC GATTGGCGTG
GTCATGCCCA ATGCCAATGA TCAACAATTA CGCCAAGCCT TGGAAATTTG GTTTTCCTAT
ACCTATGGCC GCACAATTCG CGAGTTGCAA CAAATCGATG TTGAGGGTTT TGTCGGCGGA
TTAAGCGAAT TGCTCTATGA TTTGCCGTTT CAGTTGCCTC AATCATTACT CTTTTTGGGA
CGCACGGTAG GGATTATCGG TGGCGTGGCG GCTGGTTTAG CGCCCGATTT TGATATTTTC
AGCGTGACTA AACCCTTTGC CTTACGCTTT ATTCGTGAGC AAACCAGTGG CCGCGATCTG
CGCGAACGGG TAATTAACGA AGGCCGCGAA TTAATTACCG ACCTCAGCCA AATTCCGCGC
CATGCCAAAC AATTTTACGT CAAGGCTGCT CAAGGCGATT TGCAAGTGCG CACCGAAATT
GTCAAACTAG AGCGCACCAC CAAACGAATC GAGCGGGCAT TGAGCCGTCT TACCGCAGGG
ATTGCGGCAA GCGCCTTGAT CATCAGCGCG AGCATTCTCC AAGCCCAACA GATTTATAGC
CCTTGGATGT GGTGGTTGGC TGGTGGTTTG TTGATTTGGT CATTGTTGCC ACGCTTCAAT
CAAAATTAA
 
Protein sequence
MSKPELVSLP NVPRRRRFFR VAWFFVRLVA HIIVWDLLLG RQFWFRWYAD RTRIERYRRF 
SRRFRDLAID LGGVMIKLGQ FASTRVDVLP PAVVEDLIGL QDEVSPVPFR LIQATIEHEL
GQPLDHIFKT FEREPIAAAS FGQVHFATLH NDQPIAIKIQ RPQIEQFVEI DIAALRWVAS
WMQYYGPIRR RTDLPALIEE FSRITLRELD YLSEADHAER FQRNFAGNDH IYVPKIQRDY
STERILVMER IEGIKISEYA ALDAAGIDRL DLAEKLYLAY LQQCFTDGFF HADPHPGNLF
VRPVGERLAN GKQPFVITFL DFGMVDSIPQ SVMDGLATIA AGVVMREPQR MIDGARSIGV
VMPNANDQQL RQALEIWFSY TYGRTIRELQ QIDVEGFVGG LSELLYDLPF QLPQSLLFLG
RTVGIIGGVA AGLAPDFDIF SVTKPFALRF IREQTSGRDL RERVINEGRE LITDLSQIPR
HAKQFYVKAA QGDLQVRTEI VKLERTTKRI ERALSRLTAG IAASALIISA SILQAQQIYS
PWMWWLAGGL LIWSLLPRFN QN
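
Both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool. Only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Sample input: first and last lines of the gene sequence listed above.
first_gene_line = "ATGAGTAAAC CTGAGTTGGT GTCGTTGCCA AATGTTCCAC GCCGCCGTCG TTTCTTTCGC"
last_gene_line = "CAAAATTAA"

first_clean = clean_sequence(first_gene_line)
last_clean = clean_sequence(last_gene_line)

print("bases on first line:", len(first_clean))
print("starts with ATG:", first_clean.startswith("ATG"))
print("ends with TAA:", last_clean.endswith("TAA"))
print("GC fraction of first line:", round(gc_fraction(first_clean), 3))
```

Applying `clean_sequence` to the full gene and protein listings and taking `len()` of each result would give values to compare against the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence would give a value to compare against the GC content field.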