Gene Haur_3040 details


Gene Information

Locus tag: Haur_3040
Symbol:
ID: 5734912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3839583
End bp: 3840689
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 49%
IMG OID: 641280184
Product: serine/threonine protein kinase
Protein accession: YP_001545806
Protein GI: 159899559
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
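
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The function name is hypothetical and used only for illustration.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = lengths_from_coordinates(3839583, 3840689)
print(gene_len, protein_len)  # 1107 368
```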


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000020347
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAACT TAAGTAATCT TCAACTTGGG GAGTATCATC TAGCTGAGCA GATCGGCCAA 
GGCGGTATGG CGGTTGTCTA TAAGGCTGAA CATCCACAGT TTGGCACAAC CGCATTTAAA
GTCTTGCCTT CAATGCTGAT CCATGTCGGC GAATTGTTAA CCCGTTTTCT CAACGAGGCT
GACGCTGTGC GGATTTTACA TCACCCGCAT ATTGTCCAAT CGTATGAGAC CGGAGCAGTG
CCCCATCCCC AACTAGACGA GGAGGTCTAT TTTATTGCGC TCGAATACAT CGAGAATGGT
TCGTTATTGG AGCGCATGAT CGCTAGCTCG CTCGCCGTCG AAGATGTGAT CAAAATGGGC
ATCGATATTG GCTATGCCTT GGAATATGCT CATAGCAAGG GGATTATTCA CCGCGATATC
AAGCCCAGCA ATATCTTATT TCGCAACAAT GGTCAAGCCG TTTTAGCCGA TTTTGGCATC
GCCAGCACGG CCCAATATAT TCGGCTCACC AAAACCGGCA ATGTCACTGG CACAATCGCC
TACATGGCCC CAGAAATTAT GCAAGAAGTG CCAGCCTCGC CACGCTCGGA CCTCTACTCG
CTGGCCTTGG TGCTCTATGA AACCTTGACC AATTCACGGC CTTTTGGCAC CGATACAGCC
TCACCACAGT TGGTGCAAAA AATCTTGCAA GAGCGAATTC CGCCACTGCA AGATGTTATA
CCGGATATTT CACCAACAAT CGCCCACGTC ATCGAACAAG CCTTGGCCAA ACAGCCAAGC
CAGCGCCAAA CATCGGTTGG TGAATTTGTC AGCCAATTGC AACATGCGCT GCAACGCCGT
ACCCCCAGCC AATTTACCAT CCCATTGCCT GAGCCATCCG AGGATCTATT GGTCGATCAG
TTTACCAAGC CCCAACAACG TAAGCCCAAG GCTAAACCGA TCGAGGTCAA TCGACCAAAT
GCGGCTGCTT CATCAACGCT TGGCATTCAA GCCAGCAATG ATCTCGCCAG CTCGCCACGG
GCAAAATTTA CGACAACCCT CCAGTTTGTG TTAATCGCAG TTGTGACCTT CTTTTTAGTC
CTAGGTATTT TTTTCATTTT TCAATAA
 
Protein sequence
MQNLSNLQLG EYHLAEQIGQ GGMAVVYKAE HPQFGTTAFK VLPSMLIHVG ELLTRFLNEA 
DAVRILHHPH IVQSYETGAV PHPQLDEEVY FIALEYIENG SLLERMIASS LAVEDVIKMG
IDIGYALEYA HSKGIIHRDI KPSNILFRNN GQAVLADFGI ASTAQYIRLT KTGNVTGTIA
YMAPEIMQEV PASPRSDLYS LALVLYETLT NSRPFGTDTA SPQLVQKILQ ERIPPLQDVI
PDISPTIAHV IEQALAKQPS QRQTSVGEFV SQLQHALQRR TPSQFTIPLP EPSEDLLVDQ
FTKPQQRKPK AKPIEVNRPN AAASSTLGIQ ASNDLASSPR AKFTTTLQFV LIAVVTFFLV
LGIFFIFQ
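
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses only the Python standard library and the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used as input here. This is an illustration, not the pipeline that produced the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGCAAAACT TAAGTAATCT TCAACTTGGG GAGTATCATC TAGCTGAGCA GATCGGCCAA"
last_line = "CTAGGTATTT TTTTCATTTT TCAATAA"

print(translate(first_line))  # MQNLSNLQLGEYHLAEQIGQ
print(translate(last_line))   # LGIFFIFQ
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the listed protein sequence and GC content.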