Gene Arth_0116 details

Gene Information

Locus tag: Arth_0116
Symbol:
ID: 4447414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 118906
End bp: 120606
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 64%
IMG OID: 639687911
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_829617
Protein GI: 116668684
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
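
The coordinates and lengths listed above are mutually consistent, which the short Python sketch below checks. The variable names are illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 118906
end_bp = 120606

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1701 566
```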


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGCGA CCCATGTCAC CCTACAATGT ACTCAACCGG GGACCGAAGG GGGCAGTGTG 
GTAATTCAGA CGGCGGAGAA TTCACTCGTA CCACGCACGG CCATCGTGGC GGATCCGGAC
AGCGGACGCC TGTCCGCCAC GGCCGGGGTC CTGGTAGCCG CGGGGTTCGA GGTCTCCACG
GCCTCAAGCC ACGAAATTTT GACCGGGCTC CTGGAGCAGC ACAAGCCCAG CATCATCGTC
GCGGACCATT CGTTTGGGGG CCCGCCCGTG GATGCACCAC TGCTTCTCCT CGTCGATCTG
GCCGACGAAT CCGCGGCGGA AAGCCTCCGG ACGCTCGGCA TGGCGGACTA CATCGCGAAG
CCTGCCGCGC CGGACGAGCT GGTCCACAGG GCGGACGTCC TGATCAGCCG CGCCGCACAA
CGCTCCGACG CCCGGCAAAG CGCCGAACGG CTGCGTGAAA AGCTCCGCCT GGTCTCCTCC
AACATCCGTG CCACGCACGA TCCCCGCCTG ATTTCGGATT GCCTGGTGAC GGGATTCGGG
GAGACGTTCG GTGCCGACCG CGTACTGTTC ACCACGTTCG ACGACGACCG CGTGCCCCGG
ATTTCCGCCG AGTGGCACCG TCCCGGCCTC CCCCCGGTCC CGGACGGACT GGGGCTCCAA
GAGTCCTCCG CGCACCGCGT AGCGGACCGG CTGTGGTCGG AGGCCGAAGT TCTTGCGGTG
GAGGACCACC ACGTCCACGA ATGGTCCCCT GAGGACCAGG AACTGGCGGC CTGGTCAGAA
GATATGGGCC CGTGGGCGTC CGCATTCGTT CCCGTGGGCG AGGGAAAGTC ATCGCTGGGC
GTCATATGGA TTGCGCAGCT GGACGAGCCG CGGGTCTGGA CACGCACCGA AATCTCCCTG
ATCCAGCACG TTGCCGGCAA CGTGGCGTAC GGGCTCATCC AAAGCCATCT GATGAGCGCC
CAGCAACAGG TGGTCAAGCA ACTGCGGCAG CTTGACCAGG CGAAGACCGA CTTCCTGGCC
ACCGTCAACC ATGAGCTCCG GACCCCGCTG ACGTCCATCT CCGCCTACCT GGACATGATC
CAGGACGGTG TGGGGGGGCC CGTGCCCCCC GAAGTGAGCC GGATGCTGGA CATCATTGTC
CGTAACTCAG AGCGGCTCCG CCGGCTGATC GAGGATATGC TCACCGTTTC CCGCCAGGAC
TACGACGGCG CCAACCTGCA CCTGGGTCCG GTCCAGCTGG GGCACACCCT GCAGATCGTG
ACGGTGGCGC TGCGTCCGCT GGCGGAACTG GGTGACGTTT CCATCTCACT GGAGCTCTGC
GACGGCGACC CCGCGATTAT CGCCGATGAA GTCCAGCTGG AGCAGGTGTT TACCAACCTT
GTTTCAAATG CCATCAAATT CACGCCACGC GGAGGCAGGA TTGTTGTCAG CTGCCGTTTT
CAGGCAATGA CGGACGGGGA GCCGGGGGTA AATGTTCATG TCCGCGACAC GGGGGTCGGA
ATCCCGGAGG AAGAAATCCC GCATCTTTTT ACCCGGTTTT TCCGTGCCTC CAATGCCACT
TCCACTGCCG TGCCGGGCAG CGGCCTGGGG CTTGCCATTG CGCACGACAT CGTGAAGTCG
CACAGGGGGT ATTTGGCGGT CAGCTCCGAG CTCGGCGCAG GCACCACCAT CACTGTTCAG
CTGCCCGTTT CCGGCCCGTA A
 
Protein sequence
MRATHVTLQC TQPGTEGGSV VIQTAENSLV PRTAIVADPD SGRLSATAGV LVAAGFEVST 
ASSHEILTGL LEQHKPSIIV ADHSFGGPPV DAPLLLLVDL ADESAAESLR TLGMADYIAK
PAAPDELVHR ADVLISRAAQ RSDARQSAER LREKLRLVSS NIRATHDPRL ISDCLVTGFG
ETFGADRVLF TTFDDDRVPR ISAEWHRPGL PPVPDGLGLQ ESSAHRVADR LWSEAEVLAV
EDHHVHEWSP EDQELAAWSE DMGPWASAFV PVGEGKSSLG VIWIAQLDEP RVWTRTEISL
IQHVAGNVAY GLIQSHLMSA QQQVVKQLRQ LDQAKTDFLA TVNHELRTPL TSISAYLDMI
QDGVGGPVPP EVSRMLDIIV RNSERLRRLI EDMLTVSRQD YDGANLHLGP VQLGHTLQIV
TVALRPLAEL GDVSISLELC DGDPAIIADE VQLEQVFTNL VSNAIKFTPR GGRIVVSCRF
QAMTDGEPGV NVHVRDTGVG IPEEEIPHLF TRFFRASNAT STAVPGSGLG LAIAHDIVKS
HRGYLAVSSE LGAGTTITVQ LPVSGP
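
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch, assuming only the Python standard library, the code below shows one way to strip that spacing, estimate GC content, and translate the coding sequence. The function names (`clean`, `gc_content`, `translate`) are hypothetical helpers, not part of any database tool. The codon table uses the standard amino-acid assignments, which translation table 11 shares. The sketch ignores the table 11 rule that an alternative start codon is read as methionine, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGCGTGCGA CCCATGTCAC CCTACAATGT ACTCAACCGG GGACCGAAGG GGGCAGTGTG"
print(translate(first_line))  # MRATHVTLQCTQPGTEGGSV
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_content` can be compared against the 64% reported in the Gene Information section.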