Gene Arth_3051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3051 
Symbol 
ID4444284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3420738 
End bp3422318 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content71% 
IMG OID639690877 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_832530 
Protein GI116671597 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.252815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCAG TTGAATCCGC CCGTGGCGGC GCCGCCCGAT CCGGACCCGG GCGGTCGGGG 
ACGGCACGCC CGGAGCTCCA GGGCATCGGG CGGAGCCGCT TTATCGTCGG CATGGTGCTT
GCCGTCGTCC TGCCGCCCGC AGTGGAAGTA CTAGTGACCC TGGCGGGCTA CCGGAACTTC
GCAATCATCA TGCTGCTCCA TCTCGCCGTC GCTGTGGCGG TGGCTGCCAT CGGGGGATTC
TGGCCGGCCG TCGTGGCGGC TGTCCTGGGC ACAACGCTCC TCAACTACTT CTCAGCCGAT
CCCGTGGGCA CGCTGTCCAT CGCCGATCCC TCCACCCTGT TCACCCTGCT CGTGTTCCTG
GCTGTGGCCT GCAGCGTGGC ACTGGCCGTG GGCCTGGCCA CCCGCCGGGC GCAGGAAGCC
GCCAGGTCCG GGGCCGAAGC CACGGCCTTG AGCGAGCTCT CCCTGCGGAT CCTCAGTTCC
GACGGCAGCC TGGAGACGTT CCTCGAGAAG GTCCGCAGCA GGTTGGGAGT CGAAGCCGTA
ACCCTGGTGG CCGGCAGCTC ACCTGGCAGC CCGCACGCTG CGGGATCTGC TGCCGGAAGC
GGTCCGGGGC GGGCCCCCGG CAGCAACCCC GGGTGGGTCG CGCTGGCGAG TGCCGGGACG
AGCGCGCCGG TGACGCACTC GGCCGCCGAC CACGCCGTCG TCGTCGATTC CCGCTACACG
CTGCTGATCA ACGGCGGACC GCCCGCCGGT CAGCCGTTTT CCGGCCAGCA CCAGCGCATG
CTGGCCGCTT TCGGGGCGTT CCTGGTGGCG ATCCTGGAGC GGCGCCAATT GGCGGCGAGC
ATGGAGGACA ACCAGCGGCT TTCCGAGGGC AACAAGATGC GCACGTCCAT CCTGAGGGCC
GTCAGCCATG ACCTGCGCAC CCCGCTGGCC GGGATCAAGC TGGCCGTCAG CAGCCTGCGC
CAGGAGGATG TGCGGTTCTC CCCGGAGGAC GAACGCGAAC TGCTCGCCAC CATCGAGGAC
TCGGCCGACC GGCTGGACCA CCTGATCGGC AACCTGCTGG ACATGTCCCG GATCACGGCT
GACTCGGTCA ACCCGCTCCT GCGCGGGCTG GGCTGGGCGG ACGTGCTGCC CGATGCGCTC
AAAGGGCTGC CCGCGGCGCG GATCCGCGTG GAACTGCCGC CCAACCTGCC CCGCGTGGAG
GCCGACGCCG GGATGCTGGA GCGCGTGGTG GCGAACCTGG TGGAGAACGC ACTCAAATAC
GCGCGCGAAG CCGATGTGGT GCTGACAGCG CGGGCGGGGG AGGGAATCGC GCTGGCCGGC
CGGCCTGCCA GCGAATTCCG CGTGGTGGAC CATGGCTCCG GGGTTGCCCC GGCGGCGGTG
CTGGACATGT TTCAGCCGTT CCAGCGGCTC AACGATTCCC AGCGCACCGG CGGCGGCCGC
ACCGTGGGGA TCGGGCTGGG CCTGGCCGTT GCCAACGGCT TCACCGAAGC CATGGGAGGA
ACCCTTGCGG CCGAGCCGAC GCCGGGCGGC GGGCTGACGA TGGTGGTCAC CCTGCCGCTG
TGGGAGGGAC CGCTGCCGTG A
 
Protein sequence
MAAVESARGG AARSGPGRSG TARPELQGIG RSRFIVGMVL AVVLPPAVEV LVTLAGYRNF 
AIIMLLHLAV AVAVAAIGGF WPAVVAAVLG TTLLNYFSAD PVGTLSIADP STLFTLLVFL
AVACSVALAV GLATRRAQEA ARSGAEATAL SELSLRILSS DGSLETFLEK VRSRLGVEAV
TLVAGSSPGS PHAAGSAAGS GPGRAPGSNP GWVALASAGT SAPVTHSAAD HAVVVDSRYT
LLINGGPPAG QPFSGQHQRM LAAFGAFLVA ILERRQLAAS MEDNQRLSEG NKMRTSILRA
VSHDLRTPLA GIKLAVSSLR QEDVRFSPED ERELLATIED SADRLDHLIG NLLDMSRITA
DSVNPLLRGL GWADVLPDAL KGLPAARIRV ELPPNLPRVE ADAGMLERVV ANLVENALKY
AREADVVLTA RAGEGIALAG RPASEFRVVD HGSGVAPAAV LDMFQPFQRL NDSQRTGGGR
TVGIGLGLAV ANGFTEAMGG TLAAEPTPGG GLTMVVTLPL WEGPLP