Gene Arth_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2135 
Symbol 
ID4445212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2405863 
End bp2407560 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content65% 
IMG OID639689943 
Productputative signal transduction histidine kinase 
Protein accessionYP_831615 
Protein GI116670682 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0347228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGCCG ATGGCGACAG TCCCGAACGG TACACGGCTG CGAGCGGTAC AAGGATTGAA 
GACCTGCTCA AGGATTTCGT CGCCCGGGCC GGCGAACTCC TTCAGTTCCA AGAGCGCATG
GGCGGCCTGC TTGAAGCCGT CGTGGCGGTT GCGGAGGATT TGAGCCTCGA CGCCGTCCTG
GAGCGCGTCG TTCAGTCCGC CTGCCAGTTG CTGCGCGCCC GCTACGGCGC GTTGGGCGTC
ATCGGCGATG ACCGCGCCCT CAGCCACTTC ATCACGGTCG GGATCGACGG CGAACTGGCC
AAACGGATCG GCCCCCTCCC TACCGGTCAT GGGGTCTTGG GATTGTTGAT CTCCGATCCG
CGGCCGCTGC GGCTCCCCGA CCTGCGGAGC CATCCCGAGG CGTACGGCTT TCCCGAGCAT
CACCCGCCCA TGCAGTCCTT CCTTGGCGTT CCCGTCCGGG TACGGGACGT TGTGTTCGGA
AACCTGTATC TGACGGAGAA GGAGGGCGGC GGCGATTTTA CGGTCGAGGA CGAGGAGCTG
GCCGTAGCCC TGGCTGCCGC TGCCGGTGTC GCCATCGAGA ATGCACGGCT TTATGATGAC
GCCCGCCGGC GCGCACAATG GCTTGAGGCC TGCATGGATG TCTCCGGGCT GATGCTGGGG
ACCGAACCGT CGTCGTCTGC CGGCCTTGAT CCCATTGCCG GCAGGGCGCT GCGGGAATCC
GGGTCCCGGC TGGCCCTGAT AGTGGAACCC GCCGCGGACG GCGTGGGATA TGTCGTGGCC
GGGGCCGACG GTGACGACGC GGAGTTGTTC GCCGGCCTGA CGCTGTACCT GGATTCGGAA
GTTCTCCAGG GGGTGCTTGC CGGCGGGGAC CCGCTACTCG TGGACAAGGC CGCCGACGTG
CTGGGGGTGC TGGAGGGGAC CGTGGCCGGT TCGCTCCTCG CAGTGGCGCT CAGCACCCAG
GGCGCACATC ATGGCCTGCT CCTCCTGGTC CGGGACGCCA GCGAGGGTCC CTACGGCCGG
ATTGATATGG AGATGGGAGC CGTTTTCGGG TCCCACGTGG CGTTGGCGCT TGAACTGGCC
CGGGTCCACC GGCTGCGGGA AGAGCTGCTG GTCTTCACTG ACCGCGACCG GATTGCCCGT
GACCTCCATG ACCTCGTGAT CCAGCGGCTC TTCGCAGCAG GCCTGAGCGT CCAGAGCCTG
AACCGGTTCA CGAAGGAAGA CCTTGCACTG GAGAGGATTC GTGCCATCAC CGGTGAACTG
GATGAGGCCA TCCGCAGCCT GCGGGACACC ATCTACTCGC TCAAGACCGG CAACAGCGAT
GCCGAGCCCC TCAGCGGGAG GCTGCGGAGT GTCGCGCGGA GCGCTGCAAA GTCCATGCCC
TTTGCGCCGG CGCTTAGCCT GGAAGGCCCG GTTGACTCAG TCCAACCGGA CAAGGCAGAC
CATGTGGTGG CCGTTGTTTC AGAGGGACTG AGCAACGCCA TCCGGCATTC GGGAGCTGAT
TCGATCGAGG TTGCCGTCTC CGCCATGAAT GGCAGGATGA CCGTCCTGGT GACCGACAAC
GGCAGCGGGT TCAAAGATTC GGCAAAGCGC AACGGACTGA ACAACATGGA AGAGCGCGCG
AGGATGCTGA ACGGCACCTG CACCATCACC GGCGCCCCGG ACACCGGAAC CAGTCTGGTG
TGGTCGGTTC CGCTCTAG
 
Protein sequence
MHADGDSPER YTAASGTRIE DLLKDFVARA GELLQFQERM GGLLEAVVAV AEDLSLDAVL 
ERVVQSACQL LRARYGALGV IGDDRALSHF ITVGIDGELA KRIGPLPTGH GVLGLLISDP
RPLRLPDLRS HPEAYGFPEH HPPMQSFLGV PVRVRDVVFG NLYLTEKEGG GDFTVEDEEL
AVALAAAAGV AIENARLYDD ARRRAQWLEA CMDVSGLMLG TEPSSSAGLD PIAGRALRES
GSRLALIVEP AADGVGYVVA GADGDDAELF AGLTLYLDSE VLQGVLAGGD PLLVDKAADV
LGVLEGTVAG SLLAVALSTQ GAHHGLLLLV RDASEGPYGR IDMEMGAVFG SHVALALELA
RVHRLREELL VFTDRDRIAR DLHDLVIQRL FAAGLSVQSL NRFTKEDLAL ERIRAITGEL
DEAIRSLRDT IYSLKTGNSD AEPLSGRLRS VARSAAKSMP FAPALSLEGP VDSVQPDKAD
HVVAVVSEGL SNAIRHSGAD SIEVAVSAMN GRMTVLVTDN GSGFKDSAKR NGLNNMEERA
RMLNGTCTIT GAPDTGTSLV WSVPL