Gene Arth_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0866 
Symbol 
ID4446634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp938625 
End bp940292 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content64% 
IMG OID639688673 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_830364 
Protein GI116669431 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.917443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCA ACGTCGCCTG GGACATCAAG TCCGGCCGGA TTCTCTTCTT CCGACGGCCG 
TTCCACGAGT ATTCGCTCCG GGGACGGGTC ACCCTGAGTC AGATGCCGTT GTCCATCACG
GTGGCGCTCA CGGCCGTCCT CATGCTGTTG TTCTTCCCGG GGACGCTGGG GAATCCCCTG
TTCATCGCGT TCCTGGCCAC GCAGGTCCTG ATCCTGGCGC TGTGCTTCTT CATTCCCTGG
CATCGGCTGC CGTTCGCCAG CTTCCTTGTA ATTCCCCTCC TTGACTTCGT CTCGATCGCG
CTCGGCAGGG AGGGCGGGCA GGAGGGCCTG ACCGGCATCA GCCTGTTGGC GGTTTTTCCC
GTCATCTGGC TGTGTGCTTC GGGGCTGTAC CCCCGCACCG CCCTGCTGGT CTCCTTCCTC
GCTCCCCTCT CCATCATCTG GGTGCCGTTG TTCCTTCACG GCAACGTCAC CGGACAGGAC
CTGGCGCGCT CCCTGCTGCT GCCGGTGATG ATGCTGGGGA TCGGCGTTTC CGTGAGCGTG
CTGACGCTCA GCATGGTCCG GCAACAGCGA GACCTTGAGG AGAAGGATGC CCAGCTGCGG
GTCACCCTCA AAGAGAGCAA GCGGCAGGAG GAACTGCTCA ATTCGGTGCT GGAAACGGTG
CACATGGGCG TCCTGGCGGT GGACGCGGAC GGGCACGACA TCCTCATGAA CCGGAAGCAA
CGGGAAAACC ACCGGCTTGC CACACCTGCG GGAAACGACG ACCCCAACGA ATCCCAGCTC
CTGGTGTTCC AAGCCGACCG GACCACAACC GTCCCGGTCG AGCAACGGCC TGTCCGTCGC
GCCGTGATGG GCGAGAGCTT CACGGACAAC CTGGTGTGGC TGGGCAGCGG CAGCGAACAG
CGCGCCATCA CCACCAGCGC CCGCGCCATG CGGGACCAGG ACGGCGCCTT CGCCGGCTCG
GTGATCGTCT TCAGCGATGT CACGGACCTG GTTAATGCCC TGGCGGCCAA AGACGACTTC
GTGGCGAACG TTTCCCACGA GTTCCGCACG CCGCTGACCT CAATCCTCGG TTACGTGGAA
CTGCTGCTGG ACGACCATGA CGGCCTGCGC GCTCCCCACC GCGAAGCGCT CATGATCATC
CGGCGAAACG CTGAGCGCCT GCTCGGCCTG GTGTCGGACC TGCTGGCAAG CCGGAGCGGG
CAGCTTATCG TCAGCCCGCA GGCTGTGAAC GTGGCTGAAC TTGTCCGGGC CAGCATCAGC
GCCGCGCTCC CCCGTGCCGC CGCCGCAAAG GTGCGGCTCC GTGCGGACGC TCCTGAACGG
CTGGAGGCCC ACGTGGATTC CGGCCGCATG TCGCAGGTAC TGGATAACCT CGTGTCCAAC
GCCATCAAGT ACTCGCCGAA CGGCGGTGAC GTTGTGGTGT CGCTGGACTC GGACGGGTCC
CACCTGGTCT GCCGGGTGAG CGATACCGGC ATGGGCATGA GCGACAAGGA CCAAGCGGAG
GTCTTTACCA AGTTCTTCCG TACCGGCAAT GTGCGGAACA CTGCCATCCC GGGCGTAGGG
CTGGGCCTAT CCATCAGCAA GGCCATTGTT GAAGCACACG GCGGAAGCAT CCAGCTTCGG
AGCGCACTCG GAGAGGGCAC CACATTCACC GTCAGGGTGC CTGTCTAA
 
Protein sequence
MSGNVAWDIK SGRILFFRRP FHEYSLRGRV TLSQMPLSIT VALTAVLMLL FFPGTLGNPL 
FIAFLATQVL ILALCFFIPW HRLPFASFLV IPLLDFVSIA LGREGGQEGL TGISLLAVFP
VIWLCASGLY PRTALLVSFL APLSIIWVPL FLHGNVTGQD LARSLLLPVM MLGIGVSVSV
LTLSMVRQQR DLEEKDAQLR VTLKESKRQE ELLNSVLETV HMGVLAVDAD GHDILMNRKQ
RENHRLATPA GNDDPNESQL LVFQADRTTT VPVEQRPVRR AVMGESFTDN LVWLGSGSEQ
RAITTSARAM RDQDGAFAGS VIVFSDVTDL VNALAAKDDF VANVSHEFRT PLTSILGYVE
LLLDDHDGLR APHREALMII RRNAERLLGL VSDLLASRSG QLIVSPQAVN VAELVRASIS
AALPRAAAAK VRLRADAPER LEAHVDSGRM SQVLDNLVSN AIKYSPNGGD VVVSLDSDGS
HLVCRVSDTG MGMSDKDQAE VFTKFFRTGN VRNTAIPGVG LGLSISKAIV EAHGGSIQLR
SALGEGTTFT VRVPV