Gene Arth_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1002 
Symbol 
ID4446507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1081292 
End bp1082770 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content55% 
IMG OID639688808 
Productserine/threonine protein kinase 
Protein accessionYP_830499 
Protein GI116669566 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACGG GTTCGACAGT TGTTTTAGAT AACGGGTCTT GGACATTGTC GGATGAACTG 
GGACATGGTG GTTTTGGGGA GGTCTACCGT GGCCATCAGG GCGCGGTAGA GGCGGCAATT
AAGTTCATAC CTAAAAGTAA AGGTGTTCCA AGAGAGATAT TGCTGGACAT CCCCAAAAAC
GCTCGGAATG TCATCCCCAT CACGGGTACT GGCGAAGACG CAGATAACTG GATCATTTCA
ATGCCGGTTG CGGACCATTC GCTTGAAAAG ATGCTTAACG CCCACGGCGG CAAGCTTCCT
GAGGATCTAG CCGTCATGGT CCTAACCCAT ATTGCCGAGG CGCTTGCCAG CCTCGACGGA
AGTATCGTCC ACCGGGACAT CAAGCCGGGC AACATATTAC TTTTCAACGC CAAGTGGTGT
CTCACCGACT TTGGCATTGC CCGCTATGCT GCAGCAGCCA CGGGAAGCTT GACTCACAAA
GGGTACGGAA CTGCAGCGTA CGTTGCTCCC GAACTGTGGC TCGGTCAAAG TGCAACAAGC
CAGAGCGACA TCTACGCCCT AGGCATTGTG GCTTACGAGA TCATCACGGG GAGTCTTCCG
TTCCAAGGAA CAGAAGCCGA AATCGCGCAC GGCCACTTGA ACGTTATTCC GCCTTCCACT
GGTGCCCCGG CTCGGCTTGA CTGGGTGATC CTGGACAGTC TGAGTAAACC GCCATCACTT
AGGCCCACTG CTGAACAATT CAAGGTAAAG CTCAGCCAGA GGGCTGCCGT TTTCAATTCA
AAGGCCGCTA TGGCAATGGG GCAAGCTAAC CACGAGTTGA GGAGCTTGGA GGAACAGGCG
GAGCAACGGT TGCGCCAGGC GGTAGCCGAA GCGGAACTAC GACAGCATCA CGTAGACCGC
GCCGGGAAGT TGCTTTCCCG CATTGGGGAG GAAGTTCTGA CTACCCTCCA AGGCTTTGCT
GATCGTGTGC AGTCCCAGCC ACAAAAAGAT GGGGGCGGAA AGTTGACCTT CCATAAAGCC
TCCTTGATGA TCTCGCCCAT CATTCCGAAT ACCAACGGCC ATCTCATGGC GCAGGAAGAG
GACCCCTTCG TTGTGCTGGC TACTGCCCAC ATCACTCTGT GGCAGTTTTC CGGGGTTAAT
GGCTATCCCG GACGTTCGCA TGCGCTGTGG TATGCGGACG CAAAGGAAGA GGGCAATTTC
CAGTGGTATG AGACCGCCTT CATACAAAAT GGCGGGATGC AGCCCACTCA GCGCTTTCGG
CCGTTTGCCG CAGAGTTCGA GTCAAGGGAA GCCACGGCCG CACTCAGGGG AGAGGGCAAT
TTCCTCGTTG CTTGGCCGTT TGCACCCCTT GATGCTGACG ACCTCGACGA GTTTATTGAA
CGTTGGGGAG TTTGGTTCGC ACAAGCTTCT AAGGGGGAAC TGCAGGCAGA GCAAATCCAC
AACATAGGTG ACATTCAGGA TTCTTGGCGT AAGGCTTAG
 
Protein sequence
MRTGSTVVLD NGSWTLSDEL GHGGFGEVYR GHQGAVEAAI KFIPKSKGVP REILLDIPKN 
ARNVIPITGT GEDADNWIIS MPVADHSLEK MLNAHGGKLP EDLAVMVLTH IAEALASLDG
SIVHRDIKPG NILLFNAKWC LTDFGIARYA AAATGSLTHK GYGTAAYVAP ELWLGQSATS
QSDIYALGIV AYEIITGSLP FQGTEAEIAH GHLNVIPPST GAPARLDWVI LDSLSKPPSL
RPTAEQFKVK LSQRAAVFNS KAAMAMGQAN HELRSLEEQA EQRLRQAVAE AELRQHHVDR
AGKLLSRIGE EVLTTLQGFA DRVQSQPQKD GGGKLTFHKA SLMISPIIPN TNGHLMAQEE
DPFVVLATAH ITLWQFSGVN GYPGRSHALW YADAKEEGNF QWYETAFIQN GGMQPTQRFR
PFAAEFESRE ATAALRGEGN FLVAWPFAPL DADDLDEFIE RWGVWFAQAS KGELQAEQIH
NIGDIQDSWR KA