Gene Arth_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0020 
Symbol 
ID4447523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp23164 
End bp25062 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content66% 
IMG OID639687813 
Productserine/threonine protein kinase 
Protein accessionYP_829521 
Protein GI116668588 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACACCC AGCGCGTCCT CAACGGACGG TACGAACTCG GTGAGCTGAT CGGCCGCGGC 
GGTATGGCGG ACGTCCACCG GGGCCTGGAC ACCCGACTGG GCCGGACAGT GGCCATCAAG
CTGCTGCGAC CGGACCTTGC CCGGGATCCA CAGTTCCAGG CGCGGTTCAA GCGCGAAGCC
CAGGCCGTGG CCGCGTTGAA CCATCCTTCG ATCGTTGCCA TCTACGACAC GGGGGACCAC
GCAGTGCCGG GCGGTCCCGA GGACACTGTC CGTGTGCCGT ACATCGTGAT GGAATTCGTG
TCCGGAAAGA CCCTGAGGGA TCTCATCCGT GCGAAGGAAG TCAGCATCGA CCACGCCATC
GACTTCACGC TCGGCGTGCT CTCCGCCCTC GAGTACAGCC ACCGGGCGGG AATCGTACAC
CGGGATATCA AGCCCGCCAA CGTGATGTTC TGCGAAGACT CGGACACCAT CAAAGTCATG
GATTTCGGGA TTGCCCGGGC CATGGCCGAT TCGTCCGCCA CCATGACCCA GACCCAGGCG
GTCGTGGGCA CGGCGCAGTA TCTCTCTCCG GAACAGGCCC GCGGTGAAAC TGTGGACGCC
CGGAGTGATC TCTACTCCGC GGCGTGCCTG CTGTACGAAA TGCTGACGGG AAGGCCCCCG
TTCATCGGTG ACAGTCCCGT ATCAGTCGCC TACCAGCACG TCCGCGAGAT TCCGGAACCG
GCCAGCAGCC TCAACCCCGA GGTGTCAGAG GCCCTGGACA GCGTCCTTTC GAAGGCCCTG
CAGAAGAACC GTGCTGACCG TTTCCAGGAT GCGGCCGCAT TCCAGCGGGC ACTCCGGGCG
GCCCGCAACG GCATCCCCGT GCCTGATGTG GCGGCGGGCG AGGCCCCGAC AGATCCCAAC
AACACGGTCC CGGCCGGGGA ACGGACGGCC CTCGCCGCGC CTTACTCGCT GACGGGGGCA
AGCTTCCTCG ATGATTCACC GAGCGGCCGG CTGCGGCCCG TCCATGACAC CCTTGGCGAC
GACCAGGCGA TTCCGGCGCA GGTTTATGAG CCCTCGGAAT CCAGTGATCT TCCCCTCGGG
TTTCCGCCGG AACGTGAGCG CACCCCGCGG CAGAAATCCC GCCGTCGAAC CTGGATTGCC
ACGTTGGTGA TCTTCACCCT GCTGGTGCTG GCCGGCGGCG GCCTCTGGCT CTACAACATG
ATGAACCAGG CGCCCCCTCC GGTGGCGAAG GTAGAGGTGC CGGCCGTATC GTCGCTGACG
GAGTCCGAGG CGCTTCAGCG GTTGTACAAC GCCAGGCTGA GCCCGCAGAT CACCAGGTTG
CCGCACGACA CCATCACCAA GGGCACGGCC ATCGGCACGG TGCCGGCCGC CGGCACCGCC
ATGGAACCGG ACTCGAAGGT AACCCTGAAC ATCTCCGACG GCCCGAGCGC CGTCAAAATC
CCGGATGATC TGCCGGGGCG GACCGAAGCG GCGGCCCGGG ACGTTCTTCG CCAGATCGGC
CTCGCCGGCG CCCCCGGAAC CACCATGGCC AACAGCGCCA CCGTTCCCAC CGGAATCGTG
ATCACCACCA AGCCGGCGCC GGGTCAGACC GTCGCAGTCG GAAGCACCGT GGAAATCGTG
GTGTCCACCG GCAAGGTGGC CATGCCCGAA CTCCGCGGGC TGCCCAGGGC GGAGGCGGAG
ACGGCACTCA AGAACCTGGG CCTTGGCATC GATGTGAAGG AAGTCGAAAA CTCCGAAGTT
GAACCGGGGA AGGTCACCGA GCAGAGCGAC GCCGTCAACT CGCTGGTGGA GCAGGGCAAA
ACCATCTCCA TTATCGTCGC CAAGGCGCCG GCGCCCAGCC CCAAGCCGAC TCCAACGCCG
ACGCCCACGC CTACCGAGAC GAGCCGGGAC CGGGGATAG
 
Protein sequence
MNTQRVLNGR YELGELIGRG GMADVHRGLD TRLGRTVAIK LLRPDLARDP QFQARFKREA 
QAVAALNHPS IVAIYDTGDH AVPGGPEDTV RVPYIVMEFV SGKTLRDLIR AKEVSIDHAI
DFTLGVLSAL EYSHRAGIVH RDIKPANVMF CEDSDTIKVM DFGIARAMAD SSATMTQTQA
VVGTAQYLSP EQARGETVDA RSDLYSAACL LYEMLTGRPP FIGDSPVSVA YQHVREIPEP
ASSLNPEVSE ALDSVLSKAL QKNRADRFQD AAAFQRALRA ARNGIPVPDV AAGEAPTDPN
NTVPAGERTA LAAPYSLTGA SFLDDSPSGR LRPVHDTLGD DQAIPAQVYE PSESSDLPLG
FPPERERTPR QKSRRRTWIA TLVIFTLLVL AGGGLWLYNM MNQAPPPVAK VEVPAVSSLT
ESEALQRLYN ARLSPQITRL PHDTITKGTA IGTVPAAGTA MEPDSKVTLN ISDGPSAVKI
PDDLPGRTEA AARDVLRQIG LAGAPGTTMA NSATVPTGIV ITTKPAPGQT VAVGSTVEIV
VSTGKVAMPE LRGLPRAEAE TALKNLGLGI DVKEVENSEV EPGKVTEQSD AVNSLVEQGK
TISIIVAKAP APSPKPTPTP TPTPTETSRD RG