Gene Arth_1908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1908 
Symbol 
ID4445562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2148214 
End bp2149389 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content63% 
IMG OID639689718 
ProductROK family protein 
Protein accessionYP_831390 
Protein GI116670457 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.477922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTTC CCCCTCGCGT TTCGGCGGGA GAGGTCTTCC AGCATTTTCG TGGCACGTCA 
CCGTTGACCC GGGCCCGGCT TTCAGCACTG ACAGGCCTTT CGCGGGCCGC AACGACAGAC
AGGATGAGAA CCTTGGCGTC GCGGGGACTC ATTGGTCCCG CCAACGAGGC GCCATCGACG
GGTGGCCGGC CATCCGCTCA GTTTCGCCTC ATGCCAAACA GCGGAGCGGT TATGAGCGTC
CTGCTGGGTG CCAGCGAGGC AAAGGTGCAA GCATTCGACC TTTCCGGACA GCCGTTATCT
GCTGCGAAGC CGGTGGAAAC CGTCCGTGGC GGGTATGCCG CTATCTTGGA TTCCTGCCTG
GCTTCAGCGT ATTCGCTCCT GGATTCCATG GACGACATTT CCGGCCAGCT TGTAGCAACG
GGGGTTGTTC TCGATGAAGG TGCGCCGGAT CTGGACTGGC CTGAATATTT TGCTGGCCGC
CCGGTCGTCG TCGACTCCGC GCTGGGGGCC ATGGCAACCG CGGAGGCCCT GTCGCGAACC
CCCCGGCCGC AAAACATGCT CTTCCTGGAC GTGGGAAAGA CCATTGGCTG TGCCGTGCTC
GTCCACGGAA GGACGATGGG GGGTTTCAGG ACCTCCAAAG AGGCGTTCGG GCACACGCCG
GGGAAGGGCA CGCCGACGCT GCCCTGTGCT TGCGGCATCA TGAATTGCCT GCAGGCCATC
GCAGGGGAAG AGGCAATAAT TGCCGGTCTG TCGTCGGACC TTGCAGACGA ACCAGACGCG
ATCAGCGGGG CTGTCCGGCG GAGCGATGCG GCTGCAGTCA GTGCTCTGCG ACAGGCAGGT
CGGGACATAG GTGACACCCT GTTAGGGAGC ATTCACCTCC TTCAACCCGA GTTCATCACA
GTCAGAACCC GGTGGCCCGG TGCCGCCGAC TTTTTGCTGG CCGGCCTTAG GGAGGCAATA
TACGCAAGCG GCGTCCCTGC TGTGACGGAG AATCTGGTGT TGGCGAGTTC AACGACCGGG
TCCCCCGCTA CCGGGATTGC TCTTCGAGCC CTGGACGCCG GACTGGCGGT GGAATCAGTG
GACCGCTTGC TGTCAGCGCC ACCCAACCTC AGCGGACAGC GGAACTATTG GCCGGCACCG
TTGAAGTCGA TCGACCGTCA ACGGCACGCC AGCTGA
 
Protein sequence
MTVPPRVSAG EVFQHFRGTS PLTRARLSAL TGLSRAATTD RMRTLASRGL IGPANEAPST 
GGRPSAQFRL MPNSGAVMSV LLGASEAKVQ AFDLSGQPLS AAKPVETVRG GYAAILDSCL
ASAYSLLDSM DDISGQLVAT GVVLDEGAPD LDWPEYFAGR PVVVDSALGA MATAEALSRT
PRPQNMLFLD VGKTIGCAVL VHGRTMGGFR TSKEAFGHTP GKGTPTLPCA CGIMNCLQAI
AGEEAIIAGL SSDLADEPDA ISGAVRRSDA AAVSALRQAG RDIGDTLLGS IHLLQPEFIT
VRTRWPGAAD FLLAGLREAI YASGVPAVTE NLVLASSTTG SPATGIALRA LDAGLAVESV
DRLLSAPPNL SGQRNYWPAP LKSIDRQRHA S