Gene Arth_2429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2429 
Symbol 
ID4445031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2723614 
End bp2724870 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content69% 
IMG OID639690242 
ProductROK family protein 
Protein accessionYP_831908 
Protein GI116670975 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGGG AAGATAATGC CAACGGCCGT GCAGGCGGTT CCTTCAAAGA ACCGGGTCCG 
GCACCCGGCC GGGTGGAGGA CGTCCGGCGC GGGAACCTCG TGCGGGTCCT CGCGGCAATT
GCGCAGGCAC GGACGGATCC ACAGCGCTAC CCTACGCGCG CCGAGCTGGC CTCCCTGACG
GGCTTGACCA AGGCCTCAGT GTCCAGCCTG GTCGCGGAGC TCGCGGACTC CGGCCTGGTC
ATGGAGTCCG GCGCCACGCG CGACGGTGAA CGCGGCCGGC CCGGGGTAGG CCTTCAGCTG
AGCACCCGCC GCGGCGTGGT GGGCATGGAA ATCAACGTGG ACTACATCTC GGCAGGCCTT
CTGGACCTCG GCGGTGCGCT GCGTGCTTCC AGGACGCTGG AGTGCGGAAA CCGCGGCCAG
TCGCCTGAAT CCGTTATGGC CCTGTTGTCC GGGCTCGTGA ACGGCGTCGT TGCCGAGGCG
GCAGCCGCCG GGATCGAAAT CCTGGGCGGC GGACTGGCGG TGCCCGGCCT CGTGGATACG
GCCTCCGGAA CTGTTTCCAG CGCCCCCAAC CTGCAGTGGC ACAGTGTTGC CCTTGAACTG
GGCGGGCTGC TGCCGGGCGC ACCGCTGGGC ACTGTTCTGT ATAACGAGGC TAACTGCGCC
GCCCTGGCCG AGCTCTGGTA CGGGCACGGA CTGGATTTCC GCGACTACCT GTTTGTTTCC
GGTGAGGTGG GTGTCGGTGG CGGCCTGGTC ATCGGCTCCC GGCTCTTCGC CGGACCCCAC
GGACAGGCGG GGGAGGTAGG CCACGTTGTG GTTGACCCCT CGGGTCCGGA CTGCTCGTGC
GGCGGCCGCG GCTGCCTGGA AACGTTCGCC GGCCAGGAGG CCATCTTTGC CGAGGCCGGC
ATTCCGGCAG GCACTGCCTC CGTGCGGCTG GGGCAACTCG TGGAACAACT TGACGCCGGC
AATGCAGCCG CCACATCTGC CGTGGCCCGC GCGGGCCGCT ACCTTGGCAT CGCCGCAGCA
TCCACGGCAC GGCTGATGAA CCTCTCCGCC GTTGTCCTCG GCGGCCACTT CACCCGGATG
GGGCCGTGGC TTGCACCGGC CGTGATAGAA AGCCTCGCCA ACCATGCGCC CGGCGTCGTC
AGTCCCGCCA GGGTGGCGGT TTCGGAGCTT GGCCAGTCGG CTGCCCTCCT GGGCGCGGCA
GGGAGCGCCC TGCGTTCCGT CCTTGCCGCT CCCTCCGCGC TGACGCCCGC CGGTTAG
 
Protein sequence
MSREDNANGR AGGSFKEPGP APGRVEDVRR GNLVRVLAAI AQARTDPQRY PTRAELASLT 
GLTKASVSSL VAELADSGLV MESGATRDGE RGRPGVGLQL STRRGVVGME INVDYISAGL
LDLGGALRAS RTLECGNRGQ SPESVMALLS GLVNGVVAEA AAAGIEILGG GLAVPGLVDT
ASGTVSSAPN LQWHSVALEL GGLLPGAPLG TVLYNEANCA ALAELWYGHG LDFRDYLFVS
GEVGVGGGLV IGSRLFAGPH GQAGEVGHVV VDPSGPDCSC GGRGCLETFA GQEAIFAEAG
IPAGTASVRL GQLVEQLDAG NAAATSAVAR AGRYLGIAAA STARLMNLSA VVLGGHFTRM
GPWLAPAVIE SLANHAPGVV SPARVAVSEL GQSAALLGAA GSALRSVLAA PSALTPAG