Gene Arth_1715 details


Gene Information

Locus tag: Arth_1715
Symbol:
ID: 4445754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1916057
End bp: 1917334
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 639689537
Product: ROK family protein
Protein accession: YP_831209
Protein GI: 116670276
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
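
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1916057
END_BP = 1917334
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1278
print(expected_protein_length(GENE_LENGTH_BP))  # 425
```

Both values agree with the Gene Length and Protein Length fields.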


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.853629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTCCT TATGGTTTCT GTTGCCCGGC TCGCGAGTCG GTCAGGCACT CACCCTCATG 
GGCCTGTCGG TGACGGACTA TGATTCAGAG ATGCCCGACG CCACCGAACC AGGAACCACG
GAACGATCGC TTGTCGAGCT CCTGCTGGCC GAAGGTCCGT CCACCCGCCT GGAACTTCAA
GCCAAGCTGG GGGTTTCGCG CCCGACCCTT TCGTCTGCTG TGAACAAACT GTTGAGCCTG
GGACTTCTCC AGGAACAGGG AACGGCAGCA TACGGTGCCG GAAGAAACGG CCGCCCCCAA
GCATTGCTCG CGCCGAACAG GGCTATGGGA GCGGCGGTAG GTATCGAACT GGGCAAAGCT
CAGGTTGCGG TCACTATCCT CGCGATCGAC GGGACGGTTC ACGCCCAAAA GGTCACCTCG
ACGTCACCGG GGACGACGCT GCAGCGACGG CTCAATATCG CGCTGGGCTC CGTCGGCACT
TTCATCAGCT CCAATATTCT CAACCCCGAG TCTGTTCTGG GTGTGGGCGT GGGGGTATCC
GGCCTTCATC CCGATGCCCG GCCGGCCGGT GGCTCCGCTC TCGTTGATCC GCCCGGCGCG
AAGCTTGACA AGCTGAGAAC ACTCCTTGCC GCCCCGGTCG TCTGGGACAA CAACACCCGG
ATGGCCACCT TTCGCCACCT TGGCGGTTCC GGGCTCGACT CCCCCGGTGC CGTTCTCTAT
GTTGTCCTTT CCGCTGGGGT CAGCGCGGGC ATTGTGGACG GCGGGGAGGT CCTTCGAGGC
CGAGGCGCCG CCGGTGAGCT GGGGCATGTC TGCCTCGACC CCGAAGGCCC CGTATGCGGG
TGCGGTTCAA GGGGGTGTCT CGAGGCCTAC GTGGGCGTGG AGGCCGTCCT CCGGTCGGCC
CGGGGCAAGG GTGCCACTGT CGCAGACCTC GAGGAACTGG CCGCCGTTGT CCAGTCAGGT
GATGCCGATG CGCTGGCCGT GATCGGGCTA GTTGGTCGGA TGCTTGGCAT CGGTCTTAAC
AATGCTGCGA TGTTGGTCGA CCCTCGCCGC ATCATTCTCA CGGGTCCTCT CCTTAGCCTG
GGTCCGGCGC TGGTCTCGGC GGCCACGGAG GAACTACGGA TCCGGCGAAT GGCGGTCACT
TTAGGGGTAC CCGACGTCGT GGCCGAGATC GGGTCGCCTT TCGACTCCAG CCACGGTGCC
GCGCTGACGG CGCTTAGGCG GTGGGGCCCC GGTTTCATGG GAATGCTGAC GCAAAATGGG
ATAGCCACGA CGGGCTAA
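
The GC content field of this record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the 10-base blocks used above, so whitespace is stripped first. `gc_percent` is a hypothetical helper name, and the example uses only the first two blocks of the gene sequence rather than the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGCCGTCCT TATGGTTTCT"))  # 45.0
```

Passing the full gene sequence to the same function gives the value to compare with the reported GC content.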
 
Protein sequence
MPSLWFLLPG SRVGQALTLM GLSVTDYDSE MPDATEPGTT ERSLVELLLA EGPSTRLELQ 
AKLGVSRPTL SSAVNKLLSL GLLQEQGTAA YGAGRNGRPQ ALLAPNRAMG AAVGIELGKA
QVAVTILAID GTVHAQKVTS TSPGTTLQRR LNIALGSVGT FISSNILNPE SVLGVGVGVS
GLHPDARPAG GSALVDPPGA KLDKLRTLLA APVVWDNNTR MATFRHLGGS GLDSPGAVLY
VVLSAGVSAG IVDGGEVLRG RGAAGELGHV CLDPEGPVCG CGSRGCLEAY VGVEAVLRSA
RGKGATVADL EELAAVVQSG DADALAVIGL VGRMLGIGLN NAAMLVDPRR IILTGPLLSL
GPALVSAATE ELRIRRMAVT LGVPDVVAEI GSPFDSSHGA ALTALRRWGP GFMGMLTQNG
IATTG
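
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and that translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCGTCCT TATGGTTTCT GTTGCCCGGC"))  # MPSLWFLLPG
```

The output matches the first block of the protein sequence, and translating the final blocks `ATAGCCACGA CGGGCTAA` yields `IATTG`, the last block.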