Gene Arth_2379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2379 
Symbol 
ID4444993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2666572 
End bp2667825 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content67% 
IMG OID639690187 
ProductROK family protein 
Protein accessionYP_831858 
Protein GI116670925 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.257768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGAGA TCCGCACGCC AACGCCGAGG CGCGGAACCA ATTTGCCCCG TATGGGGGAC 
TTCAATCTCA CGGTGATTTT GGATGCCATC CGCCGGTCCT CGGGAGGCCT CAGCCGGGTG
GAACTCGCCC AGATCGTAGG GCTCTCCCCG CAAACCATCT CCAACATCTC CCGCCGGCTG
CTTGACCAGA ACCTCATCGT CGAAGCCGGT AAGGAAGGAA GCGGGCCGGG TAAGCCGCGC
ACCATCCTCC GGTTGAACCC TGCCGGAATG TATGCCGTGG GGGTCCACCT CGATCCCGCC
GTGACCACTT TCGTGGTGCT GGATCTCGTC GGCTCCGTCG TCAGGCACTC ACGGATCAAC
ACCCCGGGCG CCAGCGACCC CGACGGCATC ATCGCCACCA TTGCCGCCGA GATCAAGGAC
CTGGTGGCAG CCTCCGGCGT CGACCCGGAC AAGATCGCCG GGCTCGGCGT CGCAGCCCCC
GGCCCCATCA ACCTGGATGA GGGAACCGTC GTGGATCCGC CGCTGTTGCT GGGCTGGGAC
CGTGTGCCCC TGCGCGATGC CCTCGCGGAG GCCACCGGAC TGTCCACCCT GGTGGACAAG
GACGTCACCA GCGCCGCCGT CGCCGAAACC TGGGCAGGCG GCCCGAGCGG CTCCGGCAGC
TTCATTTTTA TGTACATGGG CACAGGCATC GGCTGCGGCA TCGTCCTGAA CGATGAAGTG
GTGCGCGGGA CTTCGGGGAA TGCCGGTGAA ATCGGGCACA TCATTGTTGA CCCGGACGGT
CCGCCCTGCG ACTGCGGACT TCGGGGCTGC GTGAAGTCAA GCAGCATCCC GCAGGTCCTC
GTGGCGCAGG CCGAAGCCGC CGGCGTGCTG GAGGTTGTGC GCCATCCCTC CGGTGCGCTG
GACATCCAGG AGAGCTTCGC CAAACTTTGC GACGAGGCCG ACGCCGGCAA CAGCCAGGCG
GGGGAGATCA TCGACCACTC CGCCGTCCTG GTCGCCCGGG CCGTGGCAGT GGTCACGAAC
ACCCTCGACG TCGAAAGGGT CGTCTTCGGC GGCCCCTTCT GGACGCGCCT CTCACGGAGG
TACCTGGACC GTGTTCCCCA ACTGCTGGCA GACAACAGCG CAGCCCGCGA GATCCACGGG
ATAGAAGTTG TCGGGACCGG GGTGGGAGAG GACGTCGGAG CCATCGGTGC GGCCTGCCTG
GTGCTGGAGC ATACGCTGGC CCCGCGCGCA CAGCGCTTGC TGCTGGAAGG CTGA
 
Protein sequence
MTEIRTPTPR RGTNLPRMGD FNLTVILDAI RRSSGGLSRV ELAQIVGLSP QTISNISRRL 
LDQNLIVEAG KEGSGPGKPR TILRLNPAGM YAVGVHLDPA VTTFVVLDLV GSVVRHSRIN
TPGASDPDGI IATIAAEIKD LVAASGVDPD KIAGLGVAAP GPINLDEGTV VDPPLLLGWD
RVPLRDALAE ATGLSTLVDK DVTSAAVAET WAGGPSGSGS FIFMYMGTGI GCGIVLNDEV
VRGTSGNAGE IGHIIVDPDG PPCDCGLRGC VKSSSIPQVL VAQAEAAGVL EVVRHPSGAL
DIQESFAKLC DEADAGNSQA GEIIDHSAVL VARAVAVVTN TLDVERVVFG GPFWTRLSRR
YLDRVPQLLA DNSAAREIHG IEVVGTGVGE DVGAIGAACL VLEHTLAPRA QRLLLEG