Gene Arth_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1824 
Symbol 
ID4445653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2042898 
End bp2043854 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID639689642 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_831314 
Protein GI116670381 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00471967 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAACAA CCCGCAAAAA CATGATTGCC AACGTCAGTG TTATCGCGGC CGTATGCGTG 
TTCGGATCGA TCGGACTCAC AGGATGCGCA ACGGCATCCG GCGCCGCCGG CGACCGCCCG
ATGAAGATCG GAGTGACGGT CGCCAACAGC ACCAATCCCT TCTTCCAACA GGAATCCAAG
ACCGCCGAAA GCTACGGCAA GTCGGTCGGC GCCGAGGTCC TCTCTCAGGT GGCCAATGAA
GACGTGCAGA CCCAGTCGAA CCAGATCGAC CAGTTCATCA CTGCCGGGGT CAAATTCATC
GTGATCGACG CCGCCGACAC CGACGGCGTC GGGCCGGCCG TCAAACGCGC CGTCAGTGCG
GGCATCCCCG TCATCGGCGT TGACAACCAA TCCAAGAATG CCACCGTCAA CATCACCACC
GACAACAAAC AGGCAGGCGA GATCTCGTGC CGTTCCCTGG CCGACAAGCT GGGCGGCAAA
GGCAAAATAG CCATCCTGAA CGGTACGCCG GTGTCCGCCG TTGACGATCG CGTCACCGGC
TGCAAAGGCA TTCTCGGTCA GTACCCCGAC ATCAAGATCG TGGCGGACCA GCGGGGTGAA
AACAGCCGTG ACTCGGCATT GCCCATCGCC ACAGATATCC TGACCGCAAA CCCCGATCTT
GACGGCTTCT TCGCTATCAA CGACCCGAGC GCCGTTGGTG TGCAGCTAGC GGCCGAACAG
AAGGGCGCAT CGGTCGTCAT CACGTCGGTC GACGGTGCCA GCTCGGCCAC AGACGCGATC
GCTGCCGGGG GTCTGATCAC CGCAACTGCT GCGCAGGACC CTGCAGCGCT CATGCGCCAG
GCCATTGATC TTGGGATCTC GATCGTGAAC GGCAAGGAGC CTGATCAGAA AGTGATCCTC
GTTCCGACGG AACTCGTCGA CGCCTCGAAT GTCGCCAAGT ACAAGCCGTG GGGCTGA
 
Protein sequence
MKTTRKNMIA NVSVIAAVCV FGSIGLTGCA TASGAAGDRP MKIGVTVANS TNPFFQQESK 
TAESYGKSVG AEVLSQVANE DVQTQSNQID QFITAGVKFI VIDAADTDGV GPAVKRAVSA
GIPVIGVDNQ SKNATVNITT DNKQAGEISC RSLADKLGGK GKIAILNGTP VSAVDDRVTG
CKGILGQYPD IKIVADQRGE NSRDSALPIA TDILTANPDL DGFFAINDPS AVGVQLAAEQ
KGASVVITSV DGASSATDAI AAGGLITATA AQDPAALMRQ AIDLGISIVN GKEPDQKVIL
VPTELVDASN VAKYKPWG