Gene Arth_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0830 
Symbol 
ID4446668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp898954 
End bp899898 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content64% 
IMG OID639688637 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_830328 
Protein GI116669395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.84032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCGT TGAGCGCGTG TTCAAGCACC GGAGGAAAGC CCGCCGAGAC GGGTGGCGGC 
GCAGGTGGTG GCCAGGCCGC AAGCACGGAC CGCATCAAGG TGGCTCTCAT TACCCACGCG
GCCGCGGGTG ATACCTTCTG GGACATCGTC CGCAAGGGTG CCGAGGAAGC GTCGGCGAAG
GACAACGTTG AACTCCTCTA CACGTCCGAC CCCGAAGCCG GGCGTCAGGC TCAGCTCATC
CAGCAGGCAA TAGATCAGAA GGTCGACGGC ATCGCGGTCA CGCTCGCCAA GCCCGAAGCC
CTCAAAGATG TCCTGAAGAA GGCCGCCGAC GCGGGCATCC CGATTGTGAG CCTCAATGCC
GGCGAGGGTG TCTCGGCGCA GCTGGGAGCG TTTACGCACT TCGGCTCCAA CGAGCAGCTC
GCCGGTCAGG CCGTGGGCAC CAAGCTCGCC GCGGACGGAT TCAAGCATCC GATCTGCGTG
ATACAGGAAC AGGGGCACGT CGGACTTGAA GCACGGTGCG CTGGTGTCAA GGCGAAGGTG
CCCGGAACGG AGATCCTTTA CGTTGACGGC AAGGACATGA CCTCCGTCCA GTCCACCGCG
ACCGCCAAGC TCCAGGCTTC CAAGGAGGCC GACGTCATCA TCGGCCTCGG GGCTCCCATC
ACGCTGACGC TCCTCAAATC GGTCACTGAC GCGGGCAGTT CGGCCAAGGT GGCAAGTTTC
GACCTGAACG CGGATCTCTC CCGGAAGATT GTGGATGGTG CAGTTCTGTT CACCGTGGAC
CAGCAGCCGT GGCTGCAGGG ATACGGCGCA GTCGACGCGC TGTGGCAGAA CAAGCGCGGC
GGCTTCAGTA TCGGTGGCGG CCAGCCCGTC CTGACCGGCC CCGCGATCGT CGACAAGTCA
AATGCGGCCG ATGTCCTGAA GTTCGCCGAG CAGGGTATCC GCTAA
 
Protein sequence
MMALSACSST GGKPAETGGG AGGGQAASTD RIKVALITHA AAGDTFWDIV RKGAEEASAK 
DNVELLYTSD PEAGRQAQLI QQAIDQKVDG IAVTLAKPEA LKDVLKKAAD AGIPIVSLNA
GEGVSAQLGA FTHFGSNEQL AGQAVGTKLA ADGFKHPICV IQEQGHVGLE ARCAGVKAKV
PGTEILYVDG KDMTSVQSTA TAKLQASKEA DVIIGLGAPI TLTLLKSVTD AGSSAKVASF
DLNADLSRKI VDGAVLFTVD QQPWLQGYGA VDALWQNKRG GFSIGGGQPV LTGPAIVDKS
NAADVLKFAE QGIR