Gene Arth_1705 details


Gene Information

Locus tag: Arth_1705
Symbol:
ID: 4445778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1905285
End bp: 1906619
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 60%
IMG OID: 639689527
Product: extracellular solute-binding protein
Protein accession: YP_831199
Protein GI: 116670266
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
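
The gene and protein lengths listed above agree with the coordinates, assuming Start bp and End bp are 1-based and inclusive and that the gene length counts the terminal stop codon. A minimal Python sketch of that arithmetic (the function names are illustrative and not part of the record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(1905285, 1906619)
print(length, protein_length_aa(length))  # prints: 1335 444
```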


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.751939
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGGCAATG CCCGGATTTT TCGCCCTCTA GCCCTGCTTC TTGGTTCGAC CCTGGCCCTT 
TCGGCCACCG CCTGCGGTGG CCCGGGCGAG TCCAGCACCG AAGCCAAGAC CACCGACATC
ACCAGCTCCG TCGCGGGCCA GGAACTGACG TACTGGTCGA TGTGGAAGGA GGGGGAACCC
CAGCAGAAAA TCATCGCGGC GGCAATCGCG GACTTCGAAA AGGAATCCGG CGCCTCCGTC
AAGGTGCAAT GGCAGGGACG CAGCAGCACG GAAAAGCTCG TGCCCGCGCT CAACACGAAC
AACGTCCCCG ACATCGTGGA CGGCGCTTTC GCCAAATTGG CCCCCGTCAT CGGTGACACG
GACCAGGGTC TGGGGCTCGG CGCCACCTAC GAAGCAAGCG TTGACGGGAA GAAGGTCTCG
GACCTGATCC CGGCGAAGTA CCTTGCGAAC GCCGCTATTG ACGGTAAGGA CGGCCAGCCC
TGGATGCTGC CGTACAGCTT CAGTTCGGAC GGATTGTGGT TCAACGAAGC CAGCCACCCG
GAGCTTGCCT CGGCACCGCC CAAAACATGG GATGAGTTCC TCGCTACTCT CGATGTCCTG
AAGAAATCCG GGGAAGTTCC GCTGGCCGCC GACGGTGACA TCGCGGGATA CAACTCCGCC
TGGTTCATTA CCCTGATGCA GCGATACGGC GGCCCGGGTG CCTTCAAGGA GCTCGCATCA
GACAAAACAG GCAGCGCCTG GGATGACCCG CAGGTCCTGG AAGCAGCCAA AAAAGTCGAA
TCGCTGGTCA AAGGCGGTTA CCTCATCAAC GGCTACGATT CCAGTAAGTG GCCGGCGCAG
CAGCAACTCT GGGCCACCGG AAAAGCAGCC CTGCTGCTGA ATGGGTCGTG GCTGCCCACC
GAAACCGCCC CGTACGCGAC CCCGGGCTTC AAATACTCCT CCTTCCAGTT CCCGGCAGTC
GGCGACAAGC CCGCCAGTGT ACGCGCAGAC TTCGTCGGAT TCGCGATTCC CAAGAAGGCG
AAGAACGCCG CCGCGGCCCA GCAGCTGGCA GTCTTCATGC TGAAAAAGAA ATATCAGGAC
GCCTACGGAA CACAGGCGAA GGTACTTCCC ATCCGGACAG ACGCAGCCAC GTCCCCGGAG
ATGGCATCGA TCAAGAGCGC CCTTGACTCT GCGCCGCAGA TCCACCAAGC GTTCGACGCT
GTCGTCTTCC CCGGGTACCT GGACAAAGTC TTCAACCCCA AAAATGACCA ACTCTTCCTC
GGAAAAATTT CAGCTGAGAC ATTCCTGAAA GAGATGAAAC AGGCGCAGAG CCAGTATTGG
AAGGACAACG GCTAA
 
Protein sequence
MGNARIFRPL ALLLGSTLAL SATACGGPGE SSTEAKTTDI TSSVAGQELT YWSMWKEGEP 
QQKIIAAAIA DFEKESGASV KVQWQGRSST EKLVPALNTN NVPDIVDGAF AKLAPVIGDT
DQGLGLGATY EASVDGKKVS DLIPAKYLAN AAIDGKDGQP WMLPYSFSSD GLWFNEASHP
ELASAPPKTW DEFLATLDVL KKSGEVPLAA DGDIAGYNSA WFITLMQRYG GPGAFKELAS
DKTGSAWDDP QVLEAAKKVE SLVKGGYLIN GYDSSKWPAQ QQLWATGKAA LLLNGSWLPT
ETAPYATPGF KYSSFQFPAV GDKPASVRAD FVGFAIPKKA KNAAAAQQLA VFMLKKKYQD
AYGTQAKVLP IRTDAATSPE MASIKSALDS APQIHQAFDA VVFPGYLDKV FNPKNDQLFL
GKISAETFLK EMKQAQSQYW KDNG
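
The protein sequence can be checked against the gene sequence by removing the 10-character block spacing and translating codon by codon. The following is a minimal Python sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acid to each codon as the standard table (the two differ in permitted start codons); `clean_blocks` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = clean_blocks(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGGGCAATG CCCGGATTTT TCGCCCTCTA GCCCTGCTTC TTGGTTCGAC CCTGGCCCTT"
last_line = "AAGGACAACG GCTAA"
print(translate(first_line))  # prints: MGNARIFRPLALLLGSTLAL
print(translate(last_line))   # prints: KDNG
```

The two printed strings correspond to the first 20 and last 4 residues of the protein sequence listed above. Passing the full gene sequence to `translate` allows the same comparison over the whole protein.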