Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1705 |
Symbol | |
ID | 4445778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1905285 |
End bp | 1906619 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639689527 |
Product | extracellular solute-binding protein |
Protein accession | YP_831199 |
Protein GI | 116670266 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.751939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAATG CCCGGATTTT TCGCCCTCTA GCCCTGCTTC TTGGTTCGAC CCTGGCCCTT TCGGCCACCG CCTGCGGTGG CCCGGGCGAG TCCAGCACCG AAGCCAAGAC CACCGACATC ACCAGCTCCG TCGCGGGCCA GGAACTGACG TACTGGTCGA TGTGGAAGGA GGGGGAACCC CAGCAGAAAA TCATCGCGGC GGCAATCGCG GACTTCGAAA AGGAATCCGG CGCCTCCGTC AAGGTGCAAT GGCAGGGACG CAGCAGCACG GAAAAGCTCG TGCCCGCGCT CAACACGAAC AACGTCCCCG ACATCGTGGA CGGCGCTTTC GCCAAATTGG CCCCCGTCAT CGGTGACACG GACCAGGGTC TGGGGCTCGG CGCCACCTAC GAAGCAAGCG TTGACGGGAA GAAGGTCTCG GACCTGATCC CGGCGAAGTA CCTTGCGAAC GCCGCTATTG ACGGTAAGGA CGGCCAGCCC TGGATGCTGC CGTACAGCTT CAGTTCGGAC GGATTGTGGT TCAACGAAGC CAGCCACCCG GAGCTTGCCT CGGCACCGCC CAAAACATGG GATGAGTTCC TCGCTACTCT CGATGTCCTG AAGAAATCCG GGGAAGTTCC GCTGGCCGCC GACGGTGACA TCGCGGGATA CAACTCCGCC TGGTTCATTA CCCTGATGCA GCGATACGGC GGCCCGGGTG CCTTCAAGGA GCTCGCATCA GACAAAACAG GCAGCGCCTG GGATGACCCG CAGGTCCTGG AAGCAGCCAA AAAAGTCGAA TCGCTGGTCA AAGGCGGTTA CCTCATCAAC GGCTACGATT CCAGTAAGTG GCCGGCGCAG CAGCAACTCT GGGCCACCGG AAAAGCAGCC CTGCTGCTGA ATGGGTCGTG GCTGCCCACC GAAACCGCCC CGTACGCGAC CCCGGGCTTC AAATACTCCT CCTTCCAGTT CCCGGCAGTC GGCGACAAGC CCGCCAGTGT ACGCGCAGAC TTCGTCGGAT TCGCGATTCC CAAGAAGGCG AAGAACGCCG CCGCGGCCCA GCAGCTGGCA GTCTTCATGC TGAAAAAGAA ATATCAGGAC GCCTACGGAA CACAGGCGAA GGTACTTCCC ATCCGGACAG ACGCAGCCAC GTCCCCGGAG ATGGCATCGA TCAAGAGCGC CCTTGACTCT GCGCCGCAGA TCCACCAAGC GTTCGACGCT GTCGTCTTCC CCGGGTACCT GGACAAAGTC TTCAACCCCA AAAATGACCA ACTCTTCCTC GGAAAAATTT CAGCTGAGAC ATTCCTGAAA GAGATGAAAC AGGCGCAGAG CCAGTATTGG AAGGACAACG GCTAA
|
Protein sequence | MGNARIFRPL ALLLGSTLAL SATACGGPGE SSTEAKTTDI TSSVAGQELT YWSMWKEGEP QQKIIAAAIA DFEKESGASV KVQWQGRSST EKLVPALNTN NVPDIVDGAF AKLAPVIGDT DQGLGLGATY EASVDGKKVS DLIPAKYLAN AAIDGKDGQP WMLPYSFSSD GLWFNEASHP ELASAPPKTW DEFLATLDVL KKSGEVPLAA DGDIAGYNSA WFITLMQRYG GPGAFKELAS DKTGSAWDDP QVLEAAKKVE SLVKGGYLIN GYDSSKWPAQ QQLWATGKAA LLLNGSWLPT ETAPYATPGF KYSSFQFPAV GDKPASVRAD FVGFAIPKKA KNAAAAQQLA VFMLKKKYQD AYGTQAKVLP IRTDAATSPE MASIKSALDS APQIHQAFDA VVFPGYLDKV FNPKNDQLFL GKISAETFLK EMKQAQSQYW KDNG
|
| |