Gene Information |
Locus tag | Oant_1906 |
Symbol | |
ID | 5379971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ochrobactrum anthropi ATCC 49188 |
Kingdom | Bacteria |
Replicon accession | NC_009667 |
Strand | - |
Start bp | 2008054 |
End bp | 2009685 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640834569 |
Product | extracellular solute-binding protein |
Protein accession | YP_001370451 |
Protein GI | 153009236 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
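The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop; the function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2008054
END_BP = 2009685


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1632 543
```

Both results match the Gene Length (1632 bp) and Protein Length (543 aa) fields in the record.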
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0377443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCATT TCGGACGTTT TCGTATTCTG GCTGCGGGTG CAGCACTTGC CGCCGTACTG GCCGCAAATC CAGCCTGGTC GGCCACGCCA GCCGACACGC TGGTTCAGGC ATGGGCCATC GATGACACGA TCACGCTGGA CCCGGCAGAA TCCTTCGAAC TGAGCCCGGC CGAATTCATC GGCAACGCTT ATGACATGCT CGTTCGCCTC GATATCAACG ACACGTCGAA AGTCATACCG GGCGTCGCTG AAAGCTGGAC CGTTTCCGAC GATGGTCTCA CCTATACGTT CAAGCTGAAG AAGGACATCA AGTTCGCTTC CGGCAACCCG ATTACCTCCG CTGACGTGGC TTATTCCTTC GAACGCGCCG TGCGTCTCGA CAAGAGCCCG GCCTTCATCC TGACCCAGTT CGGCCTGACC AAGGATAACG TCAACGAAAA GGCGAAAGCT GTTGACGATA CTACCTTCGT CTTCACCGTC GACAAGCCGT ATGCACCAAG CTTCGTGCTG AACTGCCTGA CCGCGACCGT CGCTGCTGTC GTCGACAAGA AGCTCCTCGA AGAACACGCC GAGAAGGTTA CGCCGACCGA CGACTACAAA TACGACACCG ACTATGCCAA CGCATGGCTG AAGACCAAGT CGGCAGGTTC GGGCCCGTTC GCGATCCGCG AATGGCGCGC CAATGAAGTC GTCATTCTGG AACGCAACGA CAATTATTAC GGCGAAAAGG CCAAGCTCAA GCGCGTCTTC TATCGTCACG TCAAGGAAAG CGCGACGCAG CGCCTGATGC TTGAATCCGG CGACGTCGAT GTGGCACGCA ATCTGGAGCC GGGCGACTAC GAAGCCGTTC TGAAGAACGA CAAGCTGACG ACCGCCAGCG CTGCCAAGGG CACGGTTTAT TACATCAGCC TCAACCAGAA GAACGCAAAT CTGGCCAAGC CGGAAGTTCG CGAAGCGTTC AAATATCTGG TCGATTATGA TGCAATCGGC TCGACCCTCA TCAAGGGCAT TGGTGAAATC CGCCAGACCT ATCAGGCAAA GGGCGTTCTT GGTTCTCTCG ATGCTGCGCC TTACAAGCTC GACGTTGCCA AGGCGAAGGA ACTTCTCGCC AAGGCGGGTC TGAAGGACGG TCTCTCGGTC ACGTTCGACG TCCGTAACGG CCAGCCGGTC ACCGGCATTG CCGAATCCTT CCAGCAGACC GCTGCTCAGG CTGGCGTGAA GATCGAGATC ATCCCCGGCG ACGGCAAGCA GACGCTGACC AAGTATCGTG CCCGTAACCA CGACATGTAT ATCGGCCAGT GGGGCATGGA CTATTGGGAT CCGAATTCCA ACGCCGAAGC TTTCACCAGC AACCCGGATA ATAGCGACGA CGCATCGACC AAGACGCTTG CATGGCGCAA CGCCTGGGAC ATCCCGGAAC TAACCAAGCA GACGCAGGCA GCTCTTCTGG AACGCGACAA CGACAAGCGT GCTGAACAGT ACAAGAAGCT TCAACAGGAA GCTCTCGACC AGAGCCCGTT CGTCATGCTG TTCCAGCAGG TTGAAGTTGC CGGCATTGGC GGCAACGTGA AGGGCTACAA GCTCGGCCCG ACATTCGACT CCAACTTCCT GGCGAACGTC AGCAAGGAAT AG
|
Protein sequence | MMHFGRFRIL AAGAALAAVL AANPAWSATP ADTLVQAWAI DDTITLDPAE SFELSPAEFI GNAYDMLVRL DINDTSKVIP GVAESWTVSD DGLTYTFKLK KDIKFASGNP ITSADVAYSF ERAVRLDKSP AFILTQFGLT KDNVNEKAKA VDDTTFVFTV DKPYAPSFVL NCLTATVAAV VDKKLLEEHA EKVTPTDDYK YDTDYANAWL KTKSAGSGPF AIREWRANEV VILERNDNYY GEKAKLKRVF YRHVKESATQ RLMLESGDVD VARNLEPGDY EAVLKNDKLT TASAAKGTVY YISLNQKNAN LAKPEVREAF KYLVDYDAIG STLIKGIGEI RQTYQAKGVL GSLDAAPYKL DVAKAKELLA KAGLKDGLSV TFDVRNGQPV TGIAESFQQT AAQAGVKIEI IPGDGKQTLT KYRARNHDMY IGQWGMDYWD PNSNAEAFTS NPDNSDDAST KTLAWRNAWD IPELTKQTQA ALLERDNDKR AEQYKKLQQE ALDQSPFVML FQQVEVAGIG GNVKGYKLGP TFDSNFLANV SKE
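The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and a GC percentage computed; the helper names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the database derived its reported GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence in the record, used as a small example.
example = clean_sequence("ATGATGCATT TCGGACGTTT")
print(len(example), gc_percent(example))  # prints: 20 40.0
```

Applying the same two calls to the full Gene sequence field should give a length that can be compared against the Gene Length field and a percentage that can be compared against the GC content field.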