Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3466 |
Symbol | |
ID | 8327656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 4029343 |
End bp | 4031133 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644943966 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003101206 |
Protein GI | 256377546 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0853012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTGTTT GGCAGCGTAT CGTCGCCGCG TCGGCGGCGG CAGGTCTGTC CGTGTTGCCC TTCGGGGTAC CGGCCCAGGC CCAGAACGAG GTCGTCTTAC GGGTGGCCAT CACCCAGCAG GTCGACTCGC TCAACCCGTT CCTGGCCACC TTCCAGGCCA GCACCGAGGT CGGCAGGCTC ATGTACGACT TCCTCACCGC CTACGACCAG CGCGACCAGA CCCCCGTCCC CGCCCTCGCC GACCGGTGGT CCTCCAGCGA GGACCGGCTC ACCTGGACCT TCCACGTCCC CGAGGGCCGC AAGTGGTCCG ACGGCCAGGA CATCACCGCC GACGACGTCG CCTTCACCTA CGACCTGATG ATGCGCGACA CCACCGCCGC CACCGCCAAC GGCAGCTTCA CCGCCGACTT CGAGTCCGTC ACCGCCTCCG CCGACGGCCG CGAGGTCGTC ATCAGGACCA AGCAGCCGCA GGCCACCATG CTCGCCCTGG ACGTGCCCAT CGTCCCCGAG CACGTCTGGT CCGAGGTCAC CGACGTCGGC GACTACACCA ACGACCAGGG CCCGGTCGTG GGCAGCGGCC CGTTCGTGCT CACCGAGCAC AAGCCCAACG AGTTCATCCG CTTCGCCGCC AACAAGACCT ACTGGCGCGG CGCGCCCAAG TACGACAGCC TCGTCTTCGT CTACTACAAG ACCACCGACG CCGCCGCCCA GGCCCTGGCC AAGGGCGAGG TCGACCTGGT CAACCGCCTC GGACCCGCCC AGTTCGACTC GCTGGAAGGC GCCGAGGGCG TCACCCGCAA CAAGGCCAAC GGCCGCCGCT TCAACGAGCT GGTCCTCAAC TCCGGGGCCG CCACCAACAC CGGCGAGCCC ATCGGCGACG GCCACCCCGC GCTGCGCGAC CTCGTCGTGC GCCGCGCCAT CGCCCAGGCC ATCGACCGCG ACGCCATCAT CGCCCGCGTC AACAACGGCT ACGCCCAGCG CGGCACCGGC CCCATCCCGC CGGTCTTCCC CGCCTACCAC CTGCCGCAGC CCGACCCCGA CCCGCTGCCG CACGACCCCG CCGCCGCCAA CGCCGCCCTC GACGCCGCGG GCTACGCGCG CGGGGCCGAC GGCACCCGCG CCAAGGACGG CCGCCCGCTG CGGCTGCGCC TGCTCGGCCA CGCCAGCCGC GCCTACGACG AGCAGGCCGC CGAGTTCGTC AAGGGCGGCC TCGCGGCCAT CGGCGTCGCC GTCGACGTGC AGATCGTCTC CGACAACCAG CTCAACGAGT CCGCCACCGC GGGCACCTTC GACCTGGTCT TCTCCGGCTG GGGCACCAAC CCCGACCCCG ACTTCATCCT GTCGCTGCAC ACCTGCGCCC AGCGCCCCGG AGCCGACGGC AAGGGCGGCA CCACCGACAC GTTCTTCTGC GACCCCGAGT ACGACGCGCT GCACGCCCGC CAGCGCGCCG AGTTCGACCA GGCCAAGCGC GCCGACCTGG TCAAGCAGAT GCAGCGCCGC TTCTACGAGC AGGTCCCGGC CGTCGTCCTC GGGTACGACA ACGTGCTGGA GGCCTACCGC AGCGACAAGT TCACCGGCTT CCCCGTCCAG CCCGACCCCG GCGGCGTGAT CATGGCCCAG AACGGCGTGT GGGGCTACTA CGGCGCCACC CCCGCCGCCG CCGGCGCGCA GCGGGACACC GGCAACGGCA CCCTCGTCGT CATCCTCATC GTCGGCGTCG CGGTGGTCGT CGTGGCGGGC GGGGTGCTGC TGGCCCGCAG GCGCGGCGCG GGCGCCGAGG ACCGCGAGTG A
|
Protein sequence | MGVWQRIVAA SAAAGLSVLP FGVPAQAQNE VVLRVAITQQ VDSLNPFLAT FQASTEVGRL MYDFLTAYDQ RDQTPVPALA DRWSSSEDRL TWTFHVPEGR KWSDGQDITA DDVAFTYDLM MRDTTAATAN GSFTADFESV TASADGREVV IRTKQPQATM LALDVPIVPE HVWSEVTDVG DYTNDQGPVV GSGPFVLTEH KPNEFIRFAA NKTYWRGAPK YDSLVFVYYK TTDAAAQALA KGEVDLVNRL GPAQFDSLEG AEGVTRNKAN GRRFNELVLN SGAATNTGEP IGDGHPALRD LVVRRAIAQA IDRDAIIARV NNGYAQRGTG PIPPVFPAYH LPQPDPDPLP HDPAAANAAL DAAGYARGAD GTRAKDGRPL RLRLLGHASR AYDEQAAEFV KGGLAAIGVA VDVQIVSDNQ LNESATAGTF DLVFSGWGTN PDPDFILSLH TCAQRPGADG KGGTTDTFFC DPEYDALHAR QRAEFDQAKR ADLVKQMQRR FYEQVPAVVL GYDNVLEAYR SDKFTGFPVQ PDPGGVIMAQ NGVWGYYGAT PAAAGAQRDT GNGTLVVILI VGVAVVVVAG GVLLARRRGA GAEDRE
|
| |