Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_0014 |
Symbol | |
ID | 8396761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 15199 |
End bp | 17343 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644994351 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003151790 |
Protein GI | 257065534 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.021356 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTA AAAAAGTTTA CTCATCTCTA ATGGCCCTAG GACTCGTCCT AACACTTACA GCTTGTGGAT CTAGCACAGA TTCAACCAAG GGTGGAGCGG AGTCAAAGAT CGAAAATGAA GAGGCTAACA AAGCTACAGA CTCTAAAAAT GCCTCAGAGA CTCCGGAAGA TTTTGACAAG CAAACTTCTG ATGACACCAT AGTTATGGGA GTTGATTCCT TAAATGGAGA CTTCATCCAA GGCTATGCCA ATGATGGAAA TGACGTAAAG GTAAGAAGAT TCATGGGTAT AGAAGGAAAC AATGGCTATG ATTGCTATGT CCAAGACGAG GAAGGTAAGT TTCAAACAAA CACAGCTGCC CTTGAAAAAG ACCCTGAAGT AAAGATTAAC GAAGATGGAT CAAGGACAAC AACTTATACC ATAAAGAAAG ACCTAAAATG GTCAGATGGA GAACCTATCA CAGCTGATGA TTATATATTT GGAATCCTCC TAGAATCAGA TAAGGACTTT AACCCACTAA CAGCATCTAT GAATATAGGA GCAGACTCAC TCTTAGGCTA CAAGGCCTTC AAAAATGGGG AGACTGATAG TTTTGAAGGA ATCGAAAAAC ACGACGACTA TAGCTTTAGC CTAACAGTAG ATTCTTCCCA ACTACCATAT TTTGAAGTAG AAGTCCTATC AAATGCAGGC CCTAGCCCAA TGCACTATAT AGGAGAAAAC CTAGCTGTTT CAGAAGATGG CAAGAAGCTT GTAGTTAAGG AAGGCTATGA AGTCACAGAT AAGGATAGAG ACGACTACAA GAAATCTATA GACAAGCAAA TAGAAATCCT AAAAGAAGGC TTCGAAGAAG ATAGTGAAGG CCTTGATAAG GAAAGTGACG AATACAAGGA AGCCAAAGCT GACTTAGATA GCAAAGTAGG AGATCTTGAA TCTCGTAAAG AGGGAGATGT AGATCCAACT AGATTACTAA TAGAAGAAGC TATGATTAAG CTTACAAGTG ATTACAGGTT TAACCCTAAG GTTACCTGTG GACCATACAA GTTTGACAAG TTCGAAAACA ACATGGTTAG ACTTGACCTT AACGAAAACT ACCAAGGAAA CTTCAAGGGA GATAAGGCAA GTATTCCTCA CATCATAGTT CAATTGGTAA ACAAAAACAT AGGACCAGAC CTTCTAGAAA ATGGAGATAT AGACATTTGG GAAGGCGAAA CTGACGGATC AAAGATCGAC CAACTCAAAA AAGCAGCAGA CGATGGCAAA ATCCAAGTAG GCTCTTACGA AAGAAATGGT TACGGAAACC TAACCTTCCT AGTAGACAGG GGAGCAACCC AATACAAGGA GGTAAGACAA GCTATAGCAA GCCTAATGGA TAGGAACGAA TTTGTTCAAT CATTCTTGGG TGGCTACGGT GTTGTAACCA ACGGTATGTA CGGTACAAGC CAATGGATGT ATAAGGAAAG AGGAGCAGAC GTCGAAGGAA AATTGGTAAA TTGGGTTCTA AATATCGACA AGGCAAATGA ACTTCTTGAC AAGACTCCTT TCAAGTTCGA AGCTGATGGA AAAACTCCTT GGGATAAGAA CAAGGCTCTT GAAGAATTTA ACAAAAACCA AGAAGGCTTC GACTATTATA GATATGATGA AAACGGAAAT AAGCTTGTTG TAAACCAATA TGGGGCAGAA CAATCTCCAA TTACAACCCT AATATCCAAC CAACTTCCTC CAAATGCCAA GCAAGCTGGA ATGGAATATA ACGTGACAAG TGGATCCTTC TCAACCCTAA TAGACCTCTA TACCTTCCCT AAGGAAGATG CGGAATATAC AGCCTTCTCA ATGGCATCAG ATTTTGCTAC ACCATTTGAT CCATGGCTAT ACTATTCAAA GGAAGGTCCA TTTAATAGAA ATAAGGTAGA CGATCCAAAG GCAGATGAGG TAACAACTGC CCTAAGAAGA ACAGCTCCAG AGGAGAAGGA AGCTTATCTA GATAAGTGGG AAGAATTCCA AAAATGGTAC AACGACTACC TACCAGAAAT TCCACTCTAC TCAAATGTCT TCCACACAGG ATACAGTAAC AGGATCAAGG GCTTTGATAT AATGACACCG GTATGGAAGG CATCTGATCA AATAAATGCT ATGACAATTG AATAA
|
Protein sequence | MKIKKVYSSL MALGLVLTLT ACGSSTDSTK GGAESKIENE EANKATDSKN ASETPEDFDK QTSDDTIVMG VDSLNGDFIQ GYANDGNDVK VRRFMGIEGN NGYDCYVQDE EGKFQTNTAA LEKDPEVKIN EDGSRTTTYT IKKDLKWSDG EPITADDYIF GILLESDKDF NPLTASMNIG ADSLLGYKAF KNGETDSFEG IEKHDDYSFS LTVDSSQLPY FEVEVLSNAG PSPMHYIGEN LAVSEDGKKL VVKEGYEVTD KDRDDYKKSI DKQIEILKEG FEEDSEGLDK ESDEYKEAKA DLDSKVGDLE SRKEGDVDPT RLLIEEAMIK LTSDYRFNPK VTCGPYKFDK FENNMVRLDL NENYQGNFKG DKASIPHIIV QLVNKNIGPD LLENGDIDIW EGETDGSKID QLKKAADDGK IQVGSYERNG YGNLTFLVDR GATQYKEVRQ AIASLMDRNE FVQSFLGGYG VVTNGMYGTS QWMYKERGAD VEGKLVNWVL NIDKANELLD KTPFKFEADG KTPWDKNKAL EEFNKNQEGF DYYRYDENGN KLVVNQYGAE QSPITTLISN QLPPNAKQAG MEYNVTSGSF STLIDLYTFP KEDAEYTAFS MASDFATPFD PWLYYSKEGP FNRNKVDDPK ADEVTTALRR TAPEEKEAYL DKWEEFQKWY NDYLPEIPLY SNVFHTGYSN RIKGFDIMTP VWKASDQINA MTIE
|
| |