Gene Apre_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0014 
Symbol 
ID8396761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp15199 
End bp17343 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content40% 
IMG OID644994351 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003151790 
Protein GI257065534 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.021356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTA AAAAAGTTTA CTCATCTCTA ATGGCCCTAG GACTCGTCCT AACACTTACA 
GCTTGTGGAT CTAGCACAGA TTCAACCAAG GGTGGAGCGG AGTCAAAGAT CGAAAATGAA
GAGGCTAACA AAGCTACAGA CTCTAAAAAT GCCTCAGAGA CTCCGGAAGA TTTTGACAAG
CAAACTTCTG ATGACACCAT AGTTATGGGA GTTGATTCCT TAAATGGAGA CTTCATCCAA
GGCTATGCCA ATGATGGAAA TGACGTAAAG GTAAGAAGAT TCATGGGTAT AGAAGGAAAC
AATGGCTATG ATTGCTATGT CCAAGACGAG GAAGGTAAGT TTCAAACAAA CACAGCTGCC
CTTGAAAAAG ACCCTGAAGT AAAGATTAAC GAAGATGGAT CAAGGACAAC AACTTATACC
ATAAAGAAAG ACCTAAAATG GTCAGATGGA GAACCTATCA CAGCTGATGA TTATATATTT
GGAATCCTCC TAGAATCAGA TAAGGACTTT AACCCACTAA CAGCATCTAT GAATATAGGA
GCAGACTCAC TCTTAGGCTA CAAGGCCTTC AAAAATGGGG AGACTGATAG TTTTGAAGGA
ATCGAAAAAC ACGACGACTA TAGCTTTAGC CTAACAGTAG ATTCTTCCCA ACTACCATAT
TTTGAAGTAG AAGTCCTATC AAATGCAGGC CCTAGCCCAA TGCACTATAT AGGAGAAAAC
CTAGCTGTTT CAGAAGATGG CAAGAAGCTT GTAGTTAAGG AAGGCTATGA AGTCACAGAT
AAGGATAGAG ACGACTACAA GAAATCTATA GACAAGCAAA TAGAAATCCT AAAAGAAGGC
TTCGAAGAAG ATAGTGAAGG CCTTGATAAG GAAAGTGACG AATACAAGGA AGCCAAAGCT
GACTTAGATA GCAAAGTAGG AGATCTTGAA TCTCGTAAAG AGGGAGATGT AGATCCAACT
AGATTACTAA TAGAAGAAGC TATGATTAAG CTTACAAGTG ATTACAGGTT TAACCCTAAG
GTTACCTGTG GACCATACAA GTTTGACAAG TTCGAAAACA ACATGGTTAG ACTTGACCTT
AACGAAAACT ACCAAGGAAA CTTCAAGGGA GATAAGGCAA GTATTCCTCA CATCATAGTT
CAATTGGTAA ACAAAAACAT AGGACCAGAC CTTCTAGAAA ATGGAGATAT AGACATTTGG
GAAGGCGAAA CTGACGGATC AAAGATCGAC CAACTCAAAA AAGCAGCAGA CGATGGCAAA
ATCCAAGTAG GCTCTTACGA AAGAAATGGT TACGGAAACC TAACCTTCCT AGTAGACAGG
GGAGCAACCC AATACAAGGA GGTAAGACAA GCTATAGCAA GCCTAATGGA TAGGAACGAA
TTTGTTCAAT CATTCTTGGG TGGCTACGGT GTTGTAACCA ACGGTATGTA CGGTACAAGC
CAATGGATGT ATAAGGAAAG AGGAGCAGAC GTCGAAGGAA AATTGGTAAA TTGGGTTCTA
AATATCGACA AGGCAAATGA ACTTCTTGAC AAGACTCCTT TCAAGTTCGA AGCTGATGGA
AAAACTCCTT GGGATAAGAA CAAGGCTCTT GAAGAATTTA ACAAAAACCA AGAAGGCTTC
GACTATTATA GATATGATGA AAACGGAAAT AAGCTTGTTG TAAACCAATA TGGGGCAGAA
CAATCTCCAA TTACAACCCT AATATCCAAC CAACTTCCTC CAAATGCCAA GCAAGCTGGA
ATGGAATATA ACGTGACAAG TGGATCCTTC TCAACCCTAA TAGACCTCTA TACCTTCCCT
AAGGAAGATG CGGAATATAC AGCCTTCTCA ATGGCATCAG ATTTTGCTAC ACCATTTGAT
CCATGGCTAT ACTATTCAAA GGAAGGTCCA TTTAATAGAA ATAAGGTAGA CGATCCAAAG
GCAGATGAGG TAACAACTGC CCTAAGAAGA ACAGCTCCAG AGGAGAAGGA AGCTTATCTA
GATAAGTGGG AAGAATTCCA AAAATGGTAC AACGACTACC TACCAGAAAT TCCACTCTAC
TCAAATGTCT TCCACACAGG ATACAGTAAC AGGATCAAGG GCTTTGATAT AATGACACCG
GTATGGAAGG CATCTGATCA AATAAATGCT ATGACAATTG AATAA
 
Protein sequence
MKIKKVYSSL MALGLVLTLT ACGSSTDSTK GGAESKIENE EANKATDSKN ASETPEDFDK 
QTSDDTIVMG VDSLNGDFIQ GYANDGNDVK VRRFMGIEGN NGYDCYVQDE EGKFQTNTAA
LEKDPEVKIN EDGSRTTTYT IKKDLKWSDG EPITADDYIF GILLESDKDF NPLTASMNIG
ADSLLGYKAF KNGETDSFEG IEKHDDYSFS LTVDSSQLPY FEVEVLSNAG PSPMHYIGEN
LAVSEDGKKL VVKEGYEVTD KDRDDYKKSI DKQIEILKEG FEEDSEGLDK ESDEYKEAKA
DLDSKVGDLE SRKEGDVDPT RLLIEEAMIK LTSDYRFNPK VTCGPYKFDK FENNMVRLDL
NENYQGNFKG DKASIPHIIV QLVNKNIGPD LLENGDIDIW EGETDGSKID QLKKAADDGK
IQVGSYERNG YGNLTFLVDR GATQYKEVRQ AIASLMDRNE FVQSFLGGYG VVTNGMYGTS
QWMYKERGAD VEGKLVNWVL NIDKANELLD KTPFKFEADG KTPWDKNKAL EEFNKNQEGF
DYYRYDENGN KLVVNQYGAE QSPITTLISN QLPPNAKQAG MEYNVTSGSF STLIDLYTFP
KEDAEYTAFS MASDFATPFD PWLYYSKEGP FNRNKVDDPK ADEVTTALRR TAPEEKEAYL
DKWEEFQKWY NDYLPEIPLY SNVFHTGYSN RIKGFDIMTP VWKASDQINA MTIE