Gene Apre_0061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0061 
Symbol 
ID8396808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp73722 
End bp75110 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content35% 
IMG OID644994398 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003151837 
Protein GI257065581 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAA AAATCGCTAG CAGACTAATG TTAGTCTTGA TGTTTGTTAC TTTAACTGCA 
TGTGCAGGAC GTAATGAGGA TTCTAAAGCA ATTGATACAG GAAAAGATAG TAAAAATAGT
GAAAGCTCTG AATCAAACAG TAGCAATGAT GCCACTAGTT CAGATGGCAA GACAACTATA
GTATTTTGGC ACTCAATGGG TGGTAATCTA AACGACGCTA TAGACCATCT AGTTGCTGAA
TATAATAAGT CACAAGATAA ATATTATGTG AAGGCAGAAT TCCAAGGAGA ATACGACGAT
GCTCTTACAA AACTTAGATC ATCTGCTTCA GGTAAGGATG TCGGCGCTGA TTTGGTTCAA
GTTTTTGAAT TAGGAACAAG ATATATGATT GACTCAGGTC TTATCAAACC TATGCAAGAT
TTTATTGATG AAGAAAATTA TGACACAAGT CAATTAGAAG ATAATCTTCT CGCTTATTAT
ACAGTAGATG GAAAATTAAA CTCTATGCCA TTTAACTCTT CAACACCTCT TCTTTATTAC
AACAAGGATA TGTTTGAAGA AGCTGGAGTG GAAGTTCCAA AGTCTTTAGA AGAGATGAAG
CTTGTAGGTC AAAAACTTAA GGAAGCTGGT GTTACAGATA TGCCAATTTC ATTTAGTGTA
TATGGTTGGT GGATTGATCA ATTCATGAAT AAACAAGGAC TTGATTTATT TGATAATGAA
AATGGTAGAA AAGCTAATCC TACAAAATCG GTATTCGATG AAAATGGTGG TGTTACTAAT
ATACTTAAGG CATGGAAAGA TCTAGCTGAT GAAGGGATAG CACCTAATGT TGGCAAGCAA
GGTGGAACTG CAGAATTTAA TTCAGGAACA AGTGCTATGA CATTTGCATC AACTGCTTCT
CTTACACAAA TATTATCAGA AGTAGGAGAT AGATTCGAAG TTGGTACTGC ATATTTCCCA
TCTGTAAAAG ATTCTGATAA AAACGGAGTT TCAATCGGAG GTGCATCTTT ATATGCTATA
AATTCAGGCG ATGAGGAAAA AATGGCAGGA ACATGGGACT TCATTAAGTT CTTAGTAAGT
GTCGAATCTC AAACATATTG GAATGCAAAT ACAGGATATT TCCCAGTAAA TAAGGGAGTA
CTCGATACAC AAGAATTCAA AGACCTAATT AAGAAATCTC CACAATTTGA GACTGCAATC
GATCAATTAC ATGATTCAAA ACCAGAAGAT CAAGGGGCAC TTGCAACAGT GTATCAAGAA
TCTCGACAAA TTATGGAAAA GCACATCGAA GATGTCCTAA ATGGAAATAT AAGTCCAGAA
GATGCAAGCA AAAAGATGGC AGAAGAAATA GATGCTGCCT TAGAATCATA CAACCAGGTT
AACAAATAA
 
Protein sequence
MFKKIASRLM LVLMFVTLTA CAGRNEDSKA IDTGKDSKNS ESSESNSSND ATSSDGKTTI 
VFWHSMGGNL NDAIDHLVAE YNKSQDKYYV KAEFQGEYDD ALTKLRSSAS GKDVGADLVQ
VFELGTRYMI DSGLIKPMQD FIDEENYDTS QLEDNLLAYY TVDGKLNSMP FNSSTPLLYY
NKDMFEEAGV EVPKSLEEMK LVGQKLKEAG VTDMPISFSV YGWWIDQFMN KQGLDLFDNE
NGRKANPTKS VFDENGGVTN ILKAWKDLAD EGIAPNVGKQ GGTAEFNSGT SAMTFASTAS
LTQILSEVGD RFEVGTAYFP SVKDSDKNGV SIGGASLYAI NSGDEEKMAG TWDFIKFLVS
VESQTYWNAN TGYFPVNKGV LDTQEFKDLI KKSPQFETAI DQLHDSKPED QGALATVYQE
SRQIMEKHIE DVLNGNISPE DASKKMAEEI DAALESYNQV NK