Gene Apre_0051 details

Gene Information

Locus tag: Apre_0051
Symbol:
ID: 8396798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 61401
End bp: 62537
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 41%
IMG OID: 644994388
Product: extracellular solute-binding protein family 1
Protein accession: YP_003151827
Protein GI: 257065571
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
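
The start, end, gene length, and protein length values in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(61401, 62537)
print(length)                              # 1137
print(expected_protein_length_aa(length))  # 378
```

Both results agree with the gene length (1137 bp) and protein length (378 aa) listed in this record.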


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.115612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA TTACGAGATT GTTAATGGCA ATTGTTATGA TCTTTACCCT AAGTGCTTGT 
GGGAATGCCG ATAACAAGAC AGAAGAAGTA AACGAAGACC AAAAGACTGA GACTAAGGTT
GAAGAAAAGA ACGACGATAA GGAAGTCGAA GATAAGAATG AGGGAGTAGG AGAAGCTGAA
ATAGACCTTC CTGACTTTGA GGGAAGAAGT CTAAATGTAG TCGCTACAAG TGATTCTTAC
GTTCCTTTGT TCGATAGGTT CAGTGAACTG ACAGGAGCTA AGGTAGAATT TTTATCAATG
TCTTCTGGTG AAGTTATAAC TAGAACAAAG GCTGAAGGCA AGCCAATGGC GGACCTATGG
TTTGGTGGTG GACTCGATGC CTTTATGGCA GCTAAGGAAG ATGGCCTCCT TGATTCCTAC
AAGTCTGAAA TGACAGATAA GGTTCCAGAA AGATTTAGAG ATGAGGAAGG TTATTATACA
TCCAAGGGTC TTACAGTAGT GGGTTTTATT GTAAATGATC AAATCCTTGA AGAAAAGGGA
CTTGAAGCGC CAAAAACATG GAAGGACCTT GCCAAGGAAG AGTACAAGGG AGAGATAATC
ATGTCAAACC CTGCAATCTC TGGGACAAAC TACGCTGCCC TTAAGGGACT TCTCGACCTA
TATGGGGAAG AAGAAGGCTG GGCCCTTTTT GAGAAAATCA ATGAAAATAT AGATTTCTAC
TCAAAAAGAG GAAAAGACCC ACAAGAGAAG ACTGCCCAAG GAGAATTTGC TATTGGAATC
ATTCCTGTAG ACAAAAAGGC CTTTGATGCA GCTCGCGACA ATGGACTTTC TGTAGTTTAT
CCAGAAGATG GGGTAAGCTG GGTGCCAGAA GGAGTTGCTG TATTTAAAGA TAGTGAAAAT
GCTGATGTAG CCAAGGCTTT CGAAGACTTT ATGTTGACAA AGGAAGCCCA AAAGATGATT
GCAGAAATCG ACGGAAAAGA CACTAACCAG CTAATCGTCG AAGGAGCAGA GGGCTTTGAC
CTAGATCTTC CTAAGGATAA GCTAGTCGAC GAGGACCTAT CAACATTTGG TACGAAGAGA
GACGAAATAT TAAATAAATT CAAAGAAATA GCCAAGGATA AGGCTAGAGA AGAATAA
 
Protein sequence
MKKITRLLMA IVMIFTLSAC GNADNKTEEV NEDQKTETKV EEKNDDKEVE DKNEGVGEAE 
IDLPDFEGRS LNVVATSDSY VPLFDRFSEL TGAKVEFLSM SSGEVITRTK AEGKPMADLW
FGGGLDAFMA AKEDGLLDSY KSEMTDKVPE RFRDEEGYYT SKGLTVVGFI VNDQILEEKG
LEAPKTWKDL AKEEYKGEII MSNPAISGTN YAALKGLLDL YGEEEGWALF EKINENIDFY
SKRGKDPQEK TAQGEFAIGI IPVDKKAFDA ARDNGLSVVY PEDGVSWVPE GVAVFKDSEN
ADVAKAFEDF MLTKEAQKMI AEIDGKDTNQ LIVEGAEGFD LDLPKDKLVD EDLSTFGTKR
DEILNKFKEI AKDKAREE
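
The correspondence between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons (the tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the two inputs are the first and last 10-base-grouped lines of the gene sequence above.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGAAGAAAA TTACGAGATT GTTAATGGCA ATTGTTATGA TCTTTACCCT AAGTGCTTGT"
last_line = "GACGAAATAT TAAATAAATT CAAAGAAATA GCCAAGGATA AGGCTAGAGA AGAATAA"

print(translate(first_line))  # MKKITRLLMAIVMIFTLSAC
print(translate(last_line))   # DEILNKFKEIAKDKAREE
```

The two outputs match the start and the end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of this record.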