Gene Apre_0244 details


Gene Information

Locus tag: Apre_0244
Symbol:
ID: 8397018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 277815
End bp: 278993
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 40%
IMG OID: 644994605
Product: sodium/glutamate symporter
Protein accession: YP_003152017
Protein GI: 257065761
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0786] Na+/glutamate symporter
TIGRFAM ID: [TIGR00210] sodium--glutamate symport carrier (gltS)
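
The start and end coordinates, gene length, and protein length in this record are consistent with one another, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions about how the record was produced; the record itself does not state them.

```python
# Values copied from the record.
start_bp = 277815
end_bp = 278993

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.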


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATAG AATTAGACAT AATACAGACT CTAGGAATTG GGATGATTGC CTACATTATC 
GGAGTCTTAA TTAAAAACAA ATTTGAATTT TTTAGGAAAT TTTTCATTCC ATCTCCTGTA
ATAGGAGGTC TTATTATTAG CCTTGTGATA TTTATAATCA AGGAAAGGGC CAAGATCGAT
TTTAAATTCG ATGGAACTCT CCAAGACTTC TTTATGAATA TCTTTTTTGC ATCCATAGGC
CTCGCCGCAT CCTTTAAGAG TTTGAAAAAG GCGGGGATAA TCGGCCTTAA GCTTGCAATC
GTAATAATTA TCCTCCTACC CTTACAAAAT GTAATCTCTG TAGGAATTTC CCTCATCCTT
GGCATAAATC CCCTCCACGG TGTGGCTATG GGATCTACTT CAATGACTGG AGGAATAGGA
TCTGCCATAT CTTTTGGAAA AATCATGGAA GCTAAGGGAG CAGTGGGAAG CACCTCCATA
GGAGTAGCAG CGGCGACCTT CGGACTTTTG GTTGGAAGCC TTGTTGGAGG ACCTGTCGCA
AAAAGACTCA TCAAGACCCA TTCACTTAGG TCAAAGGGCG CCTTGTACGA ATCAAAAGAT
AACAATATTA GAAGAATAAT AAATCAATCT TCCTTGATGA AGGCCATAAT ACTGGTAAGT
CTGTCAGCCT TCTTGGGAAC TTTTATAAAC AAAATCCTTG CCCTAACTTC CCTAAGCTTT
CCCTACTATG TAGGCTGCCA ATTCGGAGGC CTTGTCGTAA GAAATATCTA CGACCTACTA
GGAAAAGACG TGGACCTTAC TAATGTAAAT ATAGTAGGAA ATATTTCCCT CAACCTCTTC
CTATCTCTTG CCTTAATCAA CTTGAATATT TCGGCAATAA TAGGCCTTGC AGGACCTATG
TTTGTAATCC TAATCTCCCA AGCGATTTTT ATAGGAATCT ATACAAGTCT TGTAACCTAC
AGATTTTGTG GCAAAGACTA TGATGCTGCA GTAATGGCAG CTGGTCATTG CGGAGTAGGC
CTCGGCCAGA CACCAAACGC CATGGCAAAT ATGGAAGCAG TCATAGAGGA AAAGGGGCCA
GCAGATGTTG CCATGTTCGT ATTTCCGATA GTATTAGCAG TTGCAGTAAA TCTCTTCAAC
CCTCTAGTCA TTACATTTTT TATAGATTTA TTGGCATAG
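
The sequence is displayed in space-separated blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the helper below (`gc_percent` is a hypothetical name, not part of any database tool) computes GC percentage for a nucleotide string. It is applied here only to the first displayed line, 60 of the 1179 bases, so its result describes that fragment and need not match the whole-gene GC content field.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    bases = "".join(seq.split()).upper()  # drop the display spacing
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# First displayed line of the gene sequence only (a fragment, not the full gene).
first_line = "ATGACAATAG AATTAGACAT AATACAGACT CTAGGAATTG GGATGATTGC CTACATTATC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1179-base sequence to the same function would give the figure comparable to the GC content field.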
 
Protein sequence
MTIELDIIQT LGIGMIAYII GVLIKNKFEF FRKFFIPSPV IGGLIISLVI FIIKERAKID 
FKFDGTLQDF FMNIFFASIG LAASFKSLKK AGIIGLKLAI VIIILLPLQN VISVGISLIL
GINPLHGVAM GSTSMTGGIG SAISFGKIME AKGAVGSTSI GVAAATFGLL VGSLVGGPVA
KRLIKTHSLR SKGALYESKD NNIRRIINQS SLMKAIILVS LSAFLGTFIN KILALTSLSF
PYYVGCQFGG LVVRNIYDLL GKDVDLTNVN IVGNISLNLF LSLALINLNI SAIIGLAGPM
FVILISQAIF IGIYTSLVTY RFCGKDYDAA VMAAGHCGVG LGQTPNAMAN MEAVIEEKGP
ADVAMFVFPI VLAVAVNLFN PLVITFFIDL LA
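
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the method used by the database. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). `translate` and `CODON_TABLE` are hypothetical names, and only the first and last displayed lines of the gene sequence are used. The last line begins in frame because the 1140 bases before it are a multiple of three.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the display spacing
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed lines of the gene sequence (fragments only).
first_line = "ATGACAATAG AATTAGACAT AATACAGACT CTAGGAATTG GGATGATTGC CTACATTATC"
last_line = "CCTCTAGTCA TTACATTTTT TATAGATTTA TTGGCATAG"

print(translate(first_line))
print(translate(last_line))
```

The two results can be compared with the first 20 and last 12 residues of the protein sequence shown in the record.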