Gene Apre_0397 details


Gene Information

Locus tag: Apre_0397
Symbol:
ID: 8397171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 449140
End bp: 450369
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 39%
IMG OID: 644994755
Product: sodium:dicarboxylate symporter
Protein accession: YP_003152167
Protein GI: 257065911
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
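
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 449140
end_bp = 450369

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1230, matches "Gene Length"
print(protein_length)  # 409, matches "Protein Length"
```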


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000267322
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAGA AGAAAGTTAA AAAGAAATCG AAGCTTACCA GAAACATACT GATTGCACTA 
GTTTCGGCGA TTATTCTGGG TCTTCTTTTA CAAAGCAAGG CAGAGATTTT AAATGAGTAT
ATCAAACCAT TCGGTGATAT ATTTCTTAAC CTAATCAAAT TTATAGTAAC ACCTATCGTA
TTTTTCTCAA TAATTGGAGG AATTGTCTCC ATGAAAGATA TCAAAAAGGT CGGAGAAGTC
GGAATTTTTA CCATAATTTA TTATTTTCTT ACCACATCCT GTGCCATAGT TATAGGACTT
GGCTTTGCTA ACATCTTTAA GAGAGTATTT CCAGTAATAG CAACTACTAA CCTGGAGTTT
GAAGTCGGGG AGAAAGTTTC CTTTATGGAT ACCCTAGTAA ACATCTTTCC TAAAAATTTC
CTAACACCAA TGGTAGAGGC CAACATGCTC CAGATAATAG TAGGGGCAAT GATAGTTGGT
TTTTCCATCT TACTTATCAA AAAGGAAAGC CAAGATAAGG CCATAGGAGC AGTTGAAGTC
CTAAATGATA TCTTTATGAA GGCAATGGAG TTAATCCTAA GCCTATCTCC AATAGGAGTA
TTTTGCTTAC TAGTTCCAGT AATAGCAGAA AATGGAGCAA TGATTATAGG ATCTCTCGCA
TCAGTATTGC TTGTAGCCTA TCTAGCCTAC GCCCTACACG GCCTTGTAGT CTACTCATTT
ACTATAAAGA CCTTGGCTAA AATAAGTCCT ATACAATTCT TCAAGGGCAT GGCACCAGCT
ATAATGTTTG CCTTCTCATC AGCATCTTCA GTAGGAACTA TTCCAATCAA CATCAAATGC
TGTGAAGAGC TTGGAGCAGA TCATGACGTA ACAAGCTTCG TCCTACCCTT AGGAGCTACC
ATAAACATGG ACGGAACAGC AATCTATCAG GGAGTGTGCG CAGTCTTTAT CGCTTCTTGT
TATGGGATCG ACCTAACACT TGGCCAAATG GTAAATATAG TTCTTACAGC AACACTCGCG
TCCATAGGAA CAGCAGGAGT TCCAGGAAGC GGAATGATAA TGCTTGCCAT GGTCCTTCAA
TCAGTAGGTC TTCCTGTAGA AGGAATCGCC CTAGTAGCAG GAATCGATAG GATATTTGAC
ATGGGAAGAA CTACCCTAAA TATTACAGGA GATGCAACTG CTGCTATAAT AAATACAGAA
AGACTTAGAA GAAAAGGGAA AATTGCATAA
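
As a rough illustration of how a GC content figure can be derived from a sequence laid out in space-separated 10-base blocks like the one above, here is a minimal sketch. The function name `gc_percent` is hypothetical, and the record does not say how its own GC value was computed or rounded, so this is only one plausible approach.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other non-nucleotide characters are ignored, so the
    blocked layout used in the record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 4 of 10 bases are G or C.
print(gc_percent("ATGTCAGAGA"))  # 40.0
```

Applying the same function to the full gene sequence gives a value to compare against the GC content field in the record.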
 
Protein sequence
MSEKKVKKKS KLTRNILIAL VSAIILGLLL QSKAEILNEY IKPFGDIFLN LIKFIVTPIV 
FFSIIGGIVS MKDIKKVGEV GIFTIIYYFL TTSCAIVIGL GFANIFKRVF PVIATTNLEF
EVGEKVSFMD TLVNIFPKNF LTPMVEANML QIIVGAMIVG FSILLIKKES QDKAIGAVEV
LNDIFMKAME LILSLSPIGV FCLLVPVIAE NGAMIIGSLA SVLLVAYLAY ALHGLVVYSF
TIKTLAKISP IQFFKGMAPA IMFAFSSASS VGTIPINIKC CEELGADHDV TSFVLPLGAT
INMDGTAIYQ GVCAVFIASC YGIDLTLGQM VNIVLTATLA SIGTAGVPGS GMIMLAMVLQ
SVGLPVEGIA LVAGIDRIFD MGRTTLNITG DATAAIINTE RLRRKGKIA
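
The protein sequence can be checked against the gene sequence by translating codons. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any particular library.

```python
from itertools import product

# Amino acid assignments in NCBI order (codon bases cycle through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTCAGAGA AGAAAGTTAA AAAGAAATCG"))  # MSEKKVKKKS
# Last 30 bases end in the stop codon TAA.
print(translate("AGACTTAGAA GAAAAGGGAA AATTGCATAA"))  # RLRRKGKIA
```

Both fragments agree with the start and end of the protein sequence listed in the record.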