Gene Apre_0510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0510 
Symbol 
ID8397287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp584132 
End bp585409 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content40% 
IMG OID644994869 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003152278 
Protein GI257066022 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.249513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA ATAAGAAAAA GAAACTAGGA CTCTCCCAAC AAATATTTAT AGCCTTAATT 
GCAGGACTTG TAGTTGGAAT CCTAATCCAC TACTTCATGC CAGCCGGCCA CTTCCGTGAT
GATGTCCTAG TAGAAGGAAT ATTTTACACC ATAGGACAAG TTTTTATCAG GCTTATGCAA
ATGCTAGTAG TTCCACTAGT ATTCTTCTCT ATAGCGGACG GCTGTAGAAA CTTAGGAGAT
ACGGAAACCC TAGGTAAGGT TGGAGTAAGA ATTGTATTAT TTTATATATG TACAACAGCT
CTTGCAATAT TTTTATCCCT AATGCTTGCA AGAATTATCG GTCCAGGCAA GGGAATGAAT
ATGAGCCTAG GAGCTAATGA GTTCGAGGTA GACGGAGGAG AAGAATTCTC CCTATCAAAA
ACCATCCTAA ACTTTGTTCC AACCAACCCA ATAGGTGCCC TAGCCAATGG AGAAATGATC
CAAATAATTA TATTTGCAGT TATAGTAGGT CTACTCATAG CTTCTATGGA AGATAGGCTA
ATCACATTGG GAAATGTCGT GACAGAGATG AACGATCTAA TGATGGGAAT GACCATGTGG
GTAATGAAAC TTGCCCCAAT CGGAGTATTC TTCCTCATAT CAAGGACCTT CGCATCTCTT
GGTTATGATG TAATAATCTC CATGCTATCA TACATGGCGA CTGTACTTGG TGGACTTTTA
GTACAACTTA TACTTGTCTA CATGGTCTTA CTTACAGTAT TTACTAGAGT CAATCCGATA
AACTTCCTAA AGAAGTTTGC TCCAGTAATG ACCTTCGCCT TCTCTACTGC ATCAAGTAAC
GCAACAGTTC CAGTAAATAT CAAAACCCTA GAAGAAATGG GAGTAGATAG AAAAATATCT
TCCTTTACCA TTCCCCTAGG AGCGACCATC AACATGGACG GAACTGCCAT CATGCAGGGA
GTTGCGGTAG TCTTTATTGC AAACGCCTAT AACATCGACC TAACGGCAGC AGACTTTGCT
ACAGTAATAC TTACAGCGAC AATAGCATCA GTAGGAACAG CAGGAATCCC ATCTGTCGGA
CTAATCACCC TATCCATGGT CCTTCAATCA GTAGGACTTC CAGTAGAAGG AATAGCGATG
ATCATGGGTA TTGACAGAAT CCTAGACATG GCAAGATCTG CCATAAACAT CTCAGGAGAT
GCCACTGGAA CAATAATAGT AGCAAACTCA GTAGGATCCT TTAACAAGGA AAAATATATT
AGAAAAGTTG AAAAATGA
 
Protein sequence
MEKNKKKKLG LSQQIFIALI AGLVVGILIH YFMPAGHFRD DVLVEGIFYT IGQVFIRLMQ 
MLVVPLVFFS IADGCRNLGD TETLGKVGVR IVLFYICTTA LAIFLSLMLA RIIGPGKGMN
MSLGANEFEV DGGEEFSLSK TILNFVPTNP IGALANGEMI QIIIFAVIVG LLIASMEDRL
ITLGNVVTEM NDLMMGMTMW VMKLAPIGVF FLISRTFASL GYDVIISMLS YMATVLGGLL
VQLILVYMVL LTVFTRVNPI NFLKKFAPVM TFAFSTASSN ATVPVNIKTL EEMGVDRKIS
SFTIPLGATI NMDGTAIMQG VAVVFIANAY NIDLTAADFA TVILTATIAS VGTAGIPSVG
LITLSMVLQS VGLPVEGIAM IMGIDRILDM ARSAINISGD ATGTIIVANS VGSFNKEKYI
RKVEK