Gene Apre_0480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0480 
Symbol 
ID8397255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp550681 
End bp551967 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content38% 
IMG OID644994837 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003152248 
Protein GI257065992 
COG category[R] General function prediction only 
COG ID[COG1823] Predicted Na+/dicarboxylate symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0200306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA TTTCATTAAT AATAACGATA GGCCTATTTT TCCTCTTAGG GAAAATGAAA 
GAAAAAAATA TTTCTTTTGC TAAGAGAGTA ATTTTGGCGG CTATCTTGGG TCTAGGCTTG
GGATATTTCT TTGCTGGATC AACTGAATAT GTAGCAATTT TTGGGAAAGT TTTTATAAGA
TTGTTATCAG CAATGGTTAT ACCGCTCTTA TTTGCTACCA TTATAAGAAC AACCCTAAAT
ACTGGTTCTC TGGGCAAGCT TAGAAGCGTA GGGCTTAAGT CTGTGGGCAT ACTAAGTCTC
CATAATATCT TAGGATCAAC AATTGGTCTA ATCCTTGCTG TTATCTTTTC TATAGGTAAG
GGAGCGAGTA TCCCTTTGCC AGAAGCAAGC GAAGTTAAGG AAGTGCCTAC TGTAGCAGAG
ACTATTACAA ACTTCTTTCC ATCAAATATT ATTTCAAACG CAGGAGATGG GATGGTAATT
CCTGTAATAA TTTTCTCTAT CCTAGTTTCA ATAGCAGCCC TTAAGCTAGT TGATAAGGGA
GAAGGAGAGA AAGTTACGGC CTTTAAGGAC TTTATCAATT CCTTTGCAGA GATAATCTAT
CAGCTTACAA GTATGATTAC AAGTCTTACA CCTTTTGCTG TCCTATCTTT AATGGCAGAA
GCTGTATCAA AGATTGACTA TGAAGCTGTT AAGCCACTTT TACTAATCTT AGTTTTAAAC
TATGTTGCAA GTGCTATCCA TTCATTTATT ACGACAGGAG CCCTTGTTTC TGTATTTGCT
AAGGTAAATC CTGTAAAGTA CTTCAAAAAT GCATGGCCTG CTCAAGTAGT TGGTTTTACA
ACTCAATCAT CAATGGGATC TCTTCCAGTA AATATGGAAA ATCTAGAAAA GACTCAAGGA
GTCTCAGAAG ATATAGCCTC TTTCGTAGCT CCACTTGGGG CAACCATGGG CATGCCAGGT
TGTGCAGGCT TTTGGCCAGT GATGAATGCT GTTCTTACAA TAAATGTTAT GGGACTTGAT
TTTGGAGGAT TTGACTATGT TAAGCTAGTA CTAATAGCCC TTCTAGTTTC TCTAGGAACT
GTCGGTGTAC CAGGAACGGC AACTATAGCA ACTACTGCAG TTTTTGCTGC TATGGGACTT
CCTCTAGAAA TGGTAGTTCT CCTAGCACCA ATTTCTTCTC TAGCAGATAT GGGAAGGACA
TCTACAAATG TAGTCGCAGC CAACTCTTCT GCCCTAATAG TTGCAGCAAG TGAAGGAAAG
CTAAATAGAG AAGTATTTAA TAGCTAA
 
Protein sequence
MKFISLIITI GLFFLLGKMK EKNISFAKRV ILAAILGLGL GYFFAGSTEY VAIFGKVFIR 
LLSAMVIPLL FATIIRTTLN TGSLGKLRSV GLKSVGILSL HNILGSTIGL ILAVIFSIGK
GASIPLPEAS EVKEVPTVAE TITNFFPSNI ISNAGDGMVI PVIIFSILVS IAALKLVDKG
EGEKVTAFKD FINSFAEIIY QLTSMITSLT PFAVLSLMAE AVSKIDYEAV KPLLLILVLN
YVASAIHSFI TTGALVSVFA KVNPVKYFKN AWPAQVVGFT TQSSMGSLPV NMENLEKTQG
VSEDIASFVA PLGATMGMPG CAGFWPVMNA VLTINVMGLD FGGFDYVKLV LIALLVSLGT
VGVPGTATIA TTAVFAAMGL PLEMVVLLAP ISSLADMGRT STNVVAANSS ALIVAASEGK
LNREVFNS