Gene Apre_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1064 
Symbol 
ID8397851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1139652 
End bp1140857 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content36% 
IMG OID644995411 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003152812 
Protein GI257066556 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000188211 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TAGGTTTACT GCCAAGGCTA ATCATAGCCA TCATTTTAGG TATTCTAATC 
GGTCTATTTG GACCAGCAGT TCTTGTTAGA ATATTAATTA CCTTTAATGG ACTATTTGGC
AACTTCCTAA ACTTTGTTAT TCCTTTAATC ATCATGGGAT TTGTTATCCC AGGTATAGCA
GATTTAGGAA ATGATGCAGG TAAGACACTA GCTATTACTG CTGGAATAGC ATATCTATCA
ACAGTAATAT TCGGTACGGC TACATATTTC ACAGGAAGCG CTATATTACC ACACTTCATA
GATCCAGGTA CAATGAACTT TAACCCTGAC CAAAACACAG GTAGAGTCCT AGAACCTTTC
TTTGAATCAC CAATGGATCC TATATTTAGT GTAACAACCG CACTTATTAT GTCATTCATA
CTTGGTGTAG GTATTGCTGC AGTTAAGGGA GAATTCCTTA AAAAAGCAGT ACACGAGTTT
TCTGATATTA TTACAAAATT AATTTCAAAC ATAGTAATTC CTTTATTGCC TATCCACATT
GCTGGTATTT TTGCTAATTT GGCATATCAA GGAACAGTAG CAAAAGTATT ATCAGTTTTC
TCAAAAGTAT TTATAATGGT AATTATACTT CACTGGTTGA CTATTTTGAT CCAATACACA
ATAGCAAGCT CTATGGGTGG AGGTAATCCA TTTGAGAAAA TCAAAAATAT CTTCCCAGCT
TATATGACAG CAATTGGAAC CCAATCATCA GCAGCAACAA TTCCTGTAAC ATTAAGACAA
ACCTACAAAA TGGGTGTTAA CAAGGGTATA GCAGACTTCG TTATTCCACT TTGTGCGACA
ATCCACTTAT CAGGTTCAAC CATAACGCTT ACATCTTGTG CTATGTCTAT ACTCATGCTA
CAAGGTGGAG ATGTTACCTT AGCTCACATA TTCCCATTCA TATTAATGCT TGGTGTGACT
ATGGTTGCAG CACCTGGAGT ACCTGGAGGA GCAGTAATGG CAGCTCTTGG TATCTTACAA
TCTATGCTTG GATTTACTGA ACCTATGACA GCCCTAATCA TTGCTTTGTA TGTTGCCCAA
GATTCTTTTG GTACAGCTTG CAATATCTCT GGTGACGGTG CTATAGCATG TATAGTAAAC
AAAATTAGTG GATTCAAATT AGACCCACAA GCTAATGAAG CTTATATTGA TGAATTAGTT
AAATAA
 
Protein sequence
MKKLGLLPRL IIAIILGILI GLFGPAVLVR ILITFNGLFG NFLNFVIPLI IMGFVIPGIA 
DLGNDAGKTL AITAGIAYLS TVIFGTATYF TGSAILPHFI DPGTMNFNPD QNTGRVLEPF
FESPMDPIFS VTTALIMSFI LGVGIAAVKG EFLKKAVHEF SDIITKLISN IVIPLLPIHI
AGIFANLAYQ GTVAKVLSVF SKVFIMVIIL HWLTILIQYT IASSMGGGNP FEKIKNIFPA
YMTAIGTQSS AATIPVTLRQ TYKMGVNKGI ADFVIPLCAT IHLSGSTITL TSCAMSILML
QGGDVTLAHI FPFILMLGVT MVAAPGVPGG AVMAALGILQ SMLGFTEPMT ALIIALYVAQ
DSFGTACNIS GDGAIACIVN KISGFKLDPQ ANEAYIDELV K