Gene Apre_1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1691 
Symbol 
ID8398503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1839301 
End bp1841241 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content35% 
IMG OID644996054 
ProductSodium-transporting two-sector ATPase 
Protein accessionYP_003153432 
Protein GI257067176 
COG category[C] Energy production and conversion 
COG ID[COG1269] Archaeal/vacuolar-type H+-ATPase subunit I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATAG TTAAAATGAA TAAGTTTAAC TTGCTAGCTT TTAAATCCGA AAGGGATGAC 
TTGCTAAATA TCCTTCAAGC TTTTAACTAT GTTCACTTTA ACGATTTGGA AGAAGATGAG
CAAAGATCCT ATCTTTCAGA AGTTAGAAAC ACAGACCAAC TTCAAAAAAT CGACTCTAGT
ATAAACAGGG CCGATTATGT AATTAATCTT TTAGAAAATT ACATGAAAGA TAGGGACATC
AAACCAGATA CAAGTTTTAC TGAACTTAGC TTAGAAGATG TAAAGAAAAA GGGTGCGAAT
TTCGACTTCG ATCTTATATA TGGCAAGATA AAAGACCTCG TAGGCCAAAG AGAGATGGCC
CTAGGCCAAA GAAATGAGCT TACAAATAAA ATAGAAGCTC TTAAGCCATG GAAGGATATA
GATGTAGACA TCCAAGAACT CTACGATTCT AAGAGAGTCT TTGTAGAGAC AGGGACAATT
TCTGATCAAT TCTATGATGC ACTTGAAAAA GCCATAGTAG AAAGAAATCT AGAAAAGTCT
CTAGTATACA AAGTCTCTGA ACTTGATAAG ACAAACTATA TAGTCGCCCT ATCAAGTATA
GAAGAAAAGG AAGATCTAGT AGAACTTCTA AGAGAGTTCG GCTATAGCAG AGTTAAAATT
AATTCAACTA GCAAGATAGG AGAAGAAATC TATGACCTAT CTGGCAAACT CGAAGATAAA
GAACAATTAA TTAAAAATCT CGAAAATGAA ATTCTAGATT TCAAAAAATA CTTAAAGGAC
CTATATATTT ACAAGGCCTA CGTCCTAAAC CTAAGAAGAA AGGAAGAATC AAGTGAATTT
TTCCTAGAGA CAGGGCTTAT GAACGTTATT GAAGGCTATG TTCCGGTAGA AGCTACCGAG
AGATTCAAAA AAGATATTAA AAATGTTTTA GGTGATGCCT ACATACTTGA TATCGAAGAA
GCTGATAAGG AAGATAGTTG TGTTCCAATA ATACTTAAGA ACAATAAGTT GGTCGATCCA
TATGAGGAGG TTGTAAAGAC CTATTCACTT CCAAAATACA ACGAAGTAGA TCCAAGTGGA
CTTGTAGCAA TATTTTATAC AATATTTACA GGTTTTATGA TAGGAGACTT AGGCTATGGG
GCCCTTGCAA CAATAGCAAT ATTACTTGCC CTTAAGCTAA AGAAGTTCCC AACTTCTACT
GAGAAGAATT TGAGATTATT CTTAAGGATA TCTCTTTCTG CATGTGTATT TGGAGCTATA
TTTGGATCAG TATTTGGTGG AATAATAGAT GTACCATTTG GTTTAATAGA TCCAGCGACA
GACATCAACG AGCTTATAGT GATGAGCTTG GTAATAGGAG CTGTTTCACT GTTCGTCGCT
TTAGCTGTAA AAGCTTATAT GTATATCAGA GACGGAAAGC CAATGGATGC TATGTACGAT
GTAGGATTTA TGTATATGGT TGTAGGCGGA GCGGTAGCTT TAGCCTTAAC TAAAAACCCT
ATAGCAAAAT GGGTTATGAT TATTGGTATT TTAGGAATCT TCCTATTTTC TGGTAGAGAA
GCTAAATCTA TCGGAGGAAG GATTGGATCA GGTTTCTATG AAGTCTATGG ACTTACGAGC
TGGATAGGAG ATTTCGTATC CTTCCTAAGA CTTATGGCCC TAGTATTATC AGGAAGCTTC
GTGGCTTATT CTGTAAACTT AATTGTAGAC TTGGTCGCAG GTAATGGTTC AATCGGAGGA
ATCATAGCTG GAATTATCAT ATTTGTTGTA TTCCAACTAT TTAACATGTT CCTATCTTAT
CTATCAGCCT ATGTACATGG TCTAAGACTT ATATATGTAG AGATGTTTAA TAAATTTTAT
GAAGGTGGCG GAGTTAAGTT CCGTGAGATG ATTGAAGATA CAAAATTTGT CAAAATATTA
AGAGGAGGAG ACAATGAGTA A
 
Protein sequence
MAIVKMNKFN LLAFKSERDD LLNILQAFNY VHFNDLEEDE QRSYLSEVRN TDQLQKIDSS 
INRADYVINL LENYMKDRDI KPDTSFTELS LEDVKKKGAN FDFDLIYGKI KDLVGQREMA
LGQRNELTNK IEALKPWKDI DVDIQELYDS KRVFVETGTI SDQFYDALEK AIVERNLEKS
LVYKVSELDK TNYIVALSSI EEKEDLVELL REFGYSRVKI NSTSKIGEEI YDLSGKLEDK
EQLIKNLENE ILDFKKYLKD LYIYKAYVLN LRRKEESSEF FLETGLMNVI EGYVPVEATE
RFKKDIKNVL GDAYILDIEE ADKEDSCVPI ILKNNKLVDP YEEVVKTYSL PKYNEVDPSG
LVAIFYTIFT GFMIGDLGYG ALATIAILLA LKLKKFPTST EKNLRLFLRI SLSACVFGAI
FGSVFGGIID VPFGLIDPAT DINELIVMSL VIGAVSLFVA LAVKAYMYIR DGKPMDAMYD
VGFMYMVVGG AVALALTKNP IAKWVMIIGI LGIFLFSGRE AKSIGGRIGS GFYEVYGLTS
WIGDFVSFLR LMALVLSGSF VAYSVNLIVD LVAGNGSIGG IIAGIIIFVV FQLFNMFLSY
LSAYVHGLRL IYVEMFNKFY EGGGVKFREM IEDTKFVKIL RGGDNE