Gene Apre_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1011 
Symbol 
ID8397798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1084072 
End bp1085820 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content36% 
IMG OID644995359 
ProductH+transporting two-sector ATPase alpha/beta subunit central region 
Protein accessionYP_003152760 
Protein GI257066504 
COG category[C] Energy production and conversion 
COG ID[COG1155] Archaeal/vacuolar-type H+-ATPase subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAA AAATTAAAAC AATAAATGGT CCTGTTGTAA TTGCGACAGA TGCGAAAATC 
TTGACTGTTC GTGAGATGGT TTCAGTTGGT AAGCTAAAGC TTATAGGTGA AGTAATATCA
CTTGAGGGTG ATCTTGCGAC AATTCAGGTT TATGAAGATA CATCAGGACT TAAAGTAAAT
GAAGATATAA TTCCAACTGG TAGACCTCTT TCAGTAAGAC TGGGCCCAGG TATGCTTGGA
AACATGTTTG ATGGTATCCA AAGACCCTTA AAAAATATAA TGGATAAAAA TGGATCCTTT
ATTCCTTCAG GAATAGGATT TGAAAATCTA GATACTGAGA AAAAATGGGA TGTCAAACTT
ACTGTTAAGG TTGGAGATAA GATTAAAAGA GGCGACATCT ACGCTACCAT AAAAGAAACT
GAAACTATCG AGCATAGACT TATGACTCAA GTTAGTGGTG AAGTTGTTGA AGTAGCAGAA
GATGGTTTGT ACACCCTTGA GGATACTGTA GTAAAGATTA AGACAGAAAA AAATGTGGTT
GAGGAAAAAC TATACCAATA TTGGCCAGTT AGAAATCAAA GACCAGTTAT GGCTAATATG
CCAATAGAGA AGCTATTTGA GACAGGTCAA AGGGTCCTTG ATGTTTTCTT CCCTCTTGCT
AAGGGAGGTA CTGTTGCAAT TCCTGGTGGA TTCGGTACAG GAAAGACCAT GCTTCAACAC
CAGCTTGCCC AATATTCTGA CGTAGATATA ATAATTTATA TTGGATGTGG AGAGCGTGGA
AACGAGATGA CCCAGGTTCT TGAGGAGTTC CCTGATCTTA TAGATCCAAA CACAGGTAAG
GGACTTATGG AAAGGACTAT CTTGATTGCT AACACTTCAA ACATGCCGGT AGCAGCTAGG
GAAGCATCTA TCTATACTGG TATTACTATG GCTGAATACT TTAGAGACAT GGGTTATGAT
GTAGCTCTTA TGGCAGACTC ATCATCAAGA TGGGCAGAGG CCTTACGTGA AATATCAGGA
AGACTTGAAG AAATCCCAGC CGAGGAAGGC TATCCTGCAT ATCTTGGATC AAGACTTTCA
CAATTTTATG AAAGAGCGGG ATATTTCAAA AACTTAAACG GAAGCGAAGG TTCTGTTACT
CTTATCGGGG CCGTATCTCC TTCAGGAGGA GACTTCTCAG AGCCTGTAAC AGAAAATACT
AGAAGATGTG TAAATGTATT TTTAGGACTT GATAGAAAGC TTGCTTATTC AAGACACTAT
CCAGCTATAA ACTGGCTTTC TTCTTATTCT AATTATATTG AAAAAATCAG AGCCTACTAT
GAGGAAAGAC TGGGTAAGGA CATAATTGCT ATAAGAAATG AATTGATGAA TGTCCTTCTT
GAAGAAGACG AAGTTAGATC TATCATGATG CTAGTTGGTG AAGATGCACT TTCTAATAGT
CAAAAGAACA TATTAGATGT TTCTGGTCTT ATAAGAAATG GCTTCTTACA ACAAAATGCT
TACAATGATA TTGATAAATA TGTACCACTT GAAAAACAAG TCAAGATGCT CGACATTATT
TATAATTATT ATCAAAAGAG TAAAGCAGCT ATTAGTGGTG GTCTTAGCCA TAAGGCTATA
TATGATTCAA ATCTAATTAA CGAGATTACT CAAATGAAAT ATAATATAAA AAATGATGAG
TTAGACAAGT TTATTGATCT TAATTCAAAG ATAAACAATC ATTTTGTAAG TGTGAAGGAA
GGTTTATAG
 
Protein sequence
MNPKIKTING PVVIATDAKI LTVREMVSVG KLKLIGEVIS LEGDLATIQV YEDTSGLKVN 
EDIIPTGRPL SVRLGPGMLG NMFDGIQRPL KNIMDKNGSF IPSGIGFENL DTEKKWDVKL
TVKVGDKIKR GDIYATIKET ETIEHRLMTQ VSGEVVEVAE DGLYTLEDTV VKIKTEKNVV
EEKLYQYWPV RNQRPVMANM PIEKLFETGQ RVLDVFFPLA KGGTVAIPGG FGTGKTMLQH
QLAQYSDVDI IIYIGCGERG NEMTQVLEEF PDLIDPNTGK GLMERTILIA NTSNMPVAAR
EASIYTGITM AEYFRDMGYD VALMADSSSR WAEALREISG RLEEIPAEEG YPAYLGSRLS
QFYERAGYFK NLNGSEGSVT LIGAVSPSGG DFSEPVTENT RRCVNVFLGL DRKLAYSRHY
PAINWLSSYS NYIEKIRAYY EERLGKDIIA IRNELMNVLL EEDEVRSIMM LVGEDALSNS
QKNILDVSGL IRNGFLQQNA YNDIDKYVPL EKQVKMLDII YNYYQKSKAA ISGGLSHKAI
YDSNLINEIT QMKYNIKNDE LDKFIDLNSK INNHFVSVKE GL