Gene Apre_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1406 
Symbol 
ID8398216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1509713 
End bp1512520 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content43% 
IMG OID644995771 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003153150 
Protein GI257066894 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACAGAAA TTAAGATAAA AGGAGCAAGA ACCAACAATC TCAAAAACGT AGATATCAAT 
CTTCCAAGAG ATAAGATGAT TGTATTTACA GGCCTTTCAG GCTCTGGCAA GTCTACTCTT
GCCTTTGATA CCATCTATGC GGAAGGACAA AGGCGTTATG TCGAAAGCTT ATCTTCCTAT
GCTAGGCAGT TTTTGGGCAA TGTAGATAAG CCGGATGTGG ACTCCATTGA GGGACTTTCC
CCATCCATAT CAATCGACCA GAAGACAACT AACAGGAACC CAAGATCTAC AGTAGCAACG
GTTACAGAAA TCTACGATTA TTATAGGCTC CTTTACGCAA GAGTAGGAGA TGCCTATTGT
CCAGTCTGTG GTAGACCGAT AGAAGCCCAA AGTATAGACC AGATGGTCGA TAGGGTAAAG
GAACTTCCTG AAAGAACGAG AATCCAAATT CTTGCCCCAG TAATCAGAGG AAAAAAGGGG
GCTCACAAAA GAGCCCTAGA AAATATCCAG AAGGATGGAT ATGTAAGGGT CGTAATCGAT
GGAGAAAAGT ACGACTTGGC AGAAGATATC GACCTATCCA AGACCAAAAA GCACGATATT
TCTGTCGTTG TAGATAGGAT TGTGATCAAG GATGGAATCG ATGCGAGACT TACAGATTCC
ATAGAGACAG CCCTAGGACT TGCAGATGGC CTAGTTATCA TAGATGTGAT AGGAGAAGAT
GAGATACTCC TATCTTCTAA GCTCGCCTGT CCAGAAGGCC ACGTATCCCT CCCAGAGATT
ACACCAAATA TGTTCTCCTT CAACGCCCCA ATAGGGATGT GTCCCGACTG TAATGGACTT
GGTTTTCACC TTCAAGTCGA CAGGGACCTT GTTATACCAG ACTACGACCT ATCTATAAAC
GAGGGGGCTA TTGACCCTTA TGCAACTTCC ACCAAGGAGA GCTATTATTA TGAGATGATA
AGAGCTATAG CAGATCATTA CAAGTTCTCC CATGATGATC CAATAAAAAA GGCTCCTAAG
AAAATGATCG AAGATATACT CCATGGTACA AATTACGACC TAAGTTTTGT CTTCGATAGT
CATTTTTCTG GTAGGAAAAG ATACACCGGA CCCTTTGAAG GAGCAATCAA AAATATCTAC
GACCGCTATC AAAGGACAGG ATCTGATGCT CAAAAGAAAA AGTTTAGAGA ATATATGTCG
GAAGAAGAAT GTGACACCTG CCATGGAGAT AGACTTAAGC CAGAAGTCCT CGCTATCAAG
GTCGGAGGAG TAAATATTTC AGAGCTTACA AGACTTTCCG TAGAAAAGTC CGTAGACTTT
TTCGCTAAAC TCGAACTTTC TCCAATGAAG GCGAAGATTG CAGACCTTAT AGTAAAGGAA
ATTAAGGCGA GGCTCTCCTT CCTAAATGAC GTGGGACTTA CCTATTTGAC CTTAAATAGG
GCGGCAGCGA CCTTATCCGG TGGAGAAAGC CAGAGAATCA GACTAGCTAC TCAAATAGGA
TCTGGTCTTG TAGGAGTTTG CTATGTCCTT GATGAGCCTT CCATAGGACT TCACCAGAGA
GATAACGATA AGCTCATAGC AGCCCTAAGA AATCTCACAG ATATTGGAAA CACCCTAATC
ATAGTAGAAC ACGACGAAGA TACTATGAAG GAAGCTGACT ATATAGTTGA TATTGGACCA
AAGGCAGGAG TTCACGGTGG AGAGATTGTC GCCAAGGGAA GTCTTCGTGA TATAATGAAT
TCCAAAAAGT CAATCACAGG AGACTATCTA GCTGGAAGGA AAAAAATCCC AGTTCCTAAG
GAAAGAAGAA CAAGCGATGA ATATATAGAA ATAAAGGGGG CTGCGGTAAA CAACCTCAAA
AACATTGACG TCAAAATCCC CCTAGGAGTC CTTACAACTG TAACTGGAGT ATCAGGATCA
GGGAAGTCTT CCCTAGTAAA TGAAATTCTC TATAAGCAAG CTACAAAGAA GATTAACAAG
ACCAAGATAA GAGCTGGTAA GCACAAGGAA ATCCTAGGTC TTGATAAGAT TGATAAGGTA
ATAGCCATAG ACCAATCCCC AATTGGAAGA ACTCCAAGGT CAAACCCAGC TACCTACACC
AAGGTCTTCG ATGCCATAAG GGATGTCTTC GCCATGACCA ATGAGGCCAA GATGAAAGGC
TATGATAAGG GGAGATTTTC CTTTAACGTT AAGGGAGGAA GGTGCGAGGC CTGCAAGGGA
GATGGAACAA TCAAGGTGGA TATGATGTTC TTGCCAGATG TATACGTTCC ATGCGAAGTC
TGCCACGGCA AAAGATACAA CAGGGAAACC CTTGAAGTTA AATACAAGGG CAAAGATATA
TCAGACGTCC TAGATATGAC TGTAGAAGAA GGAATAGAAT TTTTCCAAAA CCATCCTAGC
ATAGTAAGAA AGCTCCAAAC TCTCTACGAT GTAGGCCTAG GCTATATTAA AATAGGCCAA
CCATCCACAG AGCTATCTGG AGGAGAGGCT CAAAGGGTGA AACTAGCAAC AGAACTTGCT
AAAGTCTCCA CAGGAAAAAC CCTCTACATC CTAGACGAAC CAACAACAGG CCTTCACATG
GCAGACGTCC ACAAGCTAAT AGAAGTTCTA AACAGACTAG TAGATCAGGA TAATACAGTA
GTTGTAATTG AGCACAACCT AGACGTAATA AAAGTCTCCG ACAACCTAAT AGACCTAGGT
CCAGAAGGAG GAGACGGGGG AGGTACTCTA GTAGCGAGCG GTACTCCAGA AGAGATAGCA
AAAAACAAAA AGTCCTATAC TGGGCAGTAT CTTAAAAAGA TATTATAG
 
Protein sequence
MTEIKIKGAR TNNLKNVDIN LPRDKMIVFT GLSGSGKSTL AFDTIYAEGQ RRYVESLSSY 
ARQFLGNVDK PDVDSIEGLS PSISIDQKTT NRNPRSTVAT VTEIYDYYRL LYARVGDAYC
PVCGRPIEAQ SIDQMVDRVK ELPERTRIQI LAPVIRGKKG AHKRALENIQ KDGYVRVVID
GEKYDLAEDI DLSKTKKHDI SVVVDRIVIK DGIDARLTDS IETALGLADG LVIIDVIGED
EILLSSKLAC PEGHVSLPEI TPNMFSFNAP IGMCPDCNGL GFHLQVDRDL VIPDYDLSIN
EGAIDPYATS TKESYYYEMI RAIADHYKFS HDDPIKKAPK KMIEDILHGT NYDLSFVFDS
HFSGRKRYTG PFEGAIKNIY DRYQRTGSDA QKKKFREYMS EEECDTCHGD RLKPEVLAIK
VGGVNISELT RLSVEKSVDF FAKLELSPMK AKIADLIVKE IKARLSFLND VGLTYLTLNR
AAATLSGGES QRIRLATQIG SGLVGVCYVL DEPSIGLHQR DNDKLIAALR NLTDIGNTLI
IVEHDEDTMK EADYIVDIGP KAGVHGGEIV AKGSLRDIMN SKKSITGDYL AGRKKIPVPK
ERRTSDEYIE IKGAAVNNLK NIDVKIPLGV LTTVTGVSGS GKSSLVNEIL YKQATKKINK
TKIRAGKHKE ILGLDKIDKV IAIDQSPIGR TPRSNPATYT KVFDAIRDVF AMTNEAKMKG
YDKGRFSFNV KGGRCEACKG DGTIKVDMMF LPDVYVPCEV CHGKRYNRET LEVKYKGKDI
SDVLDMTVEE GIEFFQNHPS IVRKLQTLYD VGLGYIKIGQ PSTELSGGEA QRVKLATELA
KVSTGKTLYI LDEPTTGLHM ADVHKLIEVL NRLVDQDNTV VVIEHNLDVI KVSDNLIDLG
PEGGDGGGTL VASGTPEEIA KNKKSYTGQY LKKIL