Gene Paes_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0474 
Symbol 
ID6460716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp508018 
End bp510858 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content52% 
IMG OID642724473 
Productexcinuclease ABC, A subunit 
Protein accessionYP_002015177 
Protein GI194333317 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0183853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCAT TCAGTCATAT CATCATACGA GGGGCCAGGG TTCACAACCT GAAAAACATC 
TCCCTCGACA TTCCCCGCAA CAGGCTCGTG GTGATCACCG GCCTGTCGGG ATCCGGAAAA
TCCAGTCTGG CATTCGATAC CATCTATGCC GAAGGGCAGC GACGTTTCAT GGAAACCCTT
TCGGCCTATG CGCGTCAGTA CATCGGCAAT ATCGAACGTC CTGATGTGGA CAGCATTGAC
GGGCTCTCTC CGGTCATATC CATCGACCAG AAAAGCACCA GTCGATCACC CCGTTCGACG
GTCGGCACCA TTACTGAAAT CCATGATTTT ATCCGGCTGC TCTACGCCAA AGCAGGGCGG
CGCTACGATC CCGCAACCGG CCATATGTTG CAGAAACAAA GCGAGGAGTT CATTACTGAA
TCCATCCTCA AGCTGCCCGA AGGAACGAAA GTTCAGCTCC TGGCCCCTCT GGTTACCGGC
AGAAAAGGAC ACTATCGCGA GCTGTTCGAT ACCCTCCTCA AAAAGGGGTT TCTGCGCATC
CGCATCGATG GTGAATTCCG GGAAATGGAA AAAGGGATGC AAATCGAGCG CTACAAGAGC
CATACTATCG AACTCGTTGT TGACAGGCTG GTCATTACCT CCGGCGTCAA CGAACGTCTG
AGGGAGGCTG TCAAACTGGC TGTCGGGATG TCCGAACATA AATCCTCTGT CATCTGCGCT
CCGTTTGAAA GCGACGAAAA AGAACAATTT TTCAGCACCA AATACGCCTA TTCAGACGGT
TCTGTTCCGA TCGACACCCT TGCACCCAAT AATTTCAGTT TCAACTCTCC CTATGGCGCC
TGCCCTGAAT GCAACGGCCT CGGCGAGATT CGCCAGTTGT CGGCAGATCT GATGGTCCCC
GACAAGTCGC TCTCGATCAA TCAGGGAGCG ATCGAACCAT TCGGCAAACC GGGAAAACGC
AATCTCTGGC AGGTCATCAG AGCGATCGCA AAAACATTCA ACTTCGATCT CGACACGCCG
TTTTCAAAAA TCCCGAAAGC CGCTGCCGAT ATTCTGCTCA ATGGTTCCGG AAAAAAAACC
TTTGACGTCA CCTACAGCTA TGCAGGCAAG GAACATCTCT ATCCGCAGGA TTTCCCCGGA
GCCATCAGCT ATGTGCGCGA ACTCCAGAAA AACTCAGGAA CAGCAAAAAT AACTGAATGG
GCAGAAAGCT TCATGTTCCG CCAGCCATGT CCGGAGTGCG GCGGAGCCCG TTTGAGAAAA
GAAAGCCTGC AGGTTAAAAT CGGTGACAAA AACATCCATG AGCTCGAGTC CATTCCTCTG
CCTGAATCAC TTCAGCTCTT TCGGGAACTT CCAGCAAAAC TGACGGAGAA GGAAAACCTC
ATCGCGACCC CGATTCTGCA CGAAATCACG AAACGAATAG AGTTCCTGCT CAATGTCGGC
CTTGACTACC TGGCACTCAA CAGAAGTTCC CAGACCCTCT CCGGCGGCGA AGCACAGCGC
ATTCGCCTCG CATCACAGCT CGGATCGCAG CTCAGCGGGG TTCTTTACGT GCTTGACGAG
CCAAGCATCG GACTGCATCA GCGTGACAAC AACAAACTGA TCACATCGCT GCAGCACCTG
CGCGATATCG GCAACACCGT TCTCGTGGTT GAGCATGACA AAGACACCAT GCTTGCCGCT
GATGAGATTA TCGATCTCGG ACCGGGAGCC GGAGAACATG GCGGCGAGCT TGTCGCTCAG
GGACTGGCCG ATAAGCTGAA CCCTCATTCT TTAACCGCTC AATACCTGAA CGGCCAGCTC
GAGGTCTGCA CAGAAAAAGC GGACAGAACA GAAGAAAGTG AAAACCGTTT CATCACCCTC
CGCGGGTGCC GCGGCAACAA TCTGAAAGGG ATTGATGTCA GCTTTCCGCT CGCCAGACTG
ATCTGCGTCA CAGGAGTAAG CGGTTCAGGC AAATCAACCC TGATCAATGA AACACTCTAT
CCGGTTCTGG CAAGGCACTT CTACCGATCC AAGATTCAGA CCTATCCCTA CGAAAGTATT
GAAGGGCTGA AACTCCTCGA CAAGGTCGTC AACGTCGACC AGTCCCCGAT AGGCAGGACA
CCGAGGTCAA ACCCTGCGAC CTATACCGGA GCGTTTACCT TCATCCGGGA CTTTTTCGCC
CGCCTTCCTG AAGCCCAGAT TCGAGGCTAT AAACCAGGAC GGTTCAGTTT CAACGTCAAG
GGAGGACGCT GTGAAACCTG TCAAGGTGCC GGAACGAAAA AAATCGAGAT GAATTTTCTG
CCCGATGTCT ATGTCGAATG TGATACGTGC AAAGGCAAAC GCTACAACCG CGAAACACTG
CAGGTCCGCT ACAAGGGAAG TTCCATCGCT GACGTCCTTG ATATGACCGT CGAGGAAGCG
CTGGGATTCT TTGAAGATTT CCCGCGAATA CGCCGTATTC TTTCCACCAT GCAGAGTGTC
GGGCTCGGCT ATATAAAGCT CGGGCAGCAG TCGCCTCTCC TCTCGGGAGG TGAAGCGCAA
AGAATAAAAC TATCGGCTGA ACTAGCCAAA ATTCAGACTG GAAATACCCT CTATATCCTT
GATGAACCAA CTACCGGCCT TCATTTCCAG GACATCCAGC ACCTCCTTGA CGTTCTGCAG
AAACTTGTCG ACAAAGGCAA CACGGTTATC GTCATCGAAC ACAATCTCGA TATCATCAAA
AATGCTGACT GGATCATAGA CCTCGGCCCG GAAGGAGGCA ATAAAGGCGG GAACCTGGTC
GCTGAAGGAA CCCCGGAAGT CATTGCCGCT GCACAATGCT CTCACACAGG AAAATTTCTT
GCTGCGGAAC TGAAGCCTTG A
 
Protein sequence
MSAFSHIIIR GARVHNLKNI SLDIPRNRLV VITGLSGSGK SSLAFDTIYA EGQRRFMETL 
SAYARQYIGN IERPDVDSID GLSPVISIDQ KSTSRSPRST VGTITEIHDF IRLLYAKAGR
RYDPATGHML QKQSEEFITE SILKLPEGTK VQLLAPLVTG RKGHYRELFD TLLKKGFLRI
RIDGEFREME KGMQIERYKS HTIELVVDRL VITSGVNERL REAVKLAVGM SEHKSSVICA
PFESDEKEQF FSTKYAYSDG SVPIDTLAPN NFSFNSPYGA CPECNGLGEI RQLSADLMVP
DKSLSINQGA IEPFGKPGKR NLWQVIRAIA KTFNFDLDTP FSKIPKAAAD ILLNGSGKKT
FDVTYSYAGK EHLYPQDFPG AISYVRELQK NSGTAKITEW AESFMFRQPC PECGGARLRK
ESLQVKIGDK NIHELESIPL PESLQLFREL PAKLTEKENL IATPILHEIT KRIEFLLNVG
LDYLALNRSS QTLSGGEAQR IRLASQLGSQ LSGVLYVLDE PSIGLHQRDN NKLITSLQHL
RDIGNTVLVV EHDKDTMLAA DEIIDLGPGA GEHGGELVAQ GLADKLNPHS LTAQYLNGQL
EVCTEKADRT EESENRFITL RGCRGNNLKG IDVSFPLARL ICVTGVSGSG KSTLINETLY
PVLARHFYRS KIQTYPYESI EGLKLLDKVV NVDQSPIGRT PRSNPATYTG AFTFIRDFFA
RLPEAQIRGY KPGRFSFNVK GGRCETCQGA GTKKIEMNFL PDVYVECDTC KGKRYNRETL
QVRYKGSSIA DVLDMTVEEA LGFFEDFPRI RRILSTMQSV GLGYIKLGQQ SPLLSGGEAQ
RIKLSAELAK IQTGNTLYIL DEPTTGLHFQ DIQHLLDVLQ KLVDKGNTVI VIEHNLDIIK
NADWIIDLGP EGGNKGGNLV AEGTPEVIAA AQCSHTGKFL AAELKP