Gene Pnec_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1610 
Symbol 
ID6183967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp1411834 
End bp1414731 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content48% 
IMG OID641672127 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001798298 
Protein GI171464185 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAACG AAATCAAGAT CCGCGGTGCC CGCACCCACA ACCTCAAAAA CATCAATCTA 
GACATTCCTA GAGAGAAATT AGTCGTCCTT ACCGGCTTAT CTGGATCAGG CAAAAGCTCA
TTAGCGTTCG ACACTCTCTA TGCAGAGGGC CAGCGCCGCT ATGTTGAGTC TCTTTCAGCC
TATGCCCGTC AGTTTTTACA ACTTATGGAA AAACCAGACG TTGACACTAT TGAAGGGTTA
TCGCCAGCGA TTTCGATTGA ACAAAAAGCA ACGAGTCATA ACCCGCGATC AACTGTTGGT
ACCGTTACTG AAATTCATGA CTACTTACGT TTGTTGTTTG CACGCGCCGG TACGCCGCAT
TGCCCCGATC ACGACCTGCC GCTAGAAGCG CAAAGTGTTT CACAAATGGT CGATACCGTG
TTGTCGATGC CAGAAGATAC TAAGTTAATG GTTCTGGCTC CAGTAGTGAG TGAGCGCAAA
GGCGAATTCG TTGACCTCTT TCAAGACCTA CAGGCACAAG GCTTTGTGCG CTTTCGAGTG
CGCTCTGGTG GCGGCACAGC AAACGCCGCC AAGGCAGAAA TCTTTGAAGT GGATCAGCTG
CCCACTCTAA AGAAAAACGA TAAGCACTCT ATTGAAATGG TTGTTGATCG AATTAAGGTG
CGCCCAGATA TTCAGCAGCG TTTAGCGGAA TCTTTTGAAA CAGCCTTACG CTTGGCTGAT
GGCAAAGCCA TGATTGTGGA TATGGACACT GGCAAAGAAA TGATTTTCTC CAGTAAGTTC
GCTTGTCCTG TTTGCTCATA CTCATTACAA GAATTGGAGC CTCGCCTCTT CTCCTTCAAT
AACCCCATGG GCGCTTGCCC ATCATGCGAT GGCCTGGGAC ATCAGTCTTT CTTTGACCCA
AAGCGTATCG TTGCACATCC AGATTTATCA CTGGCATCAG GGGCAATTAA AGGCTGGGAT
CGTCGCAATC AGTTTTACTT CAAACTGCTT CAGACGCTTG CCAAGCACGG TGGCTTCGAT
GTTGAAAAGC CATTTGAAAC ACTATCCAAA AAACAGCAAG ATCTCATTCT CTTGGGTTCT
GGTGATGTCA CCATTCCTTT TGAATACATC AACGAGCGCG GCAAAAACAG TATTCGCGAG
CATGCTTTTG AAGGCATTGT TGCCAACTTT GAGAGACGTT ATCGTGAAAC TGATTCGATG
ACAGTTCGCG AAGAATTATC ACGCTATCAA AATGTTCAAA CTTGCCCGGG ATGTAATGGC
AGTCGTTTAC GAAAAGAAGC TCGCTTTGTC GAAGTGGGTG AGAAAAAACA GTCGCGTGCA
ATTTATGAAA TCAGTGCTTT GCCCCTGAAA GAAGCTAAAG AGTATTTTGA AACACTCGAA
CTCAAAGGTG CGAAAAGAGA AATCGCAGAC AAGATTGTTA AAGAGATCAG TGCACGCCTA
CGCTTCTTAA ACGACGTGGG CCTTGATTAC CTATCGCTCG AGCGTAGTGC TGACACCCTT
TCAGGCGGTG AAGCCCAACG AATTCATCTG GCAAGTCAGA TTGGCTCCGG CTTAACTGGC
GTGATGTATG TCCTGGATGA ACCGTCGATT GGTCTGCACC AACGCGATAA CGATCGTTTG
ATTGGAACCC TGAAACACTT ACGTGATTTA GGTAATAGCG TCTTGGTAGT TGAGCATGAT
GAAGACATGA TCCGCGCATC TGACTGGGTA ATTGATATTG GCCCTGGTGC CGGTGTCCAT
GGTGGCGAAG TGGTTGCTCA AGGTACGCCT ACAGAAATTG AAGCAAACCC GAACTCCTTA
ACTGGCGCCT ACCTTGCTGG CCGTGAAGTC ATTGCAGTTC CAGACAAACG CATCCCAGTA
AATGAGCGCT TCTTGGAGAT CATTGGTGCG CGTGGCAATA ACCTGCAATC TGTACACGCC
AAGATTCCAG TTGGACTTCT GACCTGCGTT ACCGGTGTAT CAGGCTCAGG TAAGTCGACC
TTAATTAACG ACACCCTGCA TCATGCAGTT GCACAACATC TGTATGGCTC GAATGCTGAG
CCTGCAGCAC ACGATGCAAT CAAGGGTCTG GAGCATTTCG ATAAGGTGAT TAGTGTTGAC
CAATCTCCGA TTGGTAGAAC TCCACGCTCT AACCCCGCAA CCTATACCGG TCTTTTCACT
CCGATTAGAG AACTCTTTGC TGGTGTTCCT GCATCACGCG AACGAGGCTA TGAAGCTGGC
CGCTTCTCTT TTAACGTTAA AGGCGGTCGC TGTGATTCTT GTGAAGGTGA TGGCGTTCTC
AAAGTAGAGA TGCATTTCTT GCCAGACGTG TATGTGCCTT GCGATGTTTG TCATGGCAAG
CGCTATAACC GCGAAACTTT AGATATTCGT TACAAGGGTA AAAATATTCA TGAAGTGCTA
TCGATGACCA TCGAACAAGC CCATGAATTC TTTGAAGCGA TTCCGGTCGT AAAGCGCAAA
CTCAAAACAC TCCTTGACGT TGGCTTGGGT TACGTAAAAC TGGGGCAAAG CGCAACCACC
CTTTCAGGCG GCGAAGCGCA ACGCGTCAAA CTCTCGCTGG AACTCTCAAA ACGCGACACT
GGCAGAACCT TATACATCCT TGATGAGCCA ACCACTGGCC TGCATTTCCA TGACATTCAG
TTGTTGCTAA CTGTCATTCA AACGCTCAAG AAACAAGGCA ATACGATTGT CATCATTGAG
CACAATCTCG ATGTCATTAA GACGGCTGAT TGGATTATTG ACTTGGGGCC TAAAGGTGGA
GCGGGCGGCG GACAGATCAT TGCCACTGGC ACACCAGAAG ATGTGGCCAA TAATGAAGTG
AGTTTTACAG GTCACTACTT AGCGCCCTTG CTAACTCGTA AGAGTCCTAC TCCGGCAGCC
AGCAAAAAGA AAAAGTAG
 
Protein sequence
MNNEIKIRGA RTHNLKNINL DIPREKLVVL TGLSGSGKSS LAFDTLYAEG QRRYVESLSA 
YARQFLQLME KPDVDTIEGL SPAISIEQKA TSHNPRSTVG TVTEIHDYLR LLFARAGTPH
CPDHDLPLEA QSVSQMVDTV LSMPEDTKLM VLAPVVSERK GEFVDLFQDL QAQGFVRFRV
RSGGGTANAA KAEIFEVDQL PTLKKNDKHS IEMVVDRIKV RPDIQQRLAE SFETALRLAD
GKAMIVDMDT GKEMIFSSKF ACPVCSYSLQ ELEPRLFSFN NPMGACPSCD GLGHQSFFDP
KRIVAHPDLS LASGAIKGWD RRNQFYFKLL QTLAKHGGFD VEKPFETLSK KQQDLILLGS
GDVTIPFEYI NERGKNSIRE HAFEGIVANF ERRYRETDSM TVREELSRYQ NVQTCPGCNG
SRLRKEARFV EVGEKKQSRA IYEISALPLK EAKEYFETLE LKGAKREIAD KIVKEISARL
RFLNDVGLDY LSLERSADTL SGGEAQRIHL ASQIGSGLTG VMYVLDEPSI GLHQRDNDRL
IGTLKHLRDL GNSVLVVEHD EDMIRASDWV IDIGPGAGVH GGEVVAQGTP TEIEANPNSL
TGAYLAGREV IAVPDKRIPV NERFLEIIGA RGNNLQSVHA KIPVGLLTCV TGVSGSGKST
LINDTLHHAV AQHLYGSNAE PAAHDAIKGL EHFDKVISVD QSPIGRTPRS NPATYTGLFT
PIRELFAGVP ASRERGYEAG RFSFNVKGGR CDSCEGDGVL KVEMHFLPDV YVPCDVCHGK
RYNRETLDIR YKGKNIHEVL SMTIEQAHEF FEAIPVVKRK LKTLLDVGLG YVKLGQSATT
LSGGEAQRVK LSLELSKRDT GRTLYILDEP TTGLHFHDIQ LLLTVIQTLK KQGNTIVIIE
HNLDVIKTAD WIIDLGPKGG AGGGQIIATG TPEDVANNEV SFTGHYLAPL LTRKSPTPAA
SKKKK