Gene Pnuc_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnuc_2073 
Symbol 
ID5052583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 
KingdomBacteria 
Replicon accessionNC_009379 
Strand
Start bp2140109 
End bp2142313 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content46% 
IMG OID640472248 
Producthypothetical protein 
Protein accessionYP_001156848 
Protein GI145590251 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03549] conserved hypothetical protein TIGR03549 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATTA AAGTCAACTT TCTCGATAAG CTACGCCTTG AAGCGAAGTT TGATGATTTC 
ACAGTAATTG CTGATCAACC CATTCGATAC AAAGGAGATG GTTCGGCGCC AGGCCCCTTT
GATTATTTTT TGGCTTCATC GGCCTTGTGC GCAGCGTATT TTGTAAAGCT CTATTGCGAG
ACTCGCAACA TTTCAACTGA CAATATTCGC CTATCGCAGA ATAATATTGT TGATCCAGAC
AATCGCTATC AGCAAATTTT TAAGATTCAG GTTGAATTGC CAGCTGATAT CTCTGCCAAT
GATCGTCAGG GAATTTTGCG CTCCATCGAG CGCTGTACAG TTAAAAAAGT AGTGCAAGCT
GGGCCAGAGT TTGTCATCGA AGTAGTGGAG AACCTGGACT CCGACGCACA GAGCCTATTA
GCCTTGAAGC CCGCTGCTGA TGCCAGTACT TTTATAGCAG GCAAAGATCT ACCCTTGGAA
CAAACGATTG CCAATATGTC GGGCGTTTTG GGCAATCTTG GGATTAAGAT CGAGATTGCT
TCGTGGCGAA ATATTATCCC CAATGTATGG TCACTACACA TTCGTGATGC GCATTCACCA
ATGTGTTTTA CGAATGGCAA AGGCTCAACG AAAGAAAGCG CTTTAGCATC GGCCTTAGGT
GAATATATAG AGCGCCTCAG TAACAACCAT TTTTATGCGG GCACTTTTTG GGGTGAAGAC
ATTGGCAATG CAGAATTTGT GCATTACCCA AGTGAGCGCT GGTTTCAGCC TGGCCCGAAT
GACACACTAC CAACAGAAAT CCTAGACGAC TACTGTCGAA CAATTTATGA CCCCGACGGA
GAGCTGCGCG CAATACATCT GATTGATACC AATTCTGGCA ATGTAGACCG TGGCATCTGC
TCATTGCCAT ACATTCGTCA GTCTGATGGG AGGGTAGTTT ATTTTCCTTC CAACTTAATT
GAGAACCTCT TTGTCAGTAA TGGCATGAGT GCTGGTAATA CACTGGCTGA GGCGCAAGTG
CAATGCTTGT CAGAGATTTT CGAACGCGCC GTCAAGCGTG AAATTCTGGA AGGCGAAATT
GCTTTGCCAG ATGTGCCGCA GGAGGTAATA GCTAAGTACC CAAGGATTCT TGCCGGCATT
CAGGGCCTAG AGGAACAGGG CTTCCCAGTA TTGGTAAAAG ATGCATCGCT GGGCGGCATT
TATCCGGTAA TGTGCGTCAC TTTAATGAAT CCTCGAACAG GTGGCGTGTT TGCCTCATTC
GGCGCTCATC CAAGCTTAGA GGTTGCACTA GAGCGGAGCT TGACTGAGCT ACTACAAGGG
CGAAGTTTAG AGGGCTTAAA CGACTTACCA CCACCTACTT TTGCAAGCGA AGCGGTAACC
GAGCCAAATA ACTTTGTAGA GCATTTCATT GATTCGAGTG GGATTGTTTC ATGGCGCTTT
TTCAGCGCAA AACCAGATTA TGAGTTTGTT CAGTGGGACT TCTCTAGCCA TGGTGAAAAC
TCGAATGCCG ATGAAGCTGC AACATTGTTT GGCATTCTTA AAGCCATGGG CAAAGAATCT
TATGTGGCCG TGTATGACGA GCTTGGTGCA ATTGCCTGTC GCATCTTAGT GCCTGGTTAT
TCTGAGGTGT ATCCAGTAGA GGATCTGATT TGGGATAACA CCAATAAAGC GCTATTGTTC
CGCAACGATA TTTTGAATTT ATCTCGCTTG GATAATGTCG GTCTTGAAGG ATTGCTTGAG
CGCTTAGAAA ACAATGAGCT TGATGAGTAT GGTGACATTG CTACGTTAAT CGGTGTTGAG
TTTGATGAGA ATACGGTTTG GGGTCAGCTA ACTGTTCTTG AGCTCAAGCT ACTCATCCAT
CTTGCTTTGA GGCAACTAGA TGAAGCACAT GAATTGGTCG GGGCTTTTCT TCAATACAAC
GACAACACTA TCGATCGAAA GCTTTTTTAC CAAGCCTTAG ATGCAGTGCT TGAGGTAAAG
TTAAATGAGG ATTTAAAGCT TGAGGACTAT ATTGCCAACT TCCGCCGCAT GTTTGGTAAT
CCTAGAATGG ATGCTGTAGT AGGCTCCGTG GAAGGCGGCA TCCGGTTCTT TGGTTTAACA
CCAACCAGTA CAAAGCTGGA AGGCCTCGAT AGACATCACA AGATGATCGA TAGTTATAGA
AAATTGCATG CGGCACGAGC CAAAGCTTCT GCCAGTGGGC AATGA
 
Protein sequence
MEIKVNFLDK LRLEAKFDDF TVIADQPIRY KGDGSAPGPF DYFLASSALC AAYFVKLYCE 
TRNISTDNIR LSQNNIVDPD NRYQQIFKIQ VELPADISAN DRQGILRSIE RCTVKKVVQA
GPEFVIEVVE NLDSDAQSLL ALKPAADAST FIAGKDLPLE QTIANMSGVL GNLGIKIEIA
SWRNIIPNVW SLHIRDAHSP MCFTNGKGST KESALASALG EYIERLSNNH FYAGTFWGED
IGNAEFVHYP SERWFQPGPN DTLPTEILDD YCRTIYDPDG ELRAIHLIDT NSGNVDRGIC
SLPYIRQSDG RVVYFPSNLI ENLFVSNGMS AGNTLAEAQV QCLSEIFERA VKREILEGEI
ALPDVPQEVI AKYPRILAGI QGLEEQGFPV LVKDASLGGI YPVMCVTLMN PRTGGVFASF
GAHPSLEVAL ERSLTELLQG RSLEGLNDLP PPTFASEAVT EPNNFVEHFI DSSGIVSWRF
FSAKPDYEFV QWDFSSHGEN SNADEAATLF GILKAMGKES YVAVYDELGA IACRILVPGY
SEVYPVEDLI WDNTNKALLF RNDILNLSRL DNVGLEGLLE RLENNELDEY GDIATLIGVE
FDENTVWGQL TVLELKLLIH LALRQLDEAH ELVGAFLQYN DNTIDRKLFY QALDAVLEVK
LNEDLKLEDY IANFRRMFGN PRMDAVVGSV EGGIRFFGLT PTSTKLEGLD RHHKMIDSYR
KLHAARAKAS ASGQ