Gene Pnuc_1006 details

Gene Information

Locus tag: Pnuc_1006
Symbol:
ID: 5053038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 1003303
End bp: 1004454
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 42%
IMG OID: 640471171
Product: HPP family protein
Protein accession: YP_001155786
Protein GI: 145589189
COG category: [T] Signal transduction mechanisms
COG ID: [COG3448] CBS-domain-containing membrane protein
TIGRFAM ID:
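
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1003303, 1004454)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1152 bp) and Protein Length (383 aa).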


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATCGC TTTTGAGACA TTTTTTTAAA AAGTTAATTA CCATTGTTCT TCCAGAAACT 
ACCTCAGTTT CTTGGTTGGA GCGGTTTAGA TCGGTTTTAG GAGTTGTGCT GGGCTTGCTC
GCAATGATGG GACTTGGTTT ATTGCTTGCA GGTTACGGAA ACGCATCACC ATGGGTTGTC
GCCTCGATGG GTGCCAGTGC ATTTTTATTA TTTGTCTTAC CCTCAAGTCC AATGGCGCAA
CCATGGGCAG TTATAGGGGG TAGTTGTGTT TCTGCTTTGG TAGGTGTTGC GTGTACGCAA
TTTGTCCATG AATTACCATT ATTAATTCCC TTATCTGTTG GACTCGCAAT CTTGGCAATG
TTTGCCTTGC GTTGCTTACA CGCACCTGCT GCAGCTTTAG CCCTTTTAAT TCCTTTGGGT
GGAATGACAG ATTTTCATTT TGTGCTTTTT CCGGTTATGG GTAATGCTGT TTTATTGGTT
TTATGTGCAG TCATTTATAA CTCCTTAACT GGCAAGCCAT ATCCGCAACG CCCTAAGGCC
GTCTTGGATT CATCTCCACT GCAGAAACGA AATCGTAAGA TCGAGGATCA GGAAATTAAC
GCTGTCTTGG AGAGCTATAA CCAAGTACTA GATATCAGTA AGGATGATTT AGCCAATTTG
ATTTCTCAAG TTGAACATGG GGCTTATCAA AAGAAGTTGC AAAGTATGCT ATGTAAAAAT
ATTATGACCA CTGAAGTCTT ATACGTTGGG ATGGATTCAC CTTTGGATCA AGCATGGAAT
TTACTGCGCA AGCGTCATGT TAAAGCTTTA CCAGTCATAG ATGGTGCAAA AAGGGTGCTT
GGAATTATTA CTCTAGAGGA TTTTCTTAAA AGTGCAGCAG TGGATTTCCA CCAAACTTTT
GGACAGCGCA TCAGGGGTTT TATGCGGACA GCTGTTCCCG GTTTGAATAG TTTGCCTAAC
GCTGTTGGGC AGGTGATGAG TAAGCCGGTA AGGGTAATTA GTGAAGATCG AAATATGCTT
GACTTAGCTG AGATATTTTG TGGTGATGGT CATCACCATA TCCCCGTAAT TAATGATAAT
AGGCAGTTGG TAGGAATGAT CACCCAATCT GACTTTGTAA AAGCTATTGA CCAGTCTATT
GATATTAGAT GA
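
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC percentage could be computed from such text. The helper names are hypothetical, and only the first display line is used here for brevity; pasting the full sequence would give the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence above (60 bases).
first_line = "ATGAGATCGC TTTTGAGACA TTTTTTTAAA AAGTTAATTA CCATTGTTCT TCCAGAAACT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```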
 
Protein sequence
MRSLLRHFFK KLITIVLPET TSVSWLERFR SVLGVVLGLL AMMGLGLLLA GYGNASPWVV 
ASMGASAFLL FVLPSSPMAQ PWAVIGGSCV SALVGVACTQ FVHELPLLIP LSVGLAILAM
FALRCLHAPA AALALLIPLG GMTDFHFVLF PVMGNAVLLV LCAVIYNSLT GKPYPQRPKA
VLDSSPLQKR NRKIEDQEIN AVLESYNQVL DISKDDLANL ISQVEHGAYQ KKLQSMLCKN
IMTTEVLYVG MDSPLDQAWN LLRKRHVKAL PVIDGAKRVL GIITLEDFLK SAAVDFHQTF
GQRIRGFMRT AVPGLNSLPN AVGQVMSKPV RVISEDRNML DLAEIFCGDG HHHIPVINDN
RQLVGMITQS DFVKAIDQSI DIR
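
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, so the sketch below builds the standard assignments and stops at the first stop codon. It is an illustration, not the database's own pipeline: the names are hypothetical, and alternative start codons are not given special treatment, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGAGATCGC TTTTGAGACA T"))
print(translate("GATATTAGAT GA"))
```

The two fragments translate to MRSLLRH and DIR, matching the start and end of the listed protein sequence.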