Gene Pnec_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1110 
Symbol 
ID6184018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp969962 
End bp971203 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content47% 
IMG OID641671720 
ProductBNR/Asp-box repeat protein 
Protein accessionYP_001797897 
Protein GI171463784 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4692] Predicted neuraminidase (sialidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTA TTGTTGCCCT GTGTTTTTTA TTGCTCGCCG CAGTGATCGG GTTTCTGCAT 
ATTGATAGTT GTCCGAGTTG GGCACCTTTT GCTTTATCCT CTGCACTCCA AGCAGAGGAT
CAGGGTGATG GGTTGGTAGA AATCAAGCCT AAAGCTTTGT CCAAGGCAAT TATTCCAGCA
TCACAAACCA ATTGGCTCCC CGATACAGGT GCTGCGTCAG TTCATGCGGC CTCTATGATT
GCTTTAAAAG ATGGCGCAGT TCGGGTGTTT TGGTTTGCAG GCAGTCGCGA GGGCGCTGCT
GATGTGGCGA TCTATAACTC TGTATACGAC CCCCATTCAA CAAATTGGAG TGCACCTACC
GTTGTAATGG ACCGCGTAAG CGCTGAGAAG GGTTTGTTGC GTTACATTGC CAAATTAGGC
AATCCTGTAC CCACTAGATT GGTTGATGGA AGGTTGCAAT TATTTTTCGT AACGGTATCG
ATTGGCGGAT GGGCGGGTAG CTCGATTTCT GCAATCACTT CGGATGATGA GGGTTTAACT
TGGAGTAGTC CTCAGCGTTT GATCAGTTCG CCTTTATTGA ATCTGAGTAC TCTGGTGAAG
TCACCTGGCG TTATGTTTGT TGATGGATTA ATGGGTATGC CCGCCTATCA TGAGTGGGTA
GGGCGCTTTG GTGAATTCTT GAGGGTAGAT GCGGGCCGAG TCATTGATAA ACGACGTATG
AGCTCAGGGC GCGGCGCAAT TCAGCCGTTA GTTTTTGTCA ATGATGCCCA AGACGCTAGT
GCTTTTTTTC GGCAAACGCG CAGTGCAGGT TTGCCAAAAC AAATTCCAGT TAGCTACACC
CAAAATGCAG GTCAAAACTG GCATCAGTCT GAAGATTTAG CAATTGCCAA CCCAAATTCT
GCTGTAGCAG GCGTGATTCT TAAGAGTGGC ACCCGCATAT TGGTTTTAAA TGATATTGAG
TATGGTCGTC ATCGCCTGGT TTTAATGATG AGCAGTCCTA AAAATGGACA ATGGCAAACC
GTGGAGGTAT TAGAGGATGA TGAAGCTCTG CCTGATATCC AGCGTAAAGA ATTTTCCTAT
CCGTACTTGA TTACCGTTGA TGGTGAGGAT GCGCATTTGG TATATACCTG GGATCGAAAA
AAGATTCGTC ATCGCTATTT TTCAAGCGCT TGGTTAAAGC ACGCATTTAG TAAGGTACAG
ATACAAGCAG CGGATGTACC AAGTCAGGAG GCTCAGCAAT GA
 
Protein sequence
MSRIVALCFL LLAAVIGFLH IDSCPSWAPF ALSSALQAED QGDGLVEIKP KALSKAIIPA 
SQTNWLPDTG AASVHAASMI ALKDGAVRVF WFAGSREGAA DVAIYNSVYD PHSTNWSAPT
VVMDRVSAEK GLLRYIAKLG NPVPTRLVDG RLQLFFVTVS IGGWAGSSIS AITSDDEGLT
WSSPQRLISS PLLNLSTLVK SPGVMFVDGL MGMPAYHEWV GRFGEFLRVD AGRVIDKRRM
SSGRGAIQPL VFVNDAQDAS AFFRQTRSAG LPKQIPVSYT QNAGQNWHQS EDLAIANPNS
AVAGVILKSG TRILVLNDIE YGRHRLVLMM SSPKNGQWQT VEVLEDDEAL PDIQRKEFSY
PYLITVDGED AHLVYTWDRK KIRHRYFSSA WLKHAFSKVQ IQAADVPSQE AQQ