Gene Pnec_0422 details


Gene Information

Locus tag: Pnec_0422
Symbol:
ID: 6183727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 375129
End bp: 376199
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 46%
IMG OID: 641671113
Product: hypothetical protein
Protein accession: YP_001797312
Protein GI: 171463199
COG category: [S] Function unknown
COG ID: [COG4394] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
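
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how this record was produced, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(375129, 376199)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under these assumptions, the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of the record.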


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTGGG ATATTTTCTG TCAAATCGCA GATAACTATG GTGATGCTGG TGTTTGCTGG 
CGTCTAGCCC GAAACTTATC AAGCATTCAT GGACAAGAAG TGCGTATTTT TTGCGATGAT
CTGCCAACCC TCAATTTACT CGCTTCGGGT GTAGATCCAG CGATTAAGCA AAAAATAGAC
CTTCAGCCGT GGGAAGCAAG TTATGTCAAT GCAAGACACC CAGTACAAAC ACCCGACGTG
GTCATAGAAG CATTCGGATG CGAACTACCA GAGCGCTATC TTGCAGGCCT GTTTATAGCC
TCCATCAAAC CCATCATCAT TAATCTGGAA TATCTCAGCG CAGAATCCTG GATTACTAAG
TTCCACGGCA AAGCATCACC CCAGTCTCAT GGAATTCCGA AATATTTTTT CTTTCCAGGG
TTTCAAGATG AGGTAGGCGG CCTATTGCTT GACCCCATCC CCGCTGAGGG GCGCCTCACT
CATGAAGATA TTCCCAAAGA TCTTCAAGTA GCTTGGTCGA AGTTGCGACC TGGAGCAAAA
CGAACTAGTG TATTTTGCTA CCCAGGCGCA CCACTGAAAA AATGGCTAGA GGACCTAGGT
CGCCTTGATA TACAAGTAGA TGTTTTGCTT GCCCATGGTC ATGCGGAACA GCTTAATCTT
TATGGAGAGC AGCCAATCTC ATTGCCAACC AATTTACAGC TGATTTCAAT GCCTTTTGTT
TCTCAAGATG AATATGATTG GGTACTAACG CAATGTGACT TCAATATTGT GCGCGGGGAG
GATTCTTTTA TTCGAGCCCA GTTAGCAGGA AAACCATTTA TTTGGCATAT TTATCCGCAA
GAAGATCGCG CCCATGAAGT GAAATTAGCC GCCTTTCTGG ATCTTTATCT TGATGAGGCC
GATCAAGAGT TAAGGCTTGC CGCAATCTCA GCAATGACCT GGGCAATGCC TAGCGAATGG
TTTGGCAACC TAAGCGTCTG GAACAATCAC GCCGAGCACT GGCGTAGCCA TTTACTCAAA
AAACAAGGGG ATGGCGGCCT GCCAGCGCGT TTAACTCGCT TTGTCGCATA A
 
Protein sequence
MRWDIFCQIA DNYGDAGVCW RLARNLSSIH GQEVRIFCDD LPTLNLLASG VDPAIKQKID 
LQPWEASYVN ARHPVQTPDV VIEAFGCELP ERYLAGLFIA SIKPIIINLE YLSAESWITK
FHGKASPQSH GIPKYFFFPG FQDEVGGLLL DPIPAEGRLT HEDIPKDLQV AWSKLRPGAK
RTSVFCYPGA PLKKWLEDLG RLDIQVDVLL AHGHAEQLNL YGEQPISLPT NLQLISMPFV
SQDEYDWVLT QCDFNIVRGE DSFIRAQLAG KPFIWHIYPQ EDRAHEVKLA AFLDLYLDEA
DQELRLAAIS AMTWAMPSEW FGNLSVWNNH AEHWRSHLLK KQGDGGLPAR LTRFVA
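
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might clean a block-formatted sequence, compute its GC fraction, and translate it. It assumes the standard bacterial amino-acid assignments (translation table 11 uses the same codon-to-residue mapping as the standard code). The helper names `clean`, `gc_fraction`, and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

# Codon table built in the conventional TCAG order (standard assignments,
# which table 11 shares for codon-to-amino-acid mapping).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop any trailing stop."""
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First line of the gene sequence from this record.
first_line = "ATGCGTTGGG ATATTTTCTG TCAAATCGCA GATAACTATG GTGATGCTGG TGTTTGCTGG"
seq = clean(first_line)
print(len(seq))        # 60
print(translate(seq))  # MRWDIFCQIADNYGDAGVCW
```

The translated residues match the first two blocks of the protein sequence listed above. Applying the same functions to the full gene sequence would allow the GC content field to be checked as well.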