Gene Pnec_1111 details

Gene Information

Locus tag: Pnec_1111
Symbol:
ID: 6183530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 971200
End bp: 972921
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 46%
IMG OID: 641671721
Product: glycosyl transferase, family 39
Protein accession: YP_001797898
Protein GI: 171463785
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
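
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(971200, 972921)
print(length)                     # 1722
print(protein_length_aa(length))  # 573
```

Both results agree with the Gene Length and Protein Length fields of this record.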


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value: 0.663669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCCT CATTTTCTTA CCGTTTTGCC ATATTAATAC TGCTTTTAGG CGTTTTTACT 
TACCTTTACG GCATTGATAG CCGTTTTGCC CCTAAAAACG GTGATGAATA TCCTTATATG
CACATTATGC GGATGACGGC TGATGCAGGA CATTGGCTGC CACTGCAGTC TGAAATGGCG
GGCATTAAAA ATACCAAGCC CCCGCTTATT TTTTGGCAGG GTATTGCAAG TACTTATTGG
GCGTCAGATT GGACTTTGGC CAATTTACGC TGGCCTAGTG TTTTATATAC TGGCTTGACA
GCATTATTTT TATTTTTAGC AGTGCGGCGC TTTAGCGGCA AGACACAAAC GGGATTCCTG
GCTGCATTGG TTTGGTTATC CTTTTTTGCA ACATACCGTT ATGGTCGCCC ATTCCTGGCG
GATCCTCCCG AGGTGTTCTG GATTTCACTG CCATTTTTTG CAACCCTCTA TTGGGGCAAA
AGTGCGTTTG AATCGAAACT CTTATTCCCT CTGATGGCAG GCATGTGTTT TGGTTTTGCG
CTGTTTGCAA AATCCTTTGC ATATATTGTG CCCGCATCGT TTGCTTTGGG TCTGTATTAC
TGGCGCTGGC GCCAGTGGAG TATTGCGCAG GTCGTGATCC GAGACCTTTA TAAGTTAATT
CTAATTGCAG CGTTTGCTTT GGGTGTGTTC GCCTTATGGT TTGTTATGGA TCCGAATCCA
GAGGCGGTAT GGAGTGAGTT TGTTCTGGGC GAGAATGCTG GCAAGTTTGC GGCACGCCAA
TCTAGCTACC TTATGGACTT GTTAAGAGGT GGCGATAGCA TTTGGCTTTT AATTATTGCC
ACGATTGCAA ATGCTGGCTT ATTTAGCTTT GTGCTGATTT CAGCTTTAGC TCGGTGTTGG
CGAGCGCGGC GCTTTCTCAC TCTGGAAGAA GTGCTCTTAT TGCTACTGGT GGCAGCTTTC
TTCATTGTGT TTAGTTTGCC AAGCCAGCGT TCAGGGCGCT ATCTTTTGCC AGTCATGCCC
GTATTTGCTA CATTGATTGC TCTGTATTGG GATAAGTTGC CTTTGTGGGG ATTTAGGATT
GCCTTGTTCT TGCAGTTATT GGTCCTATCG CTGCTCGGTT GGATTGGCAT CAATCTTCAG
TTTTCACAGT TTTTGGGTAA TGCTAGTCAG TGGACTTACT CCTATTGCCA CTGGATCATG
ATGTCAGTGA GTGTGTTCGT GGTGTTGGTT GGTTTGTTTA AGCGTAGCCA AACAAAAGCG
CTGGCATTGG CAGCATGCTT TCTGGTCTAT TGCGCGCTGA CCAGCAGTCT TGCGCCTCTG
GAAGGTCGCT TAGGACGATA CTCTATTGAG TCAATCAACC AGTTGCAAGG CAAAGACGTT
TGGATTCCCT GTGACTATCG GGCCAAAGAC GAGGAGTACC GTTTGTTAAT ACCAGGAGCA
AAGCTGCACG GGTATTTGGC AAAAGATGCT GGGGATATAA ATGGCCTCAC TGCCAGTTAC
CCTTTGGTGG CAGTGCAGTC GTCCCTTGGT GTAGTGCCAG TCATTTGCGA GTCTTGTCAG
ATTGTCGGGC AAAGAATGGA AATGCGGGCC CGTCACCCTA ATGAAGAAAT TGTAGAGATG
TTGGCTGGCC AAATTGGCAA ACATTTATTT GTTTATGAGT ATTTAATTGC AACCCCAGCA
GTTATGCCTG ATTTATCTAG CCAAAAGGAT GTCTGTAGAT GA
 
Protein sequence
MRASFSYRFA ILILLLGVFT YLYGIDSRFA PKNGDEYPYM HIMRMTADAG HWLPLQSEMA 
GIKNTKPPLI FWQGIASTYW ASDWTLANLR WPSVLYTGLT ALFLFLAVRR FSGKTQTGFL
AALVWLSFFA TYRYGRPFLA DPPEVFWISL PFFATLYWGK SAFESKLLFP LMAGMCFGFA
LFAKSFAYIV PASFALGLYY WRWRQWSIAQ VVIRDLYKLI LIAAFALGVF ALWFVMDPNP
EAVWSEFVLG ENAGKFAARQ SSYLMDLLRG GDSIWLLIIA TIANAGLFSF VLISALARCW
RARRFLTLEE VLLLLLVAAF FIVFSLPSQR SGRYLLPVMP VFATLIALYW DKLPLWGFRI
ALFLQLLVLS LLGWIGINLQ FSQFLGNASQ WTYSYCHWIM MSVSVFVVLV GLFKRSQTKA
LALAACFLVY CALTSSLAPL EGRLGRYSIE SINQLQGKDV WIPCDYRAKD EEYRLLIPGA
KLHGYLAKDA GDINGLTASY PLVAVQSSLG VVPVICESCQ IVGQRMEMRA RHPNEEIVEM
LAGQIGKHLF VYEYLIATPA VMPDLSSQKD VCR
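
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, translated, and checked for GC content follows. It assumes the amino acid assignments of NCBI translation table 11, which match the standard genetic code (table 11 differs mainly in the start codons it permits, which does not matter here because the gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon order and amino acid string follow the NCBI genetic code layout
# (bases ordered T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = (
    "ATGCGTGCCT CATTTTCTTA CCGTTTTGCC "
    "ATATTAATAC TGCTTTTAGG CGTTTTTACT"
)
print(translate(first_line))  # MRASFSYRFAILILLLGVFT
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.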