Gene Pnec_1535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1535 
Symbol 
ID6183649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp1338467 
End bp1339507 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content45% 
IMG OID641672062 
Productlipopolysaccharide heptosyltransferase II 
Protein accessionYP_001798234 
Protein GI171464121 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGTA TTCTGATCAT CGCCCCAAAC TGGATTGGGG ATGCAGTGAT GACACAGCCT 
TTGCTGGCGT CTCTTAAATC ACAATATCCA GAATCAACCA TTGATGTACT TGCGAGCACT
TGGGTTACAC CGATCTATCG TGCTTGTGCC GAAGTAAATG ATGTCCTTGA AGCAAAGTTT
GAGCATAAGC AATTGCAGTG GGGCTTACGC AAGCAGTTAG CCAAAGAGCT GGCCGCAAAA
AAATATCAAG TGTGTTTTGT ATTACCTAAT AGCTTGAAGT CAGCGCTGAT TCCTTGGCTT
GCCAATATTC CCTTTCGAGT TGGCTATCGT GGTGAGTTGC GATTTGGTCT AATTAACGTT
GCTTTAGACA ACCCAAGCAA GGTCAATCGC CCACCCATAG TCGAACACTA CCTTCAATTG
GGCCGGCTTT TAAATAACGA GCGGACTTCA CCAACCACCG CCAATCTCAC ACCCCAGTTA
AATGTATCCG CTGAAGCTAC TCACTCAGTA GAAACAAAAT TAACAAATAT TCATATTGAT
CAAGCGAACA TTTATGTAAT GTGTCCCGGG GCTGAATATG GCCCAACTAA GCGCTGGCTG
ACCAGTCATT TTGCGCAATT AACGGAAGGT TTGATCGCCA ATAATCCAAA TAATCAAATC
GTTCTTTTGG GCAGTAAAGG CGATTACACA CTTGGCAGTG AGATTCAAGC TCAAGCCAAG
CAGAACGATC ATATTCATAA CTGGTGTGGC GACACGTCTC TTGATGAAGC GATTGCTCTC
ATAGGAATGA GCAAAGCGGT CATTAGCAAT GATTCCGGCT TAATGCATAT TGCAGCCGCC
CTCAAAACTC CGCAAGTTGC TATTTTTGGA TCGAGTGATC CAGCCCATAC GCCACCCCTA
TCCGACAAAG CCAAAGTCAT TTGGCTCAAT TTACCATGCA GTCCGTGTCA CAAAAGAGAG
TGTCCACTAA AGCATTTAAA GTGCTTAAAT AACATCTTGC CTGCACAGGT ATTGTCCACA
CTGAGCACAT TGCAGCCCTA A
 
Protein sequence
MHGILIIAPN WIGDAVMTQP LLASLKSQYP ESTIDVLAST WVTPIYRACA EVNDVLEAKF 
EHKQLQWGLR KQLAKELAAK KYQVCFVLPN SLKSALIPWL ANIPFRVGYR GELRFGLINV
ALDNPSKVNR PPIVEHYLQL GRLLNNERTS PTTANLTPQL NVSAEATHSV ETKLTNIHID
QANIYVMCPG AEYGPTKRWL TSHFAQLTEG LIANNPNNQI VLLGSKGDYT LGSEIQAQAK
QNDHIHNWCG DTSLDEAIAL IGMSKAVISN DSGLMHIAAA LKTPQVAIFG SSDPAHTPPL
SDKAKVIWLN LPCSPCHKRE CPLKHLKCLN NILPAQVLST LSTLQP