Gene Pnec_1200 details


Gene Information

Locus tag: Pnec_1200
Symbol:
ID: 6184003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 1037451
End bp: 1038416
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 49%
IMG OID: 641671783
Product: porphobilinogen deaminase
Protein accession: YP_001797959
Protein GI: 171463846
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.580193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value: 0.533215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAA CCCTGAATTC TTCTCCCCAA TCCGCCCCCA AACGCCTAGT AATTGCCTCC 
CGTGAAAGTC GTTTGGCCAT GTGGCAGGCT GAGCACGTCC AAGATTGCCT TAAAAAGCTC
TATCCGGACT GTGATGTCCA AATCTTGGGG ATGACTACTC GCGGTGACCA AATATTGGAT
AGAGCCCTCT CAAAAGTGGG TGGTAAAGGC CTTTTTGTAA AAGAGCTTGA AACAGCCCTT
GAGGATGGTC GGGCTGATTT AGCAGTGCAT TCCTTAAAAG ATGTCCCCAT GGTGATGCCA
GAGGGGTTTG ATCTTGCCTG CGTCATGGCC AGAGAGGATG CAAGGGATGC GTTTGTTTCA
AATGATTACG CTAGCCTTGA GGATCTTCCG ATCGGAGCAA TTGTGGGTAC CTCTAGCTTG
CGACGGGAAT CGGTTTTGCG TGCCAAGTTT CCTCATCTCG TGATTCAGCC TTTACGCGGT
AATTTGGATA CCCGTATGGG TAAATTGGAT AAAGGTGAGT ACCAGGCGAT TATTTTGGCT
GCTGCTGGTT TAAAGCGCTT AGGTTTAGAG TCACGCATAC GAGCATTCTT GCCATACGAT
CCTTATACGC CAGCTGCAGG GCAGGGCGCC CTAGGAATCG AAACCTTGAG TAAACATCCC
AATATTAAGC AATGGCTCGC GCCATTAAAT GATTTGCCTA CATTGTTCGC TGTTTCAGCT
GAACGCATGG TGTCACGTCA GCTAGGAGGG TCTTGTGAAG TGCCGCTCGC TGCACACGCT
GTACGGGATC AAAATCAAAT GCAGATTCGC TCTTTTGTTG CGAGCACTGA TGGCAAAGCA
ATTTGCTTGG CTCATGGCAG CGCATTAGTT GAGTCGGTCG AAGATGCAGA AGCATTGGGT
CTTGCGGTCG CGCAAGATTT GCTCTCACAG GGCGCGGCAG ATTTAATTCC TGCACTACCA
AAATAA
 
Protein sequence
MSQTLNSSPQ SAPKRLVIAS RESRLAMWQA EHVQDCLKKL YPDCDVQILG MTTRGDQILD 
RALSKVGGKG LFVKELETAL EDGRADLAVH SLKDVPMVMP EGFDLACVMA REDARDAFVS
NDYASLEDLP IGAIVGTSSL RRESVLRAKF PHLVIQPLRG NLDTRMGKLD KGEYQAIILA
AAGLKRLGLE SRIRAFLPYD PYTPAAGQGA LGIETLSKHP NIKQWLAPLN DLPTLFAVSA
ERMVSRQLGG SCEVPLAAHA VRDQNQMQIR SFVASTDGKA ICLAHGSALV ESVEDAEALG
LAVAQDLLSQ GAADLIPALP K
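
Length check

The coordinate and length fields in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the final codon of the gene sequence (TAA) is a stop codon that is not counted in the protein length. The record does not state the coordinate convention, so that part is an assumption. The sketch below reproduces the arithmetic; the function names are hypothetical and belong to no database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(1037451, 1038416)
print(length)                  # 966
print(protein_length(length))  # 321
```

Both results match the Gene Length (966 bp) and Protein Length (321 aa) fields listed above.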