Gene Pnap_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3066 
Symbol 
ID4686385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3239890 
End bp3240933 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content59% 
IMG OID639836079 
Productputative sigma E regulatory protein, MucB/RseB 
Protein accessionYP_983286 
Protein GI121605957 
COG category[T] Signal transduction mechanisms 
COG ID[COG3026] Negative regulator of sigma E activity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGGAA GTTCTTTCAA AACCATGATT TTTAAGTGTT TTAGGCCTTT TGCGCTTGTC 
CTGTATGCGT GTACAGCTAT AAATTATGTA GCAGCACAAG TGCCGTCGAT TTCGATACCG
CCCGTCGCGG CTGCATCGGT TTCTGACCCG CGCAGCCTCA ATGACTGGTT GCTGCGCATG
CATCAGGCCT CCAGCCAACG CTCCTACGTG GGAACCTTCG TGGTGTCGGC GGGCGGCAAC
ATGTCCAGCG CCAAGATATG GCATGTCTGC GAAGGCAAAC AGCAGGTGGA GCGGGTCGAA
ACCCTGACCG GCGCGCAGCG TTCGATCTTC AGGCACAACA ACCAGGTGAT CACCTTCATG
CCGGAGCACA AGGTGGCGCG CAGTGAAAAA CGCGAATCCC TGGGCCTGTT CCCCGAAATG
TTCCAGTCGG CCGACAGCCG CATTGCCGAC TTCTACCAGT TCCGGCGGGA GGGCATCGAG
CGGGTGGCTG GCGTTGACGC CGACATCATC ACGCTCATGC CCAGGGACAG TCTGCGCTTT
GGCTACCGCG TCTGGAGCGA GCGCCAGAAC GGCCTGGTCG TGAAGCTCCA GACGCTTGAT
ACCGATGGCA AGGTGATTGA ACAGGCGGCG TTTTCAGAAC TGCAGCTCGA CGCGCCGGTC
AGCATGAACC AGCTCATCCA GATGATGGGC AAGGTGGAAG GCTACCGGCT TGAAAAACCA
GTGCTGGTCA AGACAACGGC AAGCGCTGAA GGCTGGGCCT TGAGGGCGCC CGTGGCCGGT
TTCACTGCCA TGAATTGCTA CAAGCGGCCC GCTTCTTCGG CCGGCGAAGG GCCGCTGCAG
TGGATTTTTT CAGATGGCCT GGCGTCGGTG TCCATTTTTG TGGAGCCGTT CGACCGGCAG
CGCCATGTCC GGGAATCCAG TCTTTCCCTG GGCGCGACCC AGACCCTGAC CCGGCAGCTT
GATGCGTACT GGATTACGCT GGTGGGTGAA GTCCCTGCCG CAACCTTGCA GCTTTTTGCC
AGCGGGCTGG AGCGAAAAAA ATAA
 
Protein sequence
MRGSSFKTMI FKCFRPFALV LYACTAINYV AAQVPSISIP PVAAASVSDP RSLNDWLLRM 
HQASSQRSYV GTFVVSAGGN MSSAKIWHVC EGKQQVERVE TLTGAQRSIF RHNNQVITFM
PEHKVARSEK RESLGLFPEM FQSADSRIAD FYQFRREGIE RVAGVDADII TLMPRDSLRF
GYRVWSERQN GLVVKLQTLD TDGKVIEQAA FSELQLDAPV SMNQLIQMMG KVEGYRLEKP
VLVKTTASAE GWALRAPVAG FTAMNCYKRP ASSAGEGPLQ WIFSDGLASV SIFVEPFDRQ
RHVRESSLSL GATQTLTRQL DAYWITLVGE VPAATLQLFA SGLERKK