Gene Pnap_2012 details


Gene Information

Locus tag: Pnap_2012
Symbol:
ID: 4689359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 2138112
End bp: 2139632
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 60%
IMG OID: 639835020
Product: NusA antitermination factor
Protein accession: YP_982242
Protein GI: 121604913
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
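
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are inclusive and that the CDS ends in a single stop codon that contributes no residue to the protein.

```python
# Values copied from the record.
start_bp = 2138112
end_bp = 2139632

# Assumption: start and end are inclusive, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1521 506
```

Under these assumptions the computed values agree with the listed gene length (1521 bp) and protein length (506 aa).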


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000880424
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.51664
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGCG AACTGTTGAT GTTGGTTGAT GCCATCTCGC GTGAAAAAAA CGTTGAACGC 
GACGTCGTCT TTGGCGCCGT CGAGCTTGCG CTCGCCTCTG CCACCAAGAA AGTGTATGCC
GATGGCGTGG ACATCCGTGT TGCGGTTGAC CGTGACAGCG GAAATTACGA AACCTTCCGC
CGCTGGCTGG TGGTTGCTGA CGAGGCTGGT CTGCAAAATC CCGAAGCCGA AGAGCTGGTG
ACCGATGCGC GCGACGAAAT CCCCGACATT GAAGAGGGCG ACTACATCGA AAAGCCGGTC
GAAAGCCTGC CGATTGGCCG CATTGGCGCG CAGGCGGCCA AGCAGGTCAT CCTGCAAAAA
ATCCGCGACG CCGAGCGCGA AATGCTGCTC AACGACTTCA TGTCGCGCGG TGACAAGATT
TTTGTCGGCA CCGTCAAGCG CATGGACAAG GGCGACCTGA TCGTCGAATC CGGCCGCGTC
GAGGGCCGTC TGCGCCGCAG CGACATGATT CCGAAAGAAA ACCTGCGCAC TGGCGACCGT
GTCCGCGCCA TGATCATGGA AGTCGATACC ACGCTGCGTG GCGCTCCCAT CATCCTGTCA
CGCACTTCGC CCGAGTACAT GATCGAGCTG TTCCGCCAGG AAGTCCCTGA AATCGAGCAG
GGCCTGCTTG AAATCAAGAC CTGCGCGCGC GACCCCGGCT CACGCGCCAA GATCGCCGTG
CTGTCGCATG ACAAGCGTGT CGATCCGATT GGCACCTGCG TCGGCGTTCG CGGCACCCGC
GTCAATGGCG TGACCAACGA GTTGGCTGGC GAACGCGTCG ATATCGTGCT GTGGAGCGAA
GACCCGGCCC AGTTCGTGAT CGGTGCGCTG GCGCCCGCCA ATGTGTCGTC CATCGTGGTC
GATGAAGAGC GTCACGCGAT GGACGTGGTG GTGGATGAGG AAAACCTCGC CATCGCCATT
GGCCGTGGCG GCCAGAACGT GCGCCTGGCG TCCGAGCTGA CCGGCTGGAA GATCAACATC
ATGGATGCCA ACGAGTCCGC CCAGAAGCAG GCCACCGAAA CCGACAGCAG CCGCAAGCTG
TTCATGGCCA AGCTCGATGT GGACCAGGAA ATCGCCGACA TCCTGATTGC CGAGGGCTTT
ACCAGCCTGG AAGAAGTGGC CTATGTGCCG CTGCAGGAAA TGCTCGAAAT CGAATCTTTC
GATGAAGATA CCGTCAACGA GCTGCGCACA CGCGCCAAAG ACGCTCTTTT GACCATGGAA
ATCGCCCAGG AAGAAAATGT CGGCGGTGTT TCGCAGAATC TGCGCGACGT TGAAGGCTTG
ACGCCCGAGT TGATTGCCAA ATTGACCGAA GCGGGTGTTG CCACCCGCGA CGACCTGGCC
GATCTGGCCG TGGATGAGCT TACCGATATA ACCGGCCAGT CTGCGGACGA GGCCAAAGCC
CTGATCATGA CTGCACGCGC CCATTGGTTT ACCGATGGCG CTGGCGACGC TGCTGCACCC
GCAGCAGCCC AAGAGCAGTG A
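
A GC content figure like the one listed for this gene can be recomputed from a sequence stored in this layout (blocks of ten bases separated by spaces). The following is a minimal sketch; `gc_fraction` is a hypothetical helper name, and it assumes that any character other than A, C, G, or T is layout whitespace and should be ignored.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    # Assumption: only A, C, G, T are real bases; everything else is layout.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# The first two blocks of the gene sequence contain 8 G/C bases out of 20.
print(gc_fraction("ATGAATCGCG AACTGTTGAT"))  # prints: 0.4
```

Passing the whole gene sequence as a single string gives the gene-wide fraction, which can then be compared with the rounded percentage in the record.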
 
Protein sequence
MNRELLMLVD AISREKNVER DVVFGAVELA LASATKKVYA DGVDIRVAVD RDSGNYETFR 
RWLVVADEAG LQNPEAEELV TDARDEIPDI EEGDYIEKPV ESLPIGRIGA QAAKQVILQK
IRDAEREMLL NDFMSRGDKI FVGTVKRMDK GDLIVESGRV EGRLRRSDMI PKENLRTGDR
VRAMIMEVDT TLRGAPIILS RTSPEYMIEL FRQEVPEIEQ GLLEIKTCAR DPGSRAKIAV
LSHDKRVDPI GTCVGVRGTR VNGVTNELAG ERVDIVLWSE DPAQFVIGAL APANVSSIVV
DEERHAMDVV VDEENLAIAI GRGGQNVRLA SELTGWKINI MDANESAQKQ ATETDSSRKL
FMAKLDVDQE IADILIAEGF TSLEEVAYVP LQEMLEIESF DEDTVNELRT RAKDALLTME
IAQEENVGGV SQNLRDVEGL TPELIAKLTE AGVATRDDLA DLAVDELTDI TGQSADEAKA
LIMTARAHWF TDGAGDAAAP AAAQEQ
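
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it using only the Python standard library. It is not the database's own method: `CODON_TABLE` and `translate` are hypothetical names, and the amino-acid string is the standard codon assignment in TCAG order, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). Translation stops at the first stop codon.

```python
from itertools import product

# Codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    # Assumption: spaces and line breaks are layout only and are dropped.
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATCGCG AACTGTTGAT GTTGGTTGAT"))  # prints: MNRELLMLVD
```

The same function applied to the final line of the gene sequence yields the last six residues of the protein, with the terminal TGA read as the stop codon.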