Gene Pnap_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4042 
Symbol 
ID4686148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4308570 
End bp4309562 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content60% 
IMG OID639837055 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_984254 
Protein GI121606925 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.941259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAA TTTGTTCAAC ATTCATCGCC GTGCTGGCCC TCGCCACTTG CGCAGTGATG 
GCACATGCCG AGCCGATCAC CCTGACGCTG GTCCATGCCG CCTCGACCAC GCACCCGGAA
CACCTGGCGG CGCTGCAGTT TGCCAGGCGC GTCGAAGAGC GCACCCACGG GCAAATCAAG
ACCAGGATAT TCCCGGCGGC CCAGCTCGGC AGCGAAACCG AAATGATCCA GAAAGTCCGA
CTCGGAGCCA TCGACATGGA CTTGGCCTCT CACCACTACC TCATCAATTA TGAGAAGGCC
TTCGCGGTTG TGATCATGCC TTACGTATTC GACAACTATG AGCATGCGCA CCGGGTGCTC
GACGGCCCGG CGATGGCCTG GCTCGCGCCG CTGGCCGAAA AGCAGGGCTT CGTGATCTTG
TCCAACTGGG AATGGGGCTT TCGCAACCTG AGCAACAACC AGCGTCCAAT CAACCAGCCC
GAGGATGTGC GCGGCCTGAA AATACGCGTG CCGCCTGGGG TTGGAATGGA GGCCAGCATG
GAAGCGCTGG GCGCGCAGAT CAGCAAGATC AGCTTCAAGG ATCTCTACGC AGCGTTGTCG
CAAGGACGGG TCGATGGCCA GGAAAACCCG CTCAGTGTCT TTTATCACCA CAAGCTGTAC
GAGTCTCAGA AGCACCTCGC GCTGACGCGG CATGTTTACT ACAACATGGT CCACTTCATC
AGCGTCAAAA GCTGGACCCG GCTCACGCCG GCGCAGCAAA CCATCGTGCG CGAAGAAAGC
AAGGCGGCGG GCGACGGCAT GCGCAAAAAA ATCATGGCCG AAGAGGACGA GCTGATCGCC
AGGCTGGCCG CTGCCGGGGT GAAGGTCACA CGCCCCGACC CCCAGCCGTT CCGCGCCATG
ATGGAGCCCG CCCACAAAAA GATCAAGCTC CTTGCCGGCG AAGAGAACGC ACGCAAGTTC
CTGAACATGG TCGCGGATGA ACGCCGGCCA TGA
 
Protein sequence
MKAICSTFIA VLALATCAVM AHAEPITLTL VHAASTTHPE HLAALQFARR VEERTHGQIK 
TRIFPAAQLG SETEMIQKVR LGAIDMDLAS HHYLINYEKA FAVVIMPYVF DNYEHAHRVL
DGPAMAWLAP LAEKQGFVIL SNWEWGFRNL SNNQRPINQP EDVRGLKIRV PPGVGMEASM
EALGAQISKI SFKDLYAALS QGRVDGQENP LSVFYHHKLY ESQKHLALTR HVYYNMVHFI
SVKSWTRLTP AQQTIVREES KAAGDGMRKK IMAEEDELIA RLAAAGVKVT RPDPQPFRAM
MEPAHKKIKL LAGEENARKF LNMVADERRP