Gene Pnap_3194 details


Gene Information

Locus tag: Pnap_3194
Symbol:
ID: 4686276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3392976
End bp: 3395219
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 58%
IMG OID: 639836207
Product: exopolysaccharide transport protein family
Protein accession: YP_983413
Protein GI: 121606084
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01005] exopolysaccharide transport protein family; [TIGR01007] capsular exopolysaccharide family
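
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 3392976, 3395219

length = gene_length_bp(start_bp, end_bp)
residues = expected_protein_length(length)
print(length, residues)  # prints: 2244 747
```

Both results agree with the Gene Length (2244 bp) and Protein Length (747 aa) fields.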


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0576306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCTT CTCAATCTCC AGTGCCTGCC TCGTACATTG CTGACGCACA AGACGATGAA 
ATTGACTTGT TAGGCTTGCT CGATGTGCTG CTTGATGCGC GCTGGCTCAT TGTCGGCGTG
ACTGCACTGG TGCTGGTGCT GGGCGGTGCC TATGCCTTCC TTAGCCGCCC GGTTTATGAA
GCTAATACAT TGATTCAGGT TGAAGACAGC AAGCCCGGTG CTGCCGGTGC CCTGGGCGAT
GCAGCCAGCC TGTTTGACAT CAAGTCACCC GCCACGGCCG AAATAGAAAT CCTGCGCTCT
CGCCTGGTGG TCGGCAAGGC CGTCGATGAC CTGCAGTTGT ATGTCACAGC CACCCCTCAA
TACCTCCCGC TGGTCGGTGG TTGGCTTGCA CGGCGTGCCA CCGATCTGTC CAATCCCGGA
TTCATGGGCA TGGGCGGTTA TGTCTCTGGC AATGAATCCA TTCGTTTGGG ACTGCTCGAA
GTACCCGCAG CGTTGCAAGC CCAACCGCTG CTGCTGGTGG CCACTGAAGG CGGTTATGAA
TTACGCGACC CTAATGGTCA GACGCTGGTG CAGGGCAAAA CCGGTACGCC GGCAGACTTT
GGCAGCGGGG AAGACAAGGG CCGCATTCTG GTGACCGAAC TCAAGGCCAA GCCCGGCGCG
TACTTCGACG TGTCGCGTTA TTCCCGTCTG GGTGTGATTC AAGGGCTGCA GCAGCAACTG
GCCATCTCGG AGCAAGGTCG TCAGTCCGGT GTGATTGCGG TGCAACTGCA AGGCACCGAC
CCGCAACAAA TTGCCCGTAC CCTGAATGCC GTAGGCACCA ACTATGTGCG CCAGAATGTC
GAGCGCAAAT CGGCCGAAGC AGAAAAATCA CTTGCTTTCC TGGGTGATTT CTTGCCTCAG
CTTAAAAAGC AGTTGGAAGA ATCCGAAGTT CGATTCAACA AGTATCGCAA CCAGAACGGC
ACGTTTGACC TGGGCGTTGA AGGGAAAACG TATCTTGAAA CAGCTGTCAA GCTACAAGGC
GACCTCCTGT TACTACAGCA AAAACGACGC GAGCAAATCG CGCAGTTCAC CGCTGCTCAC
CCCGTCATCC AGACGCTGGA TGCACAAATT TCCGCCGTCA GCAAGGAAAT TGCGGGCCTG
ACCACCAAGG TCAAGACACT GCCCAATACC GAGCAAGACT TGCTGCGCCT GACACGCGAT
GTGAAGGTCA ACAGCGAGCT GTATCTCAAC CTGCTGACCA GCTCACAGCA ATTGCTCCTG
GTCAAGGAAG GCAAGGTCGG TAATGTTCGG GTGGTTGATG CACCTGTTGT GCCTGAGCGG
GCCATCAAGC CCCAGCGCTC ACAAATACTG GCCATCAGCG GCGTGCTTGG CTTGCTGTTG
GGCATGGGCT TGGCATTTTT GCGCAACAGC CTGCGCCCTG GTATCAAGGA TCCGGCCGAT
ATCGAGTCGG CCACCGGCCT GCATGTATTT GCCACCGTGC CGCATTCGGC TGAGCAGGAC
AAGCTTTCCA GGCTGATCAA GATCCAGGCT CCAGGCAACC ACCTGCTGGC CATTACGCAT
CCCGAAGACC CTGGTGTGGA AAGCCTGCGC AGCCTGCGCA CTGCGCTGCA GTTTGCCATG
CTTGATGCGC GCAACAACGT GGTGCTGTTC ACTGGTCCCA CTCCGGGCAT TGGCAAATCC
TTTACGAGTG CCAACTTTGC CGCCGTGCTG GCCGCAGGCG GCAAGCGTGT GCTGCTGATT
GACGCCGACA TGCGCAAAGG CCACATTCAC CAGTTTTTTG GCATGAAGCG CGGCCACGGC
TTGAGCGAGC TGATTGCCGG CAGCCGCACG TTGGGCGATG TGGTGCGCCG CGCCGTTGCA
CCCAATCTGG ATCTGGTCAC CACTGGCACC ATGCCGCCCA ACCCCGGTGA ATTGCTGATG
TCGCCCGCTA CCGTGCAACT GCTGGAAGCC CTGTCTGCCC AATACGACCT GGTGCTGATC
GACACTCCGC CCGTGCTGGC CGTGTCAGAC ACGCAGGTAC TTGCACCGCA TGCCGGCACC
GTGTTTTTGG TAGCCCGGGC CGAAGTGACC GCACTCGGCG AATTGCAGGA AAGCACCAAG
CGCCTCGGCC AGACCGGTGT GCAGGTCAAA GGCGTCGTGT TTAACGATCT GGACACCAGC
CGCCAGCGCT ACGGCGGCTA TGGCTACAAA TACAGCCGCT ACCGCTACAC CAACTACCAA
TACGGCAAAA CAGACGGGCA GTAA
 
Protein sequence
MNPSQSPVPA SYIADAQDDE IDLLGLLDVL LDARWLIVGV TALVLVLGGA YAFLSRPVYE 
ANTLIQVEDS KPGAAGALGD AASLFDIKSP ATAEIEILRS RLVVGKAVDD LQLYVTATPQ
YLPLVGGWLA RRATDLSNPG FMGMGGYVSG NESIRLGLLE VPAALQAQPL LLVATEGGYE
LRDPNGQTLV QGKTGTPADF GSGEDKGRIL VTELKAKPGA YFDVSRYSRL GVIQGLQQQL
AISEQGRQSG VIAVQLQGTD PQQIARTLNA VGTNYVRQNV ERKSAEAEKS LAFLGDFLPQ
LKKQLEESEV RFNKYRNQNG TFDLGVEGKT YLETAVKLQG DLLLLQQKRR EQIAQFTAAH
PVIQTLDAQI SAVSKEIAGL TTKVKTLPNT EQDLLRLTRD VKVNSELYLN LLTSSQQLLL
VKEGKVGNVR VVDAPVVPER AIKPQRSQIL AISGVLGLLL GMGLAFLRNS LRPGIKDPAD
IESATGLHVF ATVPHSAEQD KLSRLIKIQA PGNHLLAITH PEDPGVESLR SLRTALQFAM
LDARNNVVLF TGPTPGIGKS FTSANFAAVL AAGGKRVLLI DADMRKGHIH QFFGMKRGHG
LSELIAGSRT LGDVVRRAVA PNLDLVTTGT MPPNPGELLM SPATVQLLEA LSAQYDLVLI
DTPPVLAVSD TQVLAPHAGT VFLVARAEVT ALGELQESTK RLGQTGVQVK GVVFNDLDTS
RQRYGGYGYK YSRYRYTNYQ YGKTDGQ
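
The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how such a block-formatted sequence might be cleaned and how a GC percentage could be computed from it. Only the first displayed line of the gene sequence is used here for brevity. To compare against the GC content field of this record, the full gene sequence would have to be pasted in.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs, and line breaks.
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First displayed line of the gene sequence in this record.
first_line = (
    "ATGAACCCTT CTCAATCTCC AGTGCCTGCC TCGTACATTG CTGACGCACA AGACGATGAA"
)

dna = clean_sequence(first_line)
print(len(dna), dna[:3])          # prints: 60 ATG
print(round(gc_percent(dna), 1))  # GC percentage of this 60-base fragment
```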