Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4086 |
Symbol | |
ID | 4686153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 4367746 |
End bp | 4370757 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639837099 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_984298 |
Protein GI | 121606969 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.791775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.807746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAC ACCGCCTTGC CCCGGTTCTA CTGGCCGCAG CTGTTCTGGC TGCGTCCTGC GGCGGCGGTG ACACGCCATC CCCGGCAGGC GGCACCGGCA TCAACACCGG GGTCAGCCCC GGCACCGGCA GCGTGCCGCT GTACCAGTCC TCCACCGAGA CGCTGACACT GCCGCAGCTG CGGGTAGCCG GCAACATCCG GGCCGATGTC AAGCTTGTGC TCAAGAAGGA CGGCAACTGG GCGCTGCTTT CCTCCGGCCC GACTCGTCCC GCCACCGCAT CCGACACGCC CGGCGCCGCG CTGGCCGCGC CCGGCGGCAA CACCGACCTG GGCGGCACGC AGACCGACAC GACGCTGACG GTGGCGCGGC TGCACGTAGG CTCTCGCGTG TTTGGCAACG TGGCGGTCCG GCTCACCGGC AAGGCCTGGG CCTTTGTGAG CAGCCCGCAA GAGGTGAAGA CGCTGCACCA GGAGGACTTC AAATCCAACA CGGCGATTCG CGCCGACGAG TCGCACCACG TCATCCTTCA ATCCAGCCCG GACAGCGGTG TGCAGAACGT GCCGATGCAG TTGTCCGGCA GAAACTACAA GTTCTGCATG GACGCGCAGG CCGAGGGCGC CGACAGCACG ACGCTGCTGG ATGCGGCGGG CCATACCATT TTTACGCTCA AGGCGGGTGA GCCTTGCGTC ACCCTCCAGG CCAGGGAGGG GGCGTACACC CTGCAGCACC GCTATGGCGG CACCGGCAGC GCCCGCACCC TGTTCATGCG CAACCAGGCC AACACCACGA CAGCCACTGC GCTGGCGGCG CAAGCAGCGT CCGCACCGCC CGGTTTGCTC AACGCGCCCA AGCTGCTGGC CGCTTCGGCA ACCGCGACAA ACGCCATGGC GCCGGTCGCC GAATACTGGT CTGTCCGCAA CCCAGCGGCA AATTCCGGCG CGCAACCCAG ATCGCTCGGA AATGCGGGTA CTTTTTACGC GGAGCCGGTT TATGGGCTTG ATGCTCTCGG CACGGGCTGC AACGGAAAGA TCGCCTTCAG CTTCATGAAT CCCTGGAGCG CCCAGGCACT ATTTCGCATC GACAAGAATG CTTTCAACGT ACCCGTCTAC ATGGGCGTTC CTCTTGGATG TGAGTTTTAT GCGATGGAGC TTTATTCTGG AACGGGTTCG ATTCCACCGT TCGTATACGG CAACGCCATT TACCCTCAAT ACGACCCCAC GCTGGCATCC ACCCTCTGGA ACAGCGGGCT TGGCACCACC TACGCCAGCC TGGTAAAGGA TGACTACCTG ACAGAAGTGG ATGACAACTC TCTGCGCCTG TATCCAACGC TTGGCCTGGA TGCTTTTGTC GTGCCGTCGC ATCCGACCGT CACCATTGGC GGGGTGGCGT ACCCCGCTCC AATTCCGATG CCAGATCCGA ACGCGCCTGA GGGCTTCTAC AGCATGGGCG GTTTTTTTGC GAGTTCCACC CTGGTGGAAC GCAGGCAACA AACCATCACC GAACAGTCAA GCACGCGCTT TACCCTGGCG TCCATGTATT CGACCGTGTA TTCGCCGGCG GGTGTACAGG TGAGCAGCGC GACGGCACCT GTTCTGGCCA CCAGCGACTC CCTGCTCACG GTGGGAAATG CAGCCGCAGG CGATGTGGCC TCCTCGCTCT CGATTGCCTA CCGCTATTAC CCAGACGGAC CGCCATCGGC TGTTTTAGGT TTCGGCCAGA TTGCGCTCTA CACCGGGCCC AACTGCACGG GCGCCGTGGT CATGCCACAG GCTGACGTGC CGACCTTCGA CGTCTCGGGC GCTCCTGCGC TCAAGGGACT TGGCACCTCG TTCAAGCTGG GCCTGCAGAC CAGCGCCACG GCTTTTTCAT TGCCGCTGTA CAACGGCGAG CAGCAGCACT TTGACCAGCT GACCTGCTAT TCCGGCGGCT TTGGCTCAAC AGGCTGGACG CCCCAATCCA TGCAGATCGC AGTGGACACC GTCACCATGG TCATCAGCAC CGACTCGTGT GAATACTGCA ACCTGGCCGG TGTCGATTTT TCCGGCGTCA ACCTCACCAA CGTCAAGCTG AGCTACGCCA ACCTGAACGG CGCCATCCTG TCCAACATCG ACCTGTCGGG CGCGGACCTG CGCAGCGTCA GCCTGCAGGG CGCCTACCTC ATCAATGCCA ATCTGGACGG CGCCAATTTA TGCGCGGCGC AATTGAACGG CAGCCAGGGC GTCACCCAGG CGGCCACGCT GACCGGGGCG CACCTGCGCA ACACCAACCT GGCGCTGTCC AACCTCGATG GCGTGAAGCT GTCATCAGCC AGCTTTTACA GCAGCAATGG GCAGGGCACC TGCCAGCAGA CCAGCTGCAG CAGCTATGTG GCCTCCACCT GTGCCAGCGC CTACAACGCC TCCCTCAACA ACGCGAGTTT TGATTCGGCC TACCTGTCCA ATGTCGATAT GAGCAACGTC ACCGGCGCGG GTGTCAGCTT CAACAACGCC GCCTTGTTTG GCGTGCTGTT TGGCCAGGCC AATTTGGCGC ACAACAAGCT TTCATCCGTC AGTTCATCCT TCATCAATGC CTACCTGCAG GGAACCGACC TGTCTCGCGC CAACCTGCAG TTCGCAGACT TCACGGGGGC GCAGTTTGAT GCGGCCAGCA ATTGCATCCA GGCCAATTTA AACCCCGCCT ATTCAAACTT CCCCGGCGCC AAGGTGCCCG CCAGCCCGGG CAGCTCGACC TGCGTGCCGG GCAAGCCTGC TGCGGCGTTT TGCGTCCAGT CTTCTTTCGC CCCATCGGCG GGCTACCCCC AGACCGACTG CACCAACATC TGCGCCGATG GGAGCACGGC AGGCGTCGGG CTGACCAATG GCACCTGCCC GAACGCTTTC ACCTGCTCCT CGGCAAGCTG GACCACGCCG CTGAACGGCG GAGGCAACGG CGCCATGCCC ACCAGCAACT GCCAGGGCGC AGCGGCGCTG TGCGGCAACC CGTTCACGGG CGGCGCCGAC CCGTGCTGGT AA
|
Protein sequence | MMKHRLAPVL LAAAVLAASC GGGDTPSPAG GTGINTGVSP GTGSVPLYQS STETLTLPQL RVAGNIRADV KLVLKKDGNW ALLSSGPTRP ATASDTPGAA LAAPGGNTDL GGTQTDTTLT VARLHVGSRV FGNVAVRLTG KAWAFVSSPQ EVKTLHQEDF KSNTAIRADE SHHVILQSSP DSGVQNVPMQ LSGRNYKFCM DAQAEGADST TLLDAAGHTI FTLKAGEPCV TLQAREGAYT LQHRYGGTGS ARTLFMRNQA NTTTATALAA QAASAPPGLL NAPKLLAASA TATNAMAPVA EYWSVRNPAA NSGAQPRSLG NAGTFYAEPV YGLDALGTGC NGKIAFSFMN PWSAQALFRI DKNAFNVPVY MGVPLGCEFY AMELYSGTGS IPPFVYGNAI YPQYDPTLAS TLWNSGLGTT YASLVKDDYL TEVDDNSLRL YPTLGLDAFV VPSHPTVTIG GVAYPAPIPM PDPNAPEGFY SMGGFFASST LVERRQQTIT EQSSTRFTLA SMYSTVYSPA GVQVSSATAP VLATSDSLLT VGNAAAGDVA SSLSIAYRYY PDGPPSAVLG FGQIALYTGP NCTGAVVMPQ ADVPTFDVSG APALKGLGTS FKLGLQTSAT AFSLPLYNGE QQHFDQLTCY SGGFGSTGWT PQSMQIAVDT VTMVISTDSC EYCNLAGVDF SGVNLTNVKL SYANLNGAIL SNIDLSGADL RSVSLQGAYL INANLDGANL CAAQLNGSQG VTQAATLTGA HLRNTNLALS NLDGVKLSSA SFYSSNGQGT CQQTSCSSYV ASTCASAYNA SLNNASFDSA YLSNVDMSNV TGAGVSFNNA ALFGVLFGQA NLAHNKLSSV SSSFINAYLQ GTDLSRANLQ FADFTGAQFD AASNCIQANL NPAYSNFPGA KVPASPGSST CVPGKPAAAF CVQSSFAPSA GYPQTDCTNI CADGSTAGVG LTNGTCPNAF TCSSASWTTP LNGGGNGAMP TSNCQGAAAL CGNPFTGGAD PCW
|
| |