Gene PG1020 details


Gene Information

Locus tag: PG1020
Symbol:
ID: 2553043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 1080708
End bp: 1083518
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 50%
IMG OID: 637149726
Product: hypothetical protein
Protein accession: NP_905241
Protein GI: 34540762
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
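
The gene and protein lengths above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the gene span includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
gene_len = gene_length_bp(1080708, 1083518)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 2811 936
```

Both values agree with the Gene Length and Protein Length fields of this record.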


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00854238
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAGA ATGGACTTTC TTACATCCCC GTATTGCGTG CTTTATTGAT CTGTACGCTT 
TTGCTGCTCG CGTATGGAGC TTCGGCTCAG CATCATGTGT CCACTTGTAC GATCCGTGGT
AAGGTGACGG ATATGGCGGG AAAGGAGGGC ATTGGTTTTG CCACTGTACT ATTGGCGGAT
CAGGCTTATG GGGTAGCCTG TGACGGGAAG GGAGAGTTTG TGATCAAACG GGTTACAGCC
GGATCGTACC GTGTCGTGGT CAGCTGCATG GGCTTTGCTT CGTTCGAGAC CACGCTCCAT
ATCAAAGCAG ACACCACGGT ACACTTCCGA CTCAAAGAAC ATAGTATCGC TCTGAAAGAG
GTGCAGGTGC TGGCCACTTA TAAGAACAAG ACGGACGGCA ACGTATCGGT GGATCGGACT
GCTCTGGAAC ATATCCAGCC TACCTCCCTG CGCGACATCT TTCTCCTGCT TCCGGGTAAT
GTGGTGCAAA GCAACAGCCT GACCGGATTC GTGGGAACGA CTTCTCGACA AATAGGCAGC
GACGATAATA CCTCTTTGGG AACCAACATC TCCATCGATG GTATGCCGCT TTCGAACGAT
GCCATGCGTA CCCAACTGTA TGGTATCACG GGAGTAGATC GGTCTGACCT GTACGTTGCG
GATCGTACCG TTACTCGCAG GACGGGAATG AATGCCGGTA TCGACCTTCG TACCATCTCC
ACAGATCATA TCGAAACCAT CGAAATAGAA CGGGGCATCT CATCCGTAGG GGAAGGTAAT
TTGTCGAGCG GAGCCATTCG CATCCGTACC AAGCAAGGGT CTGCTCCACT CCAAATGCGT
GTCAAGGCTG ATCCCCTTAG CAAACTGGCC TATGTGGGGA AGGGATTCAA AGTGTCCAAG
CGCGGGGCAT CCATTCATAC GGGAATAGAC TTCCTGCACA ACAAACCCGA TGTGAGGGAG
GTGTTGGATA GCTATAAGCG GAGTACGCTG CATGCCAGAT ATAGCGATCA GTTGAATTAC
GGGGATATGA TCGTGGACTT GGGTATCGGG CTGATGCAGA CCTTTTCCAT TCAAGACAGC
AAATCGGATC AGCTCGTCAA TGAGTATGAA GAGACCTACA ACTCCAAGTA TTTGCGTTCT
TCTCTTTCCT TCGATGCCAA AGTCCGCTTT TCGGGTAGTT GGCTCAATAG TATAGACTGG
CTTCTCTCTG CCGACTATAC TTCCGATCTG CTCAAAAGGA AGAAATACTA CATCAGTACG
AGTGGCCCGA AGAGTATGCC CGTAGCTACC GAAAGCGGAG AACATGAAGG TGTTTTCCTC
CCTTCGGCCT ACTATGCCTA CTATGAAATA GACAATCAGC CGCTCTATTT CTTCAGCCGT
TTCAAAGCCA ATACCGATAT AAACTTCACC GAACGTCATC ACCATCATTT GCTGTACGGG
CTGGAAGCTC GCAGCAGTAA GAATATCGGT AGGGGAGTGG TGGCCGATCC TACTCGTCCG
CCTTATCCGG GAAACAACAG CTATATCCGC CCTCGTCCCA ACTATAATAT ACCGGCTTCG
GTCTATGCTG CATTCTTTGT CGAGGACAGG GCTTCTGTCG AATGGGGAGC CAATAGGCTG
GGTATTCAGG CCGGACTGAG GGCTACGCAT CTATTCAATC TGCCTTCGTC CTATGCCCTC
TCTCGGAAGA TGCTGATAGA GCCGCGTATC AAGGCCAACT GGCAGTACAG AGCCGAACAT
CTGTCGATAA ACCTGCGTGC CGGATACGGT ATGGAAAACA AGTTGCCTAC ACTGGATCAT
CTGTATCCGG ACAAGATCTA CCGTGACTTC ATGGTGCTGA ATGCCTATAT GCAGAATCCC
GAACTGGATC ATCTCATCAC TTATACTTAT ATCCACAATC CCGAGAATCC TGCTATCAGG
GAGAACCGCA ATGTCAAAAA AGAGATCGGC ATAGATATGA CGTACAAGCG TTTCGACTTT
TCTTTGACAC TCTTCCACGA AGAATCGCGA CGCGGTTTCG AGTATTTCGA CTCCTATCTG
CCTATAGCCT TCGACCGCTA TACCAAGCTG ATAGCCCCTC TTCCTCCGGG ACACAAACCG
CAGAAGGAGG ATTATATTCA GGAGCATCAC AAGGATTTCT TTGTTATCCC CACCGTGCAG
AATTCTGCCA AAGTAGTCAA GCGCGGTATC GAATTTCGTT TGCGCACACC TTATCTGAAA
GCCATCAATA CCCAAGTCGA AGTCAATGGA GCCTATTACC ATACGCTCTA CGCTTCGGGT
ATCCCTATCA TGTTCCGACC GATAGTGTCG GAGTACGAGC AGGCTCTCTA TCCTTATGTG
GGCTATTACG AAGGCAGTCT GCACAAACAC TACCAACGCT TCAATACGAA TGTTTGGGTC
AATACGCACT TCCCTCGTTA TAAACTGATC TTCACTTCCT TTTTCCAGAT TATCTGGCAC
GATGCCTCGT ACAGGGGGCA TGAGGAGAGC GAGTTCCCCT ATGCCTATAT GGATCTCGAT
GGAGTAGTCC ATCCCACGAG CAAGGCTGCC ATTCTGGAGG CTGCTGCCAC GAACGATATT
CTCAAGTACT TGAGCCGAGA GCGGACGGAG CTATACTACA GGGCTACATA CAAACCCATA
TCGCTGCGTA TCAATTTCAA AGCCACCAAA GAGTTCTCCG ACCGTATGCG GCTGGCTTTT
TTCGTGGACA ATATCATCGA CATCAATCCC AAATACAAAC AGGCCAATAA TACGACGGAG
CGAGACTGGT CTATACCCTA TTTCGGAATA GAAACGACAT TCACCCTATG A
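
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; the record does not say how its value was computed or rounded. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used to group the sequence."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence above, used as a small example.
print(gc_percent("ATGAAAAAGA ATGGACTTTC"))  # 30.0
```

Passing the full gene sequence to the same function gives a value to compare with the GC content field.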
 
Protein sequence
MKKNGLSYIP VLRALLICTL LLLAYGASAQ HHVSTCTIRG KVTDMAGKEG IGFATVLLAD 
QAYGVACDGK GEFVIKRVTA GSYRVVVSCM GFASFETTLH IKADTTVHFR LKEHSIALKE
VQVLATYKNK TDGNVSVDRT ALEHIQPTSL RDIFLLLPGN VVQSNSLTGF VGTTSRQIGS
DDNTSLGTNI SIDGMPLSND AMRTQLYGIT GVDRSDLYVA DRTVTRRTGM NAGIDLRTIS
TDHIETIEIE RGISSVGEGN LSSGAIRIRT KQGSAPLQMR VKADPLSKLA YVGKGFKVSK
RGASIHTGID FLHNKPDVRE VLDSYKRSTL HARYSDQLNY GDMIVDLGIG LMQTFSIQDS
KSDQLVNEYE ETYNSKYLRS SLSFDAKVRF SGSWLNSIDW LLSADYTSDL LKRKKYYIST
SGPKSMPVAT ESGEHEGVFL PSAYYAYYEI DNQPLYFFSR FKANTDINFT ERHHHHLLYG
LEARSSKNIG RGVVADPTRP PYPGNNSYIR PRPNYNIPAS VYAAFFVEDR ASVEWGANRL
GIQAGLRATH LFNLPSSYAL SRKMLIEPRI KANWQYRAEH LSINLRAGYG MENKLPTLDH
LYPDKIYRDF MVLNAYMQNP ELDHLITYTY IHNPENPAIR ENRNVKKEIG IDMTYKRFDF
SLTLFHEESR RGFEYFDSYL PIAFDRYTKL IAPLPPGHKP QKEDYIQEHH KDFFVIPTVQ
NSAKVVKRGI EFRLRTPYLK AINTQVEVNG AYYHTLYASG IPIMFRPIVS EYEQALYPYV
GYYEGSLHKH YQRFNTNVWV NTHFPRYKLI FTSFFQIIWH DASYRGHEES EFPYAYMDLD
GVVHPTSKAA ILEAAATNDI LKYLSRERTE LYYRATYKPI SLRINFKATK EFSDRMRLAF
FVDNIIDINP KYKQANNTTE RDWSIPYFGI ETTFTL
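
The protein sequence is the conceptual translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce it with the Python standard library. It relies on table 11 sharing its amino acid assignments with the standard genetic code (the two differ in which start codons are permitted), and it does not model alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAAGA ATGGACTTTC TTACATCCCC"))  # MKKNGLSYIP
```

The output matches the first ten residues of the protein sequence, and the final bases of the gene (GAAACGACAT TCACCCTATG A) translate to ETTFTL followed by a stop codon, matching its last six residues.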