Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1887 |
Symbol | |
ID | 8252991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2180044 |
End bp | 2182014 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644935538 |
Product | transglutaminase domain protein |
Protein accession | YP_003092157 |
Protein GI | 255531785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATATC TTCTTTTTCT GTTTGTATTT TTTTGTTCCA TTGCTGTAAG GGCACAGGAT TTTGGATTTG GGGAAATCAG CGGGGATGAT CTTGACTTAA AAAAGGTAAA AACAGACAGC AATGCCAATG CAGTAGTGTT AAAGGAATTT GGTACGGCAT CAGTTCGTTT AGACGAAAGT TATGGCAATC TTTATATAGA TTTTGAATAC CATGTCAGGA TAAAAATACT GAACAAAAAT GGCTTCGGGA GTGCAAATGT TGTCATTCCC CAAAGAATTT ATGGCGATAA GGAAGACATG GTCCAGAACC TGAAAGCAGT GACCATAAAT TATATAGACG GGCAATTTAC ACAAACACCG CTGGATAAAA AAAAGGTGTT TACCGAGAAA AAGAACAAAT ATGTGGTACT GACTAAATTT ACCATGCCCA ATCTGGTTGA GGGCAGTATT ATTGAATACA GCTACCGCCT CTATTCGCAT GGCCTTTTTA ACTTCAGGAG CTGGGAATTC CAGTCGGACA TCCCGAAACT GTACAGCGAA TACATTGCCA TTATCCCTGC CCTTTACACT TATAATGTAT CCTTACGTGG GGCACAGAAG CTCAGTTCAC AAAATGCTGA ACTCTATAAA GAATGCCTCA GAATTTCAGG AAGGCCCTAT GACTGTTCAA AAATGACCTA TATCATGAAA GATATCCCGG CATTGGTTGA AGAAGATTAC ATGACTGCCC CAAGTAATTT CAGGTCGGCC ATTAATTTTG AACTCTCCGA ATACTACCTG CTGTCGGGCG GGAAGAAAAG TGTAACCAAG GAATGGAAGG ATGTGGACTT TGAGCTGATA AATGACAAAT CATTTGGCAG CCAGATGAAA AGAAAAGACC TTTTTAAAGA GCTGTTGCCA GAGATCCTGA AAAACAAGAC TGCACCGCTG GACAAGGCAA AGGAAATCTA TGATTACATT AAGCGCAACA TCAAGCGAAA CGGATTTATT GGAATTCAAA GTGAAAATAC AATAAAAAAA GCTTTGGAAA CCCATTCCGG CAATACGGCG GACATTAACC TGGCGCTGGT AGCTGCATTA AGTGCAGCGA ATCTGGATGC AGAAGCGGTT ATCCTTTCGA CCCGTTCCAA TGGCACTGTG AATAACTTAT ACCCCGTGAT CACTGATTTT AATTATGTAA TAGCTAAGGT AAACATTGAG GGAAAAAGCT ATTTGCTGGA TGCAACAGAG CCTTTAATGC CATTTGGTTT GCTGCCACTC CATTGCATTA ACGGACAGGG AAGGGTAATC AACCTGAAAA AACCCTCCTA CTGGTATGAC CTTAAAGCGA GTCAGAAAGA AACACTCCGG TACAGTTTAA TTGCTGAGCT GGGAAAAGAT GGAAAAATAC GGGGTAACCT GACCATCCAC GCCATTGGTT ATGCGGCCTA TAATAAACGT AAAAAGATCC TGGCAGCAAG TTCGGTGGAT GAATATGTAG AGAAGCTGGA TGAGAGTATG CCCCAGATCA GGATCCTTAA ACATGCAATC CATAACCTGG ACAGCCTGGA AAACCTGCTT ACCGAAAATT ATGAAGTTGA AATGTCGGCC TTTTCCAACC TCAATAGTGA CCCTTTATTT TTTAACCCGT TTTTTATCGA CCGGATCAGC AAAAATCCTT TCAATTTAAA TGAGCGTACC TATCCTGTAG ATCTGGGTGC AGAAAAGGAA ATCCGCATCA ACATGACAAT TAAACTGCCT GATAACTATA ATTTGGCCGA CAAGCCTAAA GAACTGAACA TGGTACTGGC CGATGCGGGT GGCAGGTTTA TCTGTACAAC TGCTGTTGAA GACAATATCC TGCTGTTTAA CCAGCTGATG CAGCTTAACA AACCAATTTA TAGCTCTGCA GAATACCTTT CACTTAAGGA GTTCTACAGC AGGATCATCC AATTGCAGAA AACGGATATT ATCCTTAAAA AATCAAAATA G
|
Protein sequence | MKYLLFLFVF FCSIAVRAQD FGFGEISGDD LDLKKVKTDS NANAVVLKEF GTASVRLDES YGNLYIDFEY HVRIKILNKN GFGSANVVIP QRIYGDKEDM VQNLKAVTIN YIDGQFTQTP LDKKKVFTEK KNKYVVLTKF TMPNLVEGSI IEYSYRLYSH GLFNFRSWEF QSDIPKLYSE YIAIIPALYT YNVSLRGAQK LSSQNAELYK ECLRISGRPY DCSKMTYIMK DIPALVEEDY MTAPSNFRSA INFELSEYYL LSGGKKSVTK EWKDVDFELI NDKSFGSQMK RKDLFKELLP EILKNKTAPL DKAKEIYDYI KRNIKRNGFI GIQSENTIKK ALETHSGNTA DINLALVAAL SAANLDAEAV ILSTRSNGTV NNLYPVITDF NYVIAKVNIE GKSYLLDATE PLMPFGLLPL HCINGQGRVI NLKKPSYWYD LKASQKETLR YSLIAELGKD GKIRGNLTIH AIGYAAYNKR KKILAASSVD EYVEKLDESM PQIRILKHAI HNLDSLENLL TENYEVEMSA FSNLNSDPLF FNPFFIDRIS KNPFNLNERT YPVDLGAEKE IRINMTIKLP DNYNLADKPK ELNMVLADAG GRFICTTAVE DNILLFNQLM QLNKPIYSSA EYLSLKEFYS RIIQLQKTDI ILKKSK
|
| |