Gene Information |
Locus tag | Phep_1592 |
Symbol | |
ID | 8252694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 1882989 |
End bp | 1884992 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644935246 |
Product | transglutaminase domain protein |
Protein accession | YP_003091867 |
Protein GI | 255531495 |
COG category | |
COG ID | |
TIGRFAM ID | |
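
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1882989
END_BP = 1884992
GENE_LENGTH_BP = 2004
PROTEIN_LENGTH_AA = 667


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1884992 - 1882989 + 1 gives 2004 bp, and 2004 / 3 - 1 gives 667 aa, which agrees with the listed gene and protein lengths.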
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.184226 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA AAAATCTACT CGTATTATGC CTATTGCTAC TGGTATCGTT CTCGTATGCC CAGGAAAATG TAGTCAGCAA ATCAAAAACC TTCAAATATG GAAAAATAGA ACCCGCCGAA TTTGACAGTA AAGGAAGTGG TGCGGATTCG GCTGCAGCAG CCCTGATCTT GTTTGATGTA GGGAGAGGAT ATTTTGAGCT CAGCCCTAAG ACCGGTGACT TTGCATTCGT TTTTGAAAGA CATATCCGTT ACAAGATCAT CAATAAGAGC GGGTACGATT ACGGAAACCT GGAACTGAGG TTTTACAAAC AAAATTCTTC GGAAGAAAAG CTGGACTACA TGGATGCGGC CACATATAAC CTTGAAGGTG ATAAAATTGT GATCAGCAAG ATAAATAAAG ATGCCAAATT CTCAGAAAAA CAGGATAAAA ATTTTACGCT TAAAAAGTTT GCCCTGCCCA ATGTAAAAGA AGGATCTATT GTTGAATATA AATACAAAAC AAAATCCGAT TTTATCTTTA ACCTGCAGCC CTGGTATTTT CAGAGAAGCA TCCCTACCCT ATATTCTGAG TACGAAGTCA ATATTCCGGA ATATTACAAA TACAAGATAA GATCTGGCGG GTACCTGTTT TTAAACCCCA AACAGGAATA TGTGAACGAA ACTTTTAGTT CGAGCGCAGG AACACTGAAT GCCTCATGCT TAAAACTGCA TTATCAGGTT GAAAATGTGC CGGGTTTAAA GAAAGAGAAT TTCATTACTA CGATGGAAGA TTATGTGAGC AAAGTAGGTT TTGAACTGAG CTCTGTAACA GTACCCGGAC AGGTTTACAG GGAATTTACT TCTTCCTGGC CAAAAATTGT TACCGGACTT AAAACAGAAG AAAACTTCGG GGCTTTTATA AATAAAAAAA GTTACAGCAA AAGCATTTTA AAGGACATTG TAAAAACTGC AACACAGCCT GATACGGTAT TACAGCTCAT TTTTAACTAT GTAAAAAATA ACATTAAATG GGACGGTAAT TACCGCTTGT ATACTTCGGA AACCAGCCCA AAAAACATAT TTGAGAAGAA AACCGGTAAC TCGGCCGATA TTAACCTTTG TCTGCTCACA CTTTTAAATG AAGCAAACAT TACAGCATCA CCGGTTTTAC TGAGTACAAG GGAAAATGGG GCACATCCTG GTTTTCCAAT GATTACGGAT TTTAACAATG TGATCGTTCA GGCCGAAATT GGCGATAAAA TGATCCTTTT GGACGCCACA GATAAAGACC ACGCCCTGAA CATGATTGCC TATGAAAACC TAAACCATCA GGGCTTAAAA GTAAACCTGC CTGATGCCAC TGCTGCATGG ATATCACTGG ACGAGGCCAA TCTGAGCAAG ACAAACATCA ATTTAATGCT GACTCTGGAC AAAGAAAACA AATTCAGTGG TAAACTGTAT CTGTCGTCTA CCCATTATGA GGCTTTAAAC CGCAGGGGAA AATACCGTTC GGCGACAAAT GAGACTGATT TTCTGAAAGA TTATAAAACC GACAGACCGG GTCTTGGTAT AAAAAACTAT CAGATCCAGA ACCTGGCCAA CCTGGCCGAA CCTTTGGTGG AAAGCATGGA TGTAACCATT GAGGACAATG TGGAAGAAGC CGGAAACCTG GCCTATTTTG CCCCGCTATT GTTTGAAAGG ACAAAGGAAA ATCCCTTTAA ACTGGAAGAA AGGATTTATC CGGTCGACTT TGCTTATCCT AAGGAAGAAA ATTACCGCAT CACAATCGAT TTTCCAAAAG AATACCATTT GGATAAATCG 
CCCAAAAATG AAAAAGTAGT TTTACCGGAT GATGCTGCTT CCTTTGTTTT CATGTTTGCT GCTGAAGAAA ATAAGCTGAT GATCACCAGC AAAATATCCT TGAAAAAAGC ATTCTTCACG CCGGAAGAAT ACCACTACTT AAAAGAGCTT TTTAAGAATA TTGTAAGAAA ACAGGCAGAA CAAATTGTAT TTAAGAAAAG TTAA
|
Protein sequence | MRKKNLLVLC LLLLVSFSYA QENVVSKSKT FKYGKIEPAE FDSKGSGADS AAAALILFDV GRGYFELSPK TGDFAFVFER HIRYKIINKS GYDYGNLELR FYKQNSSEEK LDYMDAATYN LEGDKIVISK INKDAKFSEK QDKNFTLKKF ALPNVKEGSI VEYKYKTKSD FIFNLQPWYF QRSIPTLYSE YEVNIPEYYK YKIRSGGYLF LNPKQEYVNE TFSSSAGTLN ASCLKLHYQV ENVPGLKKEN FITTMEDYVS KVGFELSSVT VPGQVYREFT SSWPKIVTGL KTEENFGAFI NKKSYSKSIL KDIVKTATQP DTVLQLIFNY VKNNIKWDGN YRLYTSETSP KNIFEKKTGN SADINLCLLT LLNEANITAS PVLLSTRENG AHPGFPMITD FNNVIVQAEI GDKMILLDAT DKDHALNMIA YENLNHQGLK VNLPDATAAW ISLDEANLSK TNINLMLTLD KENKFSGKLY LSSTHYEALN RRGKYRSATN ETDFLKDYKT DRPGLGIKNY QIQNLANLAE PLVESMDVTI EDNVEEAGNL AYFAPLLFER TKENPFKLEE RIYPVDFAYP KEENYRITID FPKEYHLDKS PKNEKVVLPD DAASFVFMFA AEENKLMITS KISLKKAFFT PEEYHYLKEL FKNIVRKQAE QIVFKKS
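
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the method the database used. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-"), and it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons. Table 11's alternative start codons are not handled, which does not matter here because the sequence starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    first_blocks = "ATGCGCAAGA AAAATCTACT CGTATTATGC"
    print(translate(first_blocks))  # MRKKNLLVLC
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_percent` on the same input can be compared with the GC content field.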
|
| |