Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2021 |
Symbol | |
ID | 8253125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2328702 |
End bp | 2329868 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644935669 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003092288 |
Protein GI | 255531916 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0022628 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAGGA TAGTTATAGC CAGTACCTGC GCAGAGGACT GGGGTGGCAG CGAAGAATTA TGGGGAAGAA GTGTTCCGCT GTTACAAGAG TCTGGTTTTC ATATTACAGT AATAAAATAT TATATCAACA GGGCTCATCC GGAATTTATC AGACTTGCTG AAAGGGGCGT TAATCTGCTC GACATTTTCC CAAAAGGTAC AATAGCAAAA AGAGTTTATA AAAAGAGCTT AAAATTAGTC AATCAGGTGG CGCTGAAACT GAAATTGACC TCAGATCAGG GTGAGGATTT TAGCGCTTTC ATCAAGATCA TGCAGGACAC CAAACCAGCA CTGGTGATCA TCTCACAGGG AATAAATTTT GACGGCCTGA AACTTGCTTA CCAATGTTCC CTCCTTAAAA TACCCTATGT GGTCATTTCC CAAAAGGCCG TAGATTTTTA CTGGCCCCAC AAGGATGATC GTGCCTTTAT GCTCAAAGCA CTGGAAAAAG CGGAAAAATG TTTCTTTGTA TCACACCATA ACCTGCGGTT AACAGAAGAA CAATTTGGTA AAAGACTACC CAACGGGCAG GTCATTTTTA ATCCGGTAAA GCTATCAGGA AACATTGTAC CATTTCCTAA ATCTAAAGAA CCATACAAAT TAGCTTGTTT AGGCAGGCTT TTTTTGCTGG ATAAAGGACA GGATCTATTG ATCCGTATAC TTTCCGAGCA AAAATGGAGA GATCGTCCTG TAAAAGTATC CTTTATTGGA AAAGGTACAG ACGAGGCAGC TTTAAAAGAT ATGGCTAAAC TGTTAAATGT CACCAACGTC GATTTTCTGG GACAGATTGA GGATATTGAA GCGATGTGGG AAGATTATCA TGCCCTTGTT CTTCCTTCCA GAAGTGAAGG TTTACCTCTA TCTATGGTCG AGGCCATGTC TGCTGGCAGG CCCGTAATCA TATCCAATGC AGGCGGGAAT GCAGAACTGG TAGAAGAAGG TGTTACCGCT TTTATCGGTC ACGCCAATGA AGAATCATTC GGGGAGGCAA TGGAACGTGC CTGGCATAAA AGAGAAGAAT GGGAAGAGAT CGGAAAAAAC GGGGCTAAAC ATGTTGCAGA AAACGTCCCA AAATCACCTG AAACAGAGTT CGCAAAGCTA ATTGTTGAGC TTCTTGCAAA TAAATAA
|
Protein sequence | MKRIVIASTC AEDWGGSEEL WGRSVPLLQE SGFHITVIKY YINRAHPEFI RLAERGVNLL DIFPKGTIAK RVYKKSLKLV NQVALKLKLT SDQGEDFSAF IKIMQDTKPA LVIISQGINF DGLKLAYQCS LLKIPYVVIS QKAVDFYWPH KDDRAFMLKA LEKAEKCFFV SHHNLRLTEE QFGKRLPNGQ VIFNPVKLSG NIVPFPKSKE PYKLACLGRL FLLDKGQDLL IRILSEQKWR DRPVKVSFIG KGTDEAALKD MAKLLNVTNV DFLGQIEDIE AMWEDYHALV LPSRSEGLPL SMVEAMSAGR PVIISNAGGN AELVEEGVTA FIGHANEESF GEAMERAWHK REEWEEIGKN GAKHVAENVP KSPETEFAKL IVELLANK
|
| |