Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0187 |
Symbol | |
ID | 8251272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 217283 |
End bp | 219124 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644933837 |
Product | von Willebrand factor type A |
Protein accession | YP_003090475 |
Protein GI | 255530103 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.592381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATGG TAATGGGTTT CCAGGAAAGC TTTGCCGCAG TAAAGACCAT TACCGGAAAG GTAACTGATA AAGCCGATGG CAAAGGACTG CCGGGTGTAA TGGTAAGTTT GCCCACAGGT AAAACCTATA CCCGGACCGA CAAAGATGGG CGGTATTCCA TTTCTGTAGC AGAGGATGCA GAAAGCTTAA GCTTCTCTTA TATCGGTTAT AAAACACTTA GGGTAAAAAT CGGGAAGTTT TCCAGCCTGA ATGTAGTAAT GGAACATAGC GTCCAAAGTT TGAATGAAGT AGTTGCAGTG GGCTATGGTA CGATGAAAAA AAGTGAAATT GTTGGCGCTG CCCCTGCAAA TGTCATGATA CGCGGCATGG CTTCAGGAAT TTGGGTAGGG CCTGAACGAA ATACAGAAAG TTATGCCGGC ATTAGTGAGA ACGGATTCAG GAACCCCGGC AAAAACCCGC TTTCTACTTT TGCTGTTGAT GTGGACGCGG CTTCCTACAG CAATGTACGC CGTTTCATCA ACAATGGGGG TATGCCACCT AAAGATGCGG TAAGGATTGA AGAAATGATC AATTATTTCG ACTATGAATA TCCGCAGCCC AAAGGAAATG ACCCCGTAAA TATTGTAACA GAAATTGCAG ATGCACCCTG GAATGCAAAC CACAAACTGG TACAGATTGG CCTGCAGGGA AAGAAAATAC CAACAGATAA CCTGCCTGCA TCCAATCTGG TTTTTCTGAT AGATGTATCC GGATCAATGA ACCAGCCCAA TAAGCTTCCG CTGCTGATCG CTTCATTTAA GCTGCTGACG GAGCAGCTGC GTCCTGAGGA TAAAGTAGCC ATTGTGGTGT ATGCCGGAAA TTCAGGGCTG GTACTGCCTT CTACACCCGG TAATGAAAAA ACAAAAATTA AAGAAGCATT AAATAAATTA AGCGCCGGAG GTTCAACCGC CGGTGGCGCA GGCATTCAGC TGGCCTATCA GGTAGCTACA GATAATTTTA TTAAAGGCGG CAACAACAGG ATTATCCTGG CTACAGATGG CGATTTTAAT GTGGGCGCAT CCAGTGATAA GGATATGGAA AGCCTGATTG AAGAAAAACG CAAATCAGGT GTGTTCTTAA CTGTGCTGGG CTATGGCATG GGAAATATGA AAGACAGCAA AATGGAAACA CTGGCCGATA AGGGCAATGG CAATTACGCT TATATTGATA ACATCAGTGA AGCCAGAAAA GTACTGATCA ACGAATTTGG GGGCACCTTA TTTACCATTG CCAAAGATGT GAAACTCCAG CTGGAATTTA ATCCGGACAA AGTACAGGCC TACCGTTTAA TAGGTTACGA AAACAGGTTG CTGCAGGACA GGGATTTTAA TGACGATAAA AAGGATGCCG GTGAAATGGG CTCGGGCCAT ACCGTAACGG CCTTGTATGA AATCATACCT GCTGGTATCA AAAGCAGTTT CCTGGATTCT GTAGATAAAC TAAAATATCA GTTCAATAAA ACCCCGGTTA GGGGCAATGG GAGTGCTGAA ATGCTGACGG TAAAACTGCG TTATAAAACC CCGGATGGCC ATACGAGTAA ACTGATCTCG AAAGCAGTGA TTGACCAGTC GGCCCCTTTT AGTCATACCA GTAACAATTT CAGGTTTGCA GCAGCCGTTG CAGAATTCGG GATGCTGCTC AGGCAATCAG AATTTAAACA GGACGCTTCT TATGGCCAGT TGATCGGCAT TGCAGAAAAT GCGAAGGGAA AAGACAAAGA AGGTTATCGC TCAGAATTTA TAAAACTGGC TAAGTCGGCA AAATTAATGG CAGAAGAATT GCTATCTTCA GCCAAAAAAT AA
|
Protein sequence | MLMVMGFQES FAAVKTITGK VTDKADGKGL PGVMVSLPTG KTYTRTDKDG RYSISVAEDA ESLSFSYIGY KTLRVKIGKF SSLNVVMEHS VQSLNEVVAV GYGTMKKSEI VGAAPANVMI RGMASGIWVG PERNTESYAG ISENGFRNPG KNPLSTFAVD VDAASYSNVR RFINNGGMPP KDAVRIEEMI NYFDYEYPQP KGNDPVNIVT EIADAPWNAN HKLVQIGLQG KKIPTDNLPA SNLVFLIDVS GSMNQPNKLP LLIASFKLLT EQLRPEDKVA IVVYAGNSGL VLPSTPGNEK TKIKEALNKL SAGGSTAGGA GIQLAYQVAT DNFIKGGNNR IILATDGDFN VGASSDKDME SLIEEKRKSG VFLTVLGYGM GNMKDSKMET LADKGNGNYA YIDNISEARK VLINEFGGTL FTIAKDVKLQ LEFNPDKVQA YRLIGYENRL LQDRDFNDDK KDAGEMGSGH TVTALYEIIP AGIKSSFLDS VDKLKYQFNK TPVRGNGSAE MLTVKLRYKT PDGHTSKLIS KAVIDQSAPF SHTSNNFRFA AAVAEFGMLL RQSEFKQDAS YGQLIGIAEN AKGKDKEGYR SEFIKLAKSA KLMAEELLSS AKK
|
| |