Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_4236 |
Symbol | |
ID | 5110414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009425 |
Strand | - |
Start bp | 48243 |
End bp | 50198 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640480853 |
Product | von Willebrand factor, type A |
Protein accession | YP_001165515 |
Protein GI | 146284562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.644313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA GTTATGGATT CAGCGGCCTG TTGTGCGCGG CGGCATTAAG TTTGCCTGCT CACGCCGCTG ACACAAAGCC GCTGCTGCAG GAAGGGAAAC ACACGCTGTT CCAGCGCGTG CTCACCTATC CCGGTTGCAT GCTGGCCGCC AAAGCGGGCG AAGCGGGTAA AGAACAGCCT GCGTTCAGCC GCTTTTACGT TTATCAGCGT GAAAAACAGG GCCAGGACGA GTGGTTGCAG GTCGGGCCGG ACAGTTTCGG CCACGTCTCT GGCTGGATGA AGTCCTCCTG CACCGTGGAC TGGAAAGTGC AGCTCACGCT GGCCTTTACC AACCCCGCCG GACGCCACCC GATGCTGTTC TTCAAAGAGA AGGGCGATGT GGAGTCGCTG CTGAACAACG CCAAACCTGC TGCCGCGCTG GAGCCGATGA TCGCCAGTCT GAATCAGAAA AAGCCGGTGC CACAGGTACT GGCTCGTGAA CCAGACTACA TGGTCGATCA GCTGAAAAAC TTCTACCTGC TGCCCGTACT GGGTTCTGAC GATATCTTTA CCGATACCGG TTTCCAGGTA CGGGTGCTGA ACGTGGCTTC GGTCAGTGAG AATGGCAGCG CCACTACCTC GGCTAAAGCG ACCGACGAGA AGAACATGAT GAAGGGCTTC TCGGCGTCGG TGGTGTTCGT GATCGACTCG ACTATCTCGA TGGGGCCGTA CATTGATCGC ACTAAAGAGG CGATCGACAA GATCTACAAA CAGATCGAAA AAGAGCAGTT ACAGGACAAA GTGAAATTTG GCCTGGTGGC CTACCGTTCC AGCGTGAAGG CGGTGCCTGG GCTGGAATAC GATGCCAAAA TGTATGTCGA TCCGAACACG GTGAAAGACG GCAAAGATTT CCTCGCCAAA GTGCATGACC TGAAGCAGGC GACCGTCTCC AGTAGCAAAG TGGATGAAGA CGCCTACGCC GGGGTGATGA CCGCGCTGGA TAAAGTCGAC TGGACGCAGT TTGGTGCGCG CTATGTGGTG TTGATCACCG ATGCGGGCGC ACTGGACGGC ACCGACAGCC TATCTTCCAC CCATCTGGGC GCGGAGCAGG TGCGGCAGGA AGCGGCGTAC CGTGGCGTCG CGCTGTACAC CCTGCACCTG AAAACACCGG ACGGGAAGAA AAATCACGCC TCCGCCGCTG CGCAATATCA GGAGCTGACG CTTAATCCGT TCCTGCATAA GCCGCTGTAC TACCCGATCG ATTCCGGCGA TGTGAACAGC TTCGGCACCA TCGTCGACAG CCTGTCAAAT GCCATTACCG CCCAGATCAA AACCGCCTGG AGCGGGGAAG AGACCGCAGG CAGCGCCCTG GGCGCCAGCC CGGAATACGC CGGTAAGAAA GCCGATCCGC TGCTGAGCGA TGCCGAGAAA CTGAGCAAAG CCATGCGTCT GGCGTACTTG GGTGAAAAGC AGGGTACGCA GGTGCCGCCG GTGTTTAAAT CGTGGATTAG CGATCGCGAT CTGGTGAATC AGAACCTCCC GGCCACCGAA GTCCGCGTCC TGCTGACCAA AAGCGAGCTG AGCGATCTGA ACGACGTGAT GAAGAAGATC GTCAATGCCG CCAACGAAGG GATGATCTCG CCGGATGACA TGTTCGCCAG CCTGCGATCT CTGGCCGCGA CCATGGGCAA CGATCCGAAC CAGGCAAAAG GTAAAAATGC GACCCGCCTC GGTGAAATGG GCCTTCTGGG CGAGTACATC GAAAGCCTGC CTTATCTGAG CGAAGTGCTG AGCCTCGATG AAGAGACCTG GAAGAGCTGG GATGGGCTGG AGCAGGAGCG TTTCATTCGC CGCCTGAACA CCAAACTCAA CTATTACCAG CGCTACAACG AAGATGCTGA TCGCTGGATC GCGCTGGCGC CTGACAGTGA CCCGCGGGAT AACGTCTACC CGGTTCCGCT GGAGAACCTG CCCTGA
|
Protein sequence | MKKSYGFSGL LCAAALSLPA HAADTKPLLQ EGKHTLFQRV LTYPGCMLAA KAGEAGKEQP AFSRFYVYQR EKQGQDEWLQ VGPDSFGHVS GWMKSSCTVD WKVQLTLAFT NPAGRHPMLF FKEKGDVESL LNNAKPAAAL EPMIASLNQK KPVPQVLARE PDYMVDQLKN FYLLPVLGSD DIFTDTGFQV RVLNVASVSE NGSATTSAKA TDEKNMMKGF SASVVFVIDS TISMGPYIDR TKEAIDKIYK QIEKEQLQDK VKFGLVAYRS SVKAVPGLEY DAKMYVDPNT VKDGKDFLAK VHDLKQATVS SSKVDEDAYA GVMTALDKVD WTQFGARYVV LITDAGALDG TDSLSSTHLG AEQVRQEAAY RGVALYTLHL KTPDGKKNHA SAAAQYQELT LNPFLHKPLY YPIDSGDVNS FGTIVDSLSN AITAQIKTAW SGEETAGSAL GASPEYAGKK ADPLLSDAEK LSKAMRLAYL GEKQGTQVPP VFKSWISDRD LVNQNLPATE VRVLLTKSEL SDLNDVMKKI VNAANEGMIS PDDMFASLRS LAATMGNDPN QAKGKNATRL GEMGLLGEYI ESLPYLSEVL SLDEETWKSW DGLEQERFIR RLNTKLNYYQ RYNEDADRWI ALAPDSDPRD NVYPVPLENL P
|
| |