Gene Information |
Locus tag | Gbro_1379 |
Symbol | |
ID | 8550725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gordonia bronchialis DSM 43247 |
Kingdom | Bacteria |
Replicon accession | NC_013441 |
Strand | + |
Start bp | 1447522 |
End bp | 1449525 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003272555 |
Protein GI | 262201347 |
COG category | |
COG ID | |
TIGRFAM ID | |
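The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon, which is consistent with the values in this record but is not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1447522
END_BP = 1449525


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2004, matching "Gene Length"
print(protein_length(length_bp))  # 667, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the table.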
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCA AGGGTTTCCA CCGATCACGC TATCAGCGCT ACACCGGCGG TCCCGATCCG CTGGCACCCC CGGTCGATCT CCGGGAGGCG CTCGGTGAGA TCGGTGACGA CGTGATGGCC GGGGTGTCGC CGCAGCGTGC GCTGCGCGAG TTCCTGCGGC GCGGTACGCA GAACATGCGT GGCCTGGACA AGCTGCGCGA GCAGGTGAAC CGGCGTCGGC AGGAACTCCT CAAGCGACGC AACCTCGACG GCACCTTCGC CGAGATCCGC GAGTTGCTCG ACCGCGCCGT TCTCGAGGAA CGCAAACAAC TCGCCCGTGA CCTCGACGAC GACGCCCGGT TCGCCGAGAT GCAGATCGGG AGCCTCCCGG CGTCGACCGC GCAGGCCGTC GAAGAACTCG CCGACTACGA ATGGCGTTCG CGGCAGGCCC GGGCGGACTA CGAGAAGATC AAGGACCTGC TCGGCCGCGA ACTGCTCGAC CAGCGTTTTG CCGGCATGAA ACAGGCCCTC GAAGGTGCCA CCGAGCAAGA CCGGCAACGT ATCTCGGAGA TGCTGGCCGA CCTGAACAAG CTTCTCGACG CACACAACCG TGGTGAGGAC ACCACGCAGC AGTTCGAGGA GTTCATGGAT GCCCACGGGG AGTACTTCCC CGAGAATCCG CGCAACACCG ACGAACTCAT CGATTCGCTG GCGCAGCGCG CCGCCGCGGC GCAGCAGTTC TACAATTCGC TGACACCCGA CCAGCGTGCC GAACTCGACC AGCTCGCCCA ACAGGCGTTC GGTTCACCGG ACCTGATGAG TCAACTGGCG CAGATGGATT CGGCGTTGCG GCAGGCCCTG CCGGGGCTGG ACTGGGATGA CGCGCAATCC TTCTCCGGTG ATCAGCCGAT GGGTCTGGGG GAGGGGGCGG CTGCGCTGCG CGACATCTCC GAACTCGAAG CGCTCTCCGA GCAGTTGTCA CAGCAGTACG CGGGCGCGCA GATGGACGAC ATCGACCTCG ATGCGCTTGC CCGTCAGCTG GGCGACGAGG CCGCGGTGGA TGCCCGCACG CTGCAGGAAT TGGAGCGGGC GCTCAATGAG GAGGGCTTCT TCGATCGCAC GTCCGACGGT CAGCTGCGGC TGAGTCCGAA GGCGATGCGG CAGTTGGGGC AGACGATCTT CCGTGATGTC GCCGAACAGC TCAGCGGGCG CCGTGGCGAT CGTCAGACGC GACGCAGCGG ACTACTCGGT GAGCCGTCGG GTGCCACGCG CGAGTGGGTG TTCGGTGACA CCGATCCGTG GGATGTCACC CGTACCGTGT CGAATGCGGT GTTGCGCACC ATCTCCGAGA GCACCGAACC GGCGTTGGCC AGCGCCGAGA TGGCCCGCGA CGGGGTGCGG ATCACCGTGC GCGACGTCGA GGTCGCCGAG ACCGAGAGCC GCACCCAGGC CGCGGTGGTC CTGCTCGTGG ACACTTCCTT CTCGATGGAG ATGGAAGGTC GCTGGACGCC GATGAAGCGC ACCGCGATCG CGCTCAACCA TCTGATCTCC ACCCGATTCC GCAGCGACGA ACTGCATCTC ATCGCGTTCG GACGCTACGC CCGCTCGATC GACATCGCCG AACTGACCGG ACTGCAACCA CGGATGGAGC AGGGCACCAA CCTGCACCAT GCACTACTTC TCGCGCAGCG TCATCTGCGT CGATTCCCCA ATGCGCAGCC GGTGGTCCTG GTCGTCACCG ACGGTGAGCC GACCGCACAT CTCGATCCCA GCGGCGAACC GTTCTTCTTC TACCCACCGC ACCCGCAGAC GATCGCTCTG 
ACCGTGCGCG AACTCGACCA CGTTGCACGA CTCGGTGCGC AGGTCACCTT CTTCCGGCTC GGTGAGGACC CGGGGCTGGC CCACTTCATG GACCAGATCG CGCGGCGGAT CGGCGGACGG GTGGTGGCAC CCGACGTGGA CGGACTCGGT GCGGCCGTCG TCGGCGACTA TCTGCGCTCC CGCCGGGGTC GTCGGCGCGG CTGA
|
Protein sequence | MARKGFHRSR YQRYTGGPDP LAPPVDLREA LGEIGDDVMA GVSPQRALRE FLRRGTQNMR GLDKLREQVN RRRQELLKRR NLDGTFAEIR ELLDRAVLEE RKQLARDLDD DARFAEMQIG SLPASTAQAV EELADYEWRS RQARADYEKI KDLLGRELLD QRFAGMKQAL EGATEQDRQR ISEMLADLNK LLDAHNRGED TTQQFEEFMD AHGEYFPENP RNTDELIDSL AQRAAAAQQF YNSLTPDQRA ELDQLAQQAF GSPDLMSQLA QMDSALRQAL PGLDWDDAQS FSGDQPMGLG EGAAALRDIS ELEALSEQLS QQYAGAQMDD IDLDALARQL GDEAAVDART LQELERALNE EGFFDRTSDG QLRLSPKAMR QLGQTIFRDV AEQLSGRRGD RQTRRSGLLG EPSGATREWV FGDTDPWDVT RTVSNAVLRT ISESTEPALA SAEMARDGVR ITVRDVEVAE TESRTQAAVV LLVDTSFSME MEGRWTPMKR TAIALNHLIS TRFRSDELHL IAFGRYARSI DIAELTGLQP RMEQGTNLHH ALLLAQRHLR RFPNAQPVVL VVTDGEPTAH LDPSGEPFFF YPPHPQTIAL TVRELDHVAR LGAQVTFFRL GEDPGLAHFM DQIARRIGGR VVAPDVDGLG AAVVGDYLRS RRGRRRG