Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0723 |
Symbol | |
ID | 7266975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 901446 |
End bp | 902705 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643565574 |
Product | von Willebrand factor type A |
Protein accession | YP_002462083 |
Protein GI | 219847650 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0959383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAG AAGTTTCTCT GCGCGCCGTG TTAGCGCGTC CCTTTTTAGC AGCCACAACT ACGCCACAAG TGGCCTACGT GCTGCTCGAA GCGCAGCCGG CGCCACAGAT GACGCAGGTG CGAATGCCGG TCAATGTCTG TTTTGTGCTC GACCGGAGCG GTTCGATGAA GGGTGAGAAG ATCGAGCGAT TGCGACAAGC AGTGGTGAAG GCGATTGAGC TGCTCGATCA GCAGGACTCG CTTGCGATTG TGATTTTCGA TCATCGTACC GAAGTATTGG TTCCGGCTCA GCCGGTGCGT AACCGCGCGA TGATCCTCGA TCTCGTTCAC CGTATTCGTG ATGCCGGTGG GACGCGGATT GCACCTGCGG TCGAAAAAGG GTTGCAGGAG TTGCAGAAGA TGCCGCCGGG TGTACGCCGT CTCATATTAC TTACCGACGG CCAAACCGAG CACGAGAACG AGTGTTTGCT GCGGGCCGAC GATGCCGGGC GGCTTGGTGT GCCGATTACT GCCCTTGGCA TCGGCAAAGA CTGGAACGAA GATCTCTTGA TCGAGATGGC GAATCGCTCG AAGGGAGTGG CCGATTATAT TGCCCAACCC GGCGAGATTG TAAACTATTT TCAACATACC GTGCAGCGTG CCCAACAGAC CGTCATTCAG AACAGTGTGC TTACCTTGCG GTTCGTGCAG GGCGTAACGC CGCGCGCCGT CTGGCAAGTA ACACCGCTGA TCGACAATCT CGGCTACCAA CCGATCGGTG ACCGGGCGGT GAGTGTGAAA CTCGGCGAAC TCGAAGGTTC GCAACCCCGG ATACTCCTGA TCGAACTGTT GATCGATCCC CGTCCGCTCG GCACCTACCG GATCGGGCAG GCCGAGTTGA GCTACGACGC GCCGGCGTTG CAGTTGGTGG GAGAAAAAGC CAAGTTGGAT ATTATGCTGA CCTTTACCAA TGATCCGGCT CAACTGCAAC AAGTGAATCC GACGGTGATG AACATCGTCG AAAAGGTGAG TGCGTTCAAG CTCCAAACCC GCGCCTTGCA AGACCTTGCC GCCGGCGATG TGAGTGGCGC AACCCAGAAG CTGAAGAGCG CGGTGACGCG CTTACTCAAT CAGGGAGAAG TGGAGTTAGC GGCGACGATG CAGCAAGAGA TTGCCAACCT CGAACAACAA GGCCAAATGT CGAGCGAAGG TCAAAAGACG ATCAAGTTTC AGGGCCGCAA GACGGTGCGC TTAACCGACA TCGAATTGCC GAAGGAGTGA
|
Protein sequence | MAGEVSLRAV LARPFLAATT TPQVAYVLLE AQPAPQMTQV RMPVNVCFVL DRSGSMKGEK IERLRQAVVK AIELLDQQDS LAIVIFDHRT EVLVPAQPVR NRAMILDLVH RIRDAGGTRI APAVEKGLQE LQKMPPGVRR LILLTDGQTE HENECLLRAD DAGRLGVPIT ALGIGKDWNE DLLIEMANRS KGVADYIAQP GEIVNYFQHT VQRAQQTVIQ NSVLTLRFVQ GVTPRAVWQV TPLIDNLGYQ PIGDRAVSVK LGELEGSQPR ILLIELLIDP RPLGTYRIGQ AELSYDAPAL QLVGEKAKLD IMLTFTNDPA QLQQVNPTVM NIVEKVSAFK LQTRALQDLA AGDVSGATQK LKSAVTRLLN QGEVELAATM QQEIANLEQQ GQMSSEGQKT IKFQGRKTVR LTDIELPKE
|
| |