Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1135 |
Symbol | |
ID | 7267883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1401915 |
End bp | 1403255 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643565978 |
Product | von Willebrand factor type A |
Protein accession | YP_002462481 |
Protein GI | 219848048 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0216745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTGC ATGTGCAGTG TGATCCGCAA CCACTGCAAT TACCACCACT GACACAGCCA CAGGTGGCGT ATGTTCACCT GGTGATTAGC ACTCAGGGCG ACCGCACGTT GCCGCTCCAC CTTGTGGTGG TCGCCGATGC CAGTCGCTCG ATGCGCATTC CCATTGTCGA CGAGCACCGG TTTCGCGAGT TGGTGCGGAA CGGTGGCGCT CACGAGGTCT TGGTCGATGG AGTACCGGTT TGGCAATTGG CAAATCCCCT AAGCAGTGAG GCACGCAGTC AATTTTCTAG TCCGATTGAC TATACCGTGC GCGCATTACA CAGTGTGGTT GAACGGCTCA CTCCTGACGA CCGGATGGCA CTCATCGCCT GCGCTAGCGA TGCGCTCGTC CTCGCCCCAA GCACACCCGG CCACCGACGC ACCGACTTGA TCGGAGCGAT TGCTCGCTTG CCCGTGCTAC GGCTTGGCGA GAGCACCAAT CTGGCCCAAG GGTTGCAACT AGCGCTGGCC CAGTTTGTGG TCACCGATGA ACCGGCAGTA CGACGGGTGG TGTTACTCAC CGATGGCTTC ACAACGGATA CCACCATGTG TACCGCATTG GCCCGCGAAG CAGCAGACCG GAGTATTACG ATCAGTACGA TCGGGCTTGG AAATACATTT GAAGAAACGC TCCTCACGCA GATTGCCGAT CTTAGCGGTG GTCGTGCCAG TTTTGTCCAA GAGGCCGGTC ATATTCCGAC GATTATCTCC GCCGAGTTGG AGCATGCGCG CCAGACCACT ATCCACGCGC TGAGTCTGCA CATGACGTTA CCGCAGACGG TGACACTACG CCGGATCACC CGCCTCTCCC CAACCTTGAG TGTACTTACG CCGTTAAGCA CCGAACATGG GCGGCGCCTG ACCCTACACC TCGGCGATCT GCGCCGTGGC GACGCAGTGC GCTTGCTCTG CGAATTTCTC ATCGCCCCCG GCACTGCGGG GAGCCAACGC CGGCTAGCCC GCCTGCGCCT GAGGAGTGGG CAACACGAAC AACACCATGA CCTCATCGCG CATTACGATC CGCGCGCCAC GAACCCGCCA CCGGCACTAT TACCACTGAT TACCCACGCT ACCATTGCCC ACCTCCACCG GCGTGCCACC CTCGCCCGCC AGCAAGGCAA CCACGAAACT GCTGCTGTTC TGCTCCACCG GCTTGCGGCC CATCTCCGCA GCCTCGGCGA AACTGAACTG GCCACGTTGG CCCTGCAGGA AGCGAGTACT TCTGGGCAAA TACCACTTCC CAACTTGACA ACCAAAATGC TAACCTACGC TACCCGACGA CTAGGAGAAG CGGGTGATTG A
|
Protein sequence | MLLHVQCDPQ PLQLPPLTQP QVAYVHLVIS TQGDRTLPLH LVVVADASRS MRIPIVDEHR FRELVRNGGA HEVLVDGVPV WQLANPLSSE ARSQFSSPID YTVRALHSVV ERLTPDDRMA LIACASDALV LAPSTPGHRR TDLIGAIARL PVLRLGESTN LAQGLQLALA QFVVTDEPAV RRVVLLTDGF TTDTTMCTAL AREAADRSIT ISTIGLGNTF EETLLTQIAD LSGGRASFVQ EAGHIPTIIS AELEHARQTT IHALSLHMTL PQTVTLRRIT RLSPTLSVLT PLSTEHGRRL TLHLGDLRRG DAVRLLCEFL IAPGTAGSQR RLARLRLRSG QHEQHHDLIA HYDPRATNPP PALLPLITHA TIAHLHRRAT LARQQGNHET AAVLLHRLAA HLRSLGETEL ATLALQEAST SGQIPLPNLT TKMLTYATRR LGEAGD
|
| |