Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3755 |
Symbol | |
ID | 7267828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4576255 |
End bp | 4579131 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643568562 |
Product | von Willebrand factor type A |
Protein accession | YP_002465027 |
Protein GI | 219850594 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00649102 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTATA TCTCGTTCAT CTACCCCGAA GCGTTGTGGT TACTCATCAT ACCGGTTGTG ATGATTGCGG TCGCGCTGTT AGGCCCGCGC CGCTTATCAC CAATGCGGTT CTGGGGTAGC TTGATCGTGC GTACCCTCAT AGCGGTATTC CTGGTGTTTG CCTTCGCAGG TATTCAATTT ATCCGCCCGG TTGATCGGCT CACAACGGTT TTTTTACTCG ATGGCAGTGA CTCAATGCCG GCATCGACCC GTGCTCAGGC CGAAGCCTTT ATCCGTGCGG CGTTGCAAGA GATGCCGCCG GACGATCAAG CGGCAATCGT TGTCTTCGGT GGTAATGCCT TGGTTGAACG AGCACCCGAT AGTGATCGGC GATTAGGGCG CATTACTTCG ATTCCCATCA CCAACCGTAC CAACATTGAA GCCGCTATCC AACTGGGCAT GGCATTATTC CCCGCCGACT CGCAAAAGCG GCTCGTGTTA CTCTCCGATG GTGGCGAAAA TAGTGGGCGG GCGATTGATG TGGCCCGTTT GGCGGCCAGT CGTGGTATTC CTATTGATAT TGTCGATCTG GCCCTCGTAG AGACCGACGC AGAAGCATTG GTTGCCAGCG TTGAAGCACC GAATGGTGTC CGCGATGGTC AAGAGGCGTT GATCGTGGCA ACTGTCGAGA GCACGGTAGC GCAGCGTGCA ACGGTACGCC TGATCGATGA TGTCGGGGTC GTCGCGGAAC GTGAACTCGA TCTCGGACCC GGTGCGACAC GGGTTGAATT CGTGGTTCCG ATTAAGGGGA GTGGTTTCCA ACGCTATCGG GTGCAAGTGG AGGCTGCGCA AGATGGGCGG GTACAGAATA ATGAAGCAGC AGCCTTGATC CGAGTTCAAG GGCCGCCGCG CGTCTTGTTG GTGGCACGCA ACGCCGCCGA TGCTCGTCCG TTAGCGACGG CGCTTACTGC GGCTGATATT GTGGCCGAGA TTATTGCGCC GGAGGCTGCC CCGCGTTCGT TGGCCGATCT CAGCGCCTAC GATGCGTTGG TATTGGTAAA TACACCTGCT CGGGCATTGC CGGTTGGGCT GATGCAGGCT ATCCCCGGTT ATGTGCGCGA TCTCGGTCGT GGTTTGTTGA TGATCGGTGG GGAAGAAAGT TTTGGCGTCG GTGGCTATGG TCGTACTGCG GTCGAAGAGG CATTGCCGGT CTATATGGAC GTTCGCAACC GTGAGCTACG GCCCGATCTG GCAATTGTGT TTGTGATCGA TAAGTCAGGC TCGATGGATG CTTGTCATTG TGCCAATCCC GATCGCGGCG GCCCCATTAC CTCTTCGAGC GAGCGAAAAA TTGATATTGC GAAAGATGCG GTTGCGCAAG CGACGGCATT GCTCAGTCCA CAAGATACGG TTGGGGTGGT GACGTTCGAC GGTGCTGCTT TTCCAACGTT TGTCGCTACA CGCGGGGCGA CTGTTGAACA AGTGATGGAT GCAGTGTCCG GCGTTGAGCC GCGCGGACCG ACCAATATTC GCGCCGGTCT GCTGCGCGCC GAAGAGATGC TCCAGCAGGT CGATGCTCGC ATTAAGCATA TGATTTTGCT GACCGATGGT TGGGGGAGCG GTGGTGATCA GCTCGATATT GCGGCTCGTC TGCGTGAGCA GGGAATTACA TTGACTGTTG TAGCAGCGGG AAGTGGTTCG GCGACCTATC TGCAACAGTT GGCGGCTGAA GGTGGTGGTC GTTATTATCC GGCGGCTGAT ATGGCCGATG TACCGCAAAT TTTCGTCCAA GAGACCATTA CGGCGATTGG TAATTACATC GTTGAGCAGC CATTTGTGCC GGTACGCTAC AGTAACAGCC CTATTTTGGC CGGAATTAAC GCAGTGCCAC CACTCTACGG TTACAACGGC AGCACGCTCA AAGATACTGC GCAGTTAATC CTGGCGACCG AAGATCAGCA ACCAATACTG GCAACCTGGC AGTATGGTTT GGGGCGCAGT GCGGCCTGGC TTAGCGATAC CAAAGGCAAG TGGGCGCGTG ATTGGTTGAC ATGGGACGGA TTTCCACGGT TCGCCGCTCA GTTAATCGGG GATCTGACAC CACGTGGTGG TTCAGATGTC CGCGCCGAGG TGACGGTTGC CGGTGGGGAG ACGGTGGTGC GACTGATGAC GGCGGCCGGG CAAAATGATT TGACGGTGAC GGCTACACTG ATCGGTGGTG ATGGTTCGCG TCATGAAGTG CGTTTGACGC AGGTTGCACC AAACCAGTAT CAAGCGCGGC TAGAAAGCCC CGTACCCGGC ACGTATTTAG TGCAAATTGC CGGCAATCGC GGCGATCGGG TGGTCGTCCA AGAGACGGCC GGAATGGTTG TGCCGTATTC TTCGGAATAT CGCAGTGCTC AAGCCAACCC CGGCCTATTG GCCGAACTGG CGAATGTGAC CAAAGGTCGG TTTATTGAAC AACCGACTGA GGTGTTCAGT CGGATCAATC TGGTTTTCTC GGCACAAGAG ATTGCGTTGC CGCTTCTCCT ATTGGCGTTG ATCCTGTTGC CCTTCGATAT CGCCTTGCGT CGGCTGATGT TGCGGCGGAG CGATTTTGGC GTGCTCGGTC GATTAGCCGG ACGCTTCCAA CCGGCCGGAT CAGTGCCGGT TGCGCCTGAT CCGGTGCTTG GACGATTGCG GTCTGCTCGT GATCAGGCTC GCCGCCGGAT GGCCGGAGAG CAGCAACCGC TTACGCCACC CCCGGCGGTT CCTTCGTTAT CTACCTCACC GTCTCATCCG GTAGACCCTT CAAAGACGGA GACTGCAGAC GCGCTGGCGC GCTTGCGGGC TGCCAAAGAG CGTGCGCGTC GCCGAATCAC CGGCGATGAC AATGACGGAG GGGGAACTAC CTCGTAA
|
Protein sequence | MPYISFIYPE ALWLLIIPVV MIAVALLGPR RLSPMRFWGS LIVRTLIAVF LVFAFAGIQF IRPVDRLTTV FLLDGSDSMP ASTRAQAEAF IRAALQEMPP DDQAAIVVFG GNALVERAPD SDRRLGRITS IPITNRTNIE AAIQLGMALF PADSQKRLVL LSDGGENSGR AIDVARLAAS RGIPIDIVDL ALVETDAEAL VASVEAPNGV RDGQEALIVA TVESTVAQRA TVRLIDDVGV VAERELDLGP GATRVEFVVP IKGSGFQRYR VQVEAAQDGR VQNNEAAALI RVQGPPRVLL VARNAADARP LATALTAADI VAEIIAPEAA PRSLADLSAY DALVLVNTPA RALPVGLMQA IPGYVRDLGR GLLMIGGEES FGVGGYGRTA VEEALPVYMD VRNRELRPDL AIVFVIDKSG SMDACHCANP DRGGPITSSS ERKIDIAKDA VAQATALLSP QDTVGVVTFD GAAFPTFVAT RGATVEQVMD AVSGVEPRGP TNIRAGLLRA EEMLQQVDAR IKHMILLTDG WGSGGDQLDI AARLREQGIT LTVVAAGSGS ATYLQQLAAE GGGRYYPAAD MADVPQIFVQ ETITAIGNYI VEQPFVPVRY SNSPILAGIN AVPPLYGYNG STLKDTAQLI LATEDQQPIL ATWQYGLGRS AAWLSDTKGK WARDWLTWDG FPRFAAQLIG DLTPRGGSDV RAEVTVAGGE TVVRLMTAAG QNDLTVTATL IGGDGSRHEV RLTQVAPNQY QARLESPVPG TYLVQIAGNR GDRVVVQETA GMVVPYSSEY RSAQANPGLL AELANVTKGR FIEQPTEVFS RINLVFSAQE IALPLLLLAL ILLPFDIALR RLMLRRSDFG VLGRLAGRFQ PAGSVPVAPD PVLGRLRSAR DQARRRMAGE QQPLTPPPAV PSLSTSPSHP VDPSKTETAD ALARLRAAKE RARRRITGDD NDGGGTTS
|
| |