Gene Cagg_0300 details

Gene Information

Locus tag: Cagg_0300
Symbol:
ID: 7267481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 376433
End bp: 378961
Gene Length: 2529 bp
Protein Length: 842 aa
Translation table: 11
GC content: 59%
IMG OID: 643565168
Product: von Willebrand factor type A
Protein accession: YP_002461682
Protein GI: 219847249
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
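
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 376433
end_bp = 378961
gene_length_bp = 2529
protein_length_aa = 842

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid to the protein.
codons = span // 3
coding_codons = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("codons excluding stop:", coding_codons)
print("matches gene length:", span == gene_length_bp)
print("matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the reported gene length and protein length agree with the start and end coordinates.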


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTGGC AAGCTCCACA CATGTTTTGG TTATTACTCG CGTTACCGGC ATTGGTGCTC 
ATCTGGCGCC GTGCCGGACG GCGGCTACCT TGGAAGGTGG TTGTCCTGCG ATTAACGATC
CTAACCCTGT TGATCGGTGC GCTCGCAAAC CCGGTGAGGC AACAGGCTGA TACGCCGGCC
ACCGGGCCAT TGCTGGTGAT TTATGATCAA TCGGATAGCC TGACCCCTGC CGGTCAAGCA
GCGATCCGGG CCGAGGCTGA AGCAATTGCC GCCGCAGCCG CCCCCCAAAC ACGTTTGCTC
GCATTTGGGG CGAACGTCGT GGCCGGAAAT GAGCGACTAC CTGACGGCAC CGGGAGCGAT
CTTGCACATG CGTTACGGAC TGCCCGCCGG TTATTGCCGT CAGGTGGGCG AGTAATCTTG
ATCGGTGATG GGCACAACAC AGGGGGTGAC GTGCTGGCTG AAGCGCAACG GGCAGCCGCT
GCCGGTATCC GTATCGATGT ACGGTTTATC GCAGCGCCGC CCACTCCTGA AGTCGCCGTG
ACGCGCCTCG ATGCGCCGGC ACTCGTGCGG AGTGGCGAAC CGTTTGACAT TACCGTCACC
ACAACCTATC AACCAACTGA CGATCCGACA CCAATCGCTG CCGATTTGCG GATATGGATT
GACGAACAAC TGCTCGGCGA GGAATCGGTG GTCATTCCAC CCGGTCAGTC TCAATTTACT
ATCACCTATA CCGCAGAGCA GCCGGGGCTA CTCAGTCTGC GCACCGAGAT CATCCCGACC
GGTAACGACA CCTTCGCCGC GAACAATGTT GCCGCAGCAA CAGCCCTCGT GATGCCACCA
CCCCGCATCT TACTGGTCGA AGGGTTGCGT GACAACGGCA CCATACTCGG TGCAGCGCTT
ACTCAAGCGG GAATGGAGGT CGAGCGGGTG TCGGTTAGTG CCGTACCGGG AGTGCTCAAC
CGTCTCGCCA TGTACGACGG GATCGTATTG GTTGATGTAT CGGCGAATCA ACTCTCATTC
GAGCAGATGA CGGCATTGCG TGAAGTAGTG CGGAGCGAAG GCAAAGGGCT GACGGTCATC
GGCGGCAATC AATCGTTTAC CCTTGGCGGT TATGCCCGCA CGCCGTTGGC CGAAGCGTTG
CCTCTCCTGA TGGAACCTCC ACCACGCCCA CAACGGGCAC CGATTTCGCT CTTGCTGATT
ATTGACCGTT CGGCCAGTAT GAGTGCCTCG TTCGGAGTGA GTAAGTTTGA TTTGGCGAAA
GAGGCCGCCA TTCTTGCCCT CACCGCTCTC CAAGCTGGCG ACCGCATCGG TGTCCTCGCC
TTCGATACCG ACACTATCTG GGTGATACCG TTCCAAGCGG TTGGCGAGGG AGCAGCCGTT
GCCGAATTGC AGACTCGGAT TGCCACGATG GCAATTGGCG GTGGCACGAA CATCGAACGG
GCGCTCGCAG TTGGGCTACC GGCCCTGGCT GCTGAGCCGC ACAGTGTACG CCACGCCGTC
TTACTAACCG ATGGTCGCAG CTATAGCAAC AACTACCCAC GCTACCAGCA GTTGGTTGAG
ACGGCCCGTG CTGCTCAGAT TACGCTTTCG ACCATTGCCA TCGGCACTGA CGCCGATACC
GATCTCCTCG AGCAATTGGC ACGTTGGGGG AATGGACGCT ATTACTTTGT GCCCGACGCC
GCCGATTTAC CGCGCATCAC GTTGCAAGAG AGTGAAATCG CCGGCTCCGA ACTGACCGTC
GAACAACCTT CCCCCGTGCG TCTCAACCAG CCGCATCCGC TGGTACGGAA CTTCGATCCG
AGCACCCTAC CGCTCCTCGA TGGCTATATC GCGTTACAAT CTCGCCCGGA GGCCACCGTA
GTCCTGAGTA GTCCGGCAGA CGATCCATTG TTGGCCGTCT GGCAATACGG CTTAGGTCGC
AGTGTGGCGT GGACGGCCAG CGCTGCGGCG CCGTGGGCTA CACGTTGGCC GGGTTGGTCG
GAGTACGACC GGTTTTGGAA CCAGGTGGTA CAGTACACTA TCCCGACCCC TGACAGTGGG
CCGCTCCAAG TTTGGGTCGA ACCGCTTTCG CGTGGGGTTC GACTGATGGT CGACGCCCAG
ACTATCGGTG GGGCACCGAT TGATTTAGCG CAGGTGAATG CGCAGATCAC GTTCCCTGAC
CAAAACAGCC AGCGCATCAG TTTGTTACAA ATTGGGCCGG GGCGCTACAG TCGTGATGTG
GCGCTTGGTG AAGTTGGTCC GTACCGGGTT GTGGTAACCT TATTTGCCGA TGGGCAAACA
CTGCAACGCA GCATTGGCTA TGTGCAGGCA CCGCCGACCG AATACGCGAT TCACGATCCG
GCGCAGGGTG CGGAGCGCTT GCGTCAGATT GCTGCGATTA CCGGTGGGAG CACCGAGGTG
GTAGTCGTAG ATGAAGCATC GGTAGCGATG CCGGCTTCTC CGCAAGAACT CTGGCCGTGG
CTGGCTGCAC TCGCTCTGGC CTTGTGGGTT GGGGAAATAG CCCTGCGACG TAACCAATTA
TACGAATGA
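
The gene sequence above is laid out in space-separated groups of 10 bases, so it has to be joined before it can be measured. The following is a minimal sketch of how that could be done. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool behind this record. Only the first and last lines of the sequence are used, as a short demonstration.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGATTTGGC AAGCTCCACA CATGTTTTGG TTATTACTCG CGTTACCGGC ATTGGTGCTC"
last_line = "TACGAATGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print("bases in first line:", len(head))
print("first codon:", head[:3])
print("last codon:", tail[-3:])
print("GC fraction of first line:", round(gc_fraction(head), 3))
```

Passing the full sequence block through `clean_sequence` and `gc_fraction` would give values to compare with the Gene Length and GC content fields reported above.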
 
Protein sequence
MIWQAPHMFW LLLALPALVL IWRRAGRRLP WKVVVLRLTI LTLLIGALAN PVRQQADTPA 
TGPLLVIYDQ SDSLTPAGQA AIRAEAEAIA AAAAPQTRLL AFGANVVAGN ERLPDGTGSD
LAHALRTARR LLPSGGRVIL IGDGHNTGGD VLAEAQRAAA AGIRIDVRFI AAPPTPEVAV
TRLDAPALVR SGEPFDITVT TTYQPTDDPT PIAADLRIWI DEQLLGEESV VIPPGQSQFT
ITYTAEQPGL LSLRTEIIPT GNDTFAANNV AAATALVMPP PRILLVEGLR DNGTILGAAL
TQAGMEVERV SVSAVPGVLN RLAMYDGIVL VDVSANQLSF EQMTALREVV RSEGKGLTVI
GGNQSFTLGG YARTPLAEAL PLLMEPPPRP QRAPISLLLI IDRSASMSAS FGVSKFDLAK
EAAILALTAL QAGDRIGVLA FDTDTIWVIP FQAVGEGAAV AELQTRIATM AIGGGTNIER
ALAVGLPALA AEPHSVRHAV LLTDGRSYSN NYPRYQQLVE TARAAQITLS TIAIGTDADT
DLLEQLARWG NGRYYFVPDA ADLPRITLQE SEIAGSELTV EQPSPVRLNQ PHPLVRNFDP
STLPLLDGYI ALQSRPEATV VLSSPADDPL LAVWQYGLGR SVAWTASAAA PWATRWPGWS
EYDRFWNQVV QYTIPTPDSG PLQVWVEPLS RGVRLMVDAQ TIGGAPIDLA QVNAQITFPD
QNSQRISLLQ IGPGRYSRDV ALGEVGPYRV VVTLFADGQT LQRSIGYVQA PPTEYAIHDP
AQGAERLRQI AAITGGSTEV VVVDEASVAM PASPQELWPW LAALALALWV GEIALRRNQL
YE