Gene Cagg_3755 details


Gene Information

Locus tag: Cagg_3755
Symbol:
ID: 7267828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4576255
End bp: 4579131
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 57%
IMG OID: 643568562
Product: von Willebrand factor type A
Protein accession: YP_002465027
Protein GI: 219850594
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
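
The coordinate and length fields above are mutually consistent, and the relationship can be sketched in a few lines of Python. This is only an illustrative check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, but both agree with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(4576255, 4579131)
print(length_bp)                  # 2877
print(protein_length(length_bp))  # 958
```

Both results match the Gene Length and Protein Length fields of the record.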


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00649102
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTATA TCTCGTTCAT CTACCCCGAA GCGTTGTGGT TACTCATCAT ACCGGTTGTG 
ATGATTGCGG TCGCGCTGTT AGGCCCGCGC CGCTTATCAC CAATGCGGTT CTGGGGTAGC
TTGATCGTGC GTACCCTCAT AGCGGTATTC CTGGTGTTTG CCTTCGCAGG TATTCAATTT
ATCCGCCCGG TTGATCGGCT CACAACGGTT TTTTTACTCG ATGGCAGTGA CTCAATGCCG
GCATCGACCC GTGCTCAGGC CGAAGCCTTT ATCCGTGCGG CGTTGCAAGA GATGCCGCCG
GACGATCAAG CGGCAATCGT TGTCTTCGGT GGTAATGCCT TGGTTGAACG AGCACCCGAT
AGTGATCGGC GATTAGGGCG CATTACTTCG ATTCCCATCA CCAACCGTAC CAACATTGAA
GCCGCTATCC AACTGGGCAT GGCATTATTC CCCGCCGACT CGCAAAAGCG GCTCGTGTTA
CTCTCCGATG GTGGCGAAAA TAGTGGGCGG GCGATTGATG TGGCCCGTTT GGCGGCCAGT
CGTGGTATTC CTATTGATAT TGTCGATCTG GCCCTCGTAG AGACCGACGC AGAAGCATTG
GTTGCCAGCG TTGAAGCACC GAATGGTGTC CGCGATGGTC AAGAGGCGTT GATCGTGGCA
ACTGTCGAGA GCACGGTAGC GCAGCGTGCA ACGGTACGCC TGATCGATGA TGTCGGGGTC
GTCGCGGAAC GTGAACTCGA TCTCGGACCC GGTGCGACAC GGGTTGAATT CGTGGTTCCG
ATTAAGGGGA GTGGTTTCCA ACGCTATCGG GTGCAAGTGG AGGCTGCGCA AGATGGGCGG
GTACAGAATA ATGAAGCAGC AGCCTTGATC CGAGTTCAAG GGCCGCCGCG CGTCTTGTTG
GTGGCACGCA ACGCCGCCGA TGCTCGTCCG TTAGCGACGG CGCTTACTGC GGCTGATATT
GTGGCCGAGA TTATTGCGCC GGAGGCTGCC CCGCGTTCGT TGGCCGATCT CAGCGCCTAC
GATGCGTTGG TATTGGTAAA TACACCTGCT CGGGCATTGC CGGTTGGGCT GATGCAGGCT
ATCCCCGGTT ATGTGCGCGA TCTCGGTCGT GGTTTGTTGA TGATCGGTGG GGAAGAAAGT
TTTGGCGTCG GTGGCTATGG TCGTACTGCG GTCGAAGAGG CATTGCCGGT CTATATGGAC
GTTCGCAACC GTGAGCTACG GCCCGATCTG GCAATTGTGT TTGTGATCGA TAAGTCAGGC
TCGATGGATG CTTGTCATTG TGCCAATCCC GATCGCGGCG GCCCCATTAC CTCTTCGAGC
GAGCGAAAAA TTGATATTGC GAAAGATGCG GTTGCGCAAG CGACGGCATT GCTCAGTCCA
CAAGATACGG TTGGGGTGGT GACGTTCGAC GGTGCTGCTT TTCCAACGTT TGTCGCTACA
CGCGGGGCGA CTGTTGAACA AGTGATGGAT GCAGTGTCCG GCGTTGAGCC GCGCGGACCG
ACCAATATTC GCGCCGGTCT GCTGCGCGCC GAAGAGATGC TCCAGCAGGT CGATGCTCGC
ATTAAGCATA TGATTTTGCT GACCGATGGT TGGGGGAGCG GTGGTGATCA GCTCGATATT
GCGGCTCGTC TGCGTGAGCA GGGAATTACA TTGACTGTTG TAGCAGCGGG AAGTGGTTCG
GCGACCTATC TGCAACAGTT GGCGGCTGAA GGTGGTGGTC GTTATTATCC GGCGGCTGAT
ATGGCCGATG TACCGCAAAT TTTCGTCCAA GAGACCATTA CGGCGATTGG TAATTACATC
GTTGAGCAGC CATTTGTGCC GGTACGCTAC AGTAACAGCC CTATTTTGGC CGGAATTAAC
GCAGTGCCAC CACTCTACGG TTACAACGGC AGCACGCTCA AAGATACTGC GCAGTTAATC
CTGGCGACCG AAGATCAGCA ACCAATACTG GCAACCTGGC AGTATGGTTT GGGGCGCAGT
GCGGCCTGGC TTAGCGATAC CAAAGGCAAG TGGGCGCGTG ATTGGTTGAC ATGGGACGGA
TTTCCACGGT TCGCCGCTCA GTTAATCGGG GATCTGACAC CACGTGGTGG TTCAGATGTC
CGCGCCGAGG TGACGGTTGC CGGTGGGGAG ACGGTGGTGC GACTGATGAC GGCGGCCGGG
CAAAATGATT TGACGGTGAC GGCTACACTG ATCGGTGGTG ATGGTTCGCG TCATGAAGTG
CGTTTGACGC AGGTTGCACC AAACCAGTAT CAAGCGCGGC TAGAAAGCCC CGTACCCGGC
ACGTATTTAG TGCAAATTGC CGGCAATCGC GGCGATCGGG TGGTCGTCCA AGAGACGGCC
GGAATGGTTG TGCCGTATTC TTCGGAATAT CGCAGTGCTC AAGCCAACCC CGGCCTATTG
GCCGAACTGG CGAATGTGAC CAAAGGTCGG TTTATTGAAC AACCGACTGA GGTGTTCAGT
CGGATCAATC TGGTTTTCTC GGCACAAGAG ATTGCGTTGC CGCTTCTCCT ATTGGCGTTG
ATCCTGTTGC CCTTCGATAT CGCCTTGCGT CGGCTGATGT TGCGGCGGAG CGATTTTGGC
GTGCTCGGTC GATTAGCCGG ACGCTTCCAA CCGGCCGGAT CAGTGCCGGT TGCGCCTGAT
CCGGTGCTTG GACGATTGCG GTCTGCTCGT GATCAGGCTC GCCGCCGGAT GGCCGGAGAG
CAGCAACCGC TTACGCCACC CCCGGCGGTT CCTTCGTTAT CTACCTCACC GTCTCATCCG
GTAGACCCTT CAAAGACGGA GACTGCAGAC GCGCTGGCGC GCTTGCGGGC TGCCAAAGAG
CGTGCGCGTC GCCGAATCAC CGGCGATGAC AATGACGGAG GGGGAACTAC CTCGTAA
 
Protein sequence
MPYISFIYPE ALWLLIIPVV MIAVALLGPR RLSPMRFWGS LIVRTLIAVF LVFAFAGIQF 
IRPVDRLTTV FLLDGSDSMP ASTRAQAEAF IRAALQEMPP DDQAAIVVFG GNALVERAPD
SDRRLGRITS IPITNRTNIE AAIQLGMALF PADSQKRLVL LSDGGENSGR AIDVARLAAS
RGIPIDIVDL ALVETDAEAL VASVEAPNGV RDGQEALIVA TVESTVAQRA TVRLIDDVGV
VAERELDLGP GATRVEFVVP IKGSGFQRYR VQVEAAQDGR VQNNEAAALI RVQGPPRVLL
VARNAADARP LATALTAADI VAEIIAPEAA PRSLADLSAY DALVLVNTPA RALPVGLMQA
IPGYVRDLGR GLLMIGGEES FGVGGYGRTA VEEALPVYMD VRNRELRPDL AIVFVIDKSG
SMDACHCANP DRGGPITSSS ERKIDIAKDA VAQATALLSP QDTVGVVTFD GAAFPTFVAT
RGATVEQVMD AVSGVEPRGP TNIRAGLLRA EEMLQQVDAR IKHMILLTDG WGSGGDQLDI
AARLREQGIT LTVVAAGSGS ATYLQQLAAE GGGRYYPAAD MADVPQIFVQ ETITAIGNYI
VEQPFVPVRY SNSPILAGIN AVPPLYGYNG STLKDTAQLI LATEDQQPIL ATWQYGLGRS
AAWLSDTKGK WARDWLTWDG FPRFAAQLIG DLTPRGGSDV RAEVTVAGGE TVVRLMTAAG
QNDLTVTATL IGGDGSRHEV RLTQVAPNQY QARLESPVPG TYLVQIAGNR GDRVVVQETA
GMVVPYSSEY RSAQANPGLL AELANVTKGR FIEQPTEVFS RINLVFSAQE IALPLLLLAL
ILLPFDIALR RLMLRRSDFG VLGRLAGRFQ PAGSVPVAPD PVLGRLRSAR DQARRRMAGE
QQPLTPPPAV PSLSTSPSHP VDPSKTETAD ALARLRAAKE RARRRITGDD NDGGGTTS
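
The sequence blocks above are printed in groups of ten characters. A minimal Python sketch, assuming the blocks are pasted in as plain strings, shows one way to strip that grouping and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence from this record, used as a short demo.
first_line = "ATGCCCTATA TCTCGTTCAT CTACCCCGAA GCGTTGTGGT TACTCATCAT ACCGGTTGTG"
dna = clean_sequence(first_line)
print(len(dna))         # 60 bases once the grouping spaces are removed
print(gc_percent(dna))  # GC percentage of this first line only
```

Running the same helpers on the full gene sequence block should give a length equal to the Gene Length field and a GC percentage close to the rounded GC content value.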