Gene Cagg_0715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0715 
Symbol 
ID7266967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp882439 
End bp887565 
Gene Length5127 bp 
Protein Length1708 aa 
Translation table11 
GC content59% 
IMG OID643565566 
Productalpha-2-macroglobulin domain protein 
Protein accessionYP_002462075 
Protein GI219847642 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTTCG GCGTGGTGGT AAGCTTACTC GGGGGGCTGG CCATTCGCTG GTTCGCTCCG 
GCATTACCGT GGGCTGAAAC GCCAACGGTG CAACTGCGTG ATCCCTTACC GCAAAGTCAA
GTTGTACCGC CTCGTAGTAC CATCACGCTC GTCTTTAGCA CGCCGATGAA CCCGTTTACG
GTTGTGCGGG CGTTACGCAT CGATCCGCCA ATCGCCGGCA AACTGGAATG GTCGGAAGAC
CGACGTAGTG TGCGGTTGAT CCCGGATCAA CCGCTGCAAC CGGCTACAAC GTACCGTGTA
CAGGTGATGA CCAACGCGCA GAGCCAATGG TGGCGGCCCC TGGCTCGGTC ACTTGACGTT
GAGTTTACCA CTGCTGCCCA ACCTACCGTC ACAGCGGCAT TGGCCCCGCA GTCGCGGACA
GCCCCAATTG CGCTTATCTT TAGCCGGCCA ATGGTTGACC CGGATACAAT CGGTCAGCCT
GCATCGCTGG AACAGATCCG AATATCGCCA CCGCTGCAAC TGAATGGCGA ATGGCTTGAT
CGGCAAACGT TGCTGTTGCA ACCGGCAACT GCCTTTACGG CCATCACCAC CTACACCCTC
ACCCTCGATG CGAATCTACG CGATGCACGG GGGATCGAAC TTGGCCAACC GTTCCAATGG
CAATTCACAA CGCCATGGCC AATCCTACTC GAACAGACGC CATTGCCTGA CGAACAATGG
GTGAATCCGC AACAGCCTCT GACCTTGGTG TTTGATGCCC CGATTGATAC CCGTCTACTC
AGCTCAGCGT TAACGATTAC CCCGGCCGTC AGCGGCAATT TTAGCAGTTT CACCGATGGC
GACCGCTATA TCACCATCTT TACTCCCCAT AACGGTTGGT CACCCGGCCA GACCTATCAG
ATACGCCTCC AACCCGACCC AAGTGGCGAA CCGGTAGCTG CGTGGCGCTT CACCGTCGAA
CCTGAACCGC TCCTCATTGC GTCATTTCCG GGGCAAGGCC AGACGCTCGC GCCGGGTCAA
GCGATTCGCT TGATCTTCAG CACCCCAATG GACGAAGCCG AGTTACGTGC CGGTTTGCGG
ATTGATCCGC CGGTGGTAAA GCTCGAACTG GAAGTGGATG AGAATCGGAT CACGCTGCAA
CCGCTCTTAC AGGCATCAAC GACATATACC ATCACTATCG CTGCCGGTAC ACGTGATCGC
AGTGGAGTGC CACTCGCCCA CGATGTGTCG CTCACCATTC GTACCGCCAG TGCCTCACCG
CAGTTGCAGG TTATCGGCGA GATCGTTCTC TCGTTTCCCG CCAATGTGCA ACCGGTAGTA
ACTATCGAGC GAATGAATCT GTCGGTGATC GACGGTCAAT TGTACCAACT CGATCCGTCA
ACCCTGATCA GAGCGCTCTC ACTCCGACCA AACGAATGGG CTAGCTTCGT GCCCGAAAGG
TATGGGCAAC CTCTCGTGCG TGCATGGCGT GAGCTACTGA ACGACCCGGC CGATACCCTT
GTGCGTAGTT TGCTACCGAT CACGGCCAAC AGTGCCGGCG ATCCCTTGCC GCCGGGAGCG
TACTATCTGC GGATGACCGC TAGTGCCGGT TTACGGGCTG ACCGGTTATT GCTCATCTCA
TCGCTGAATC TGTCGCTGTT GGTGAACAAT GATGAACTGC TACTGTGGGT GACAACCGGT
GGGACCGGTA CACCCGCCGG CAATATTCCG TTGACGGTCT ATACCGGTGA GACGGTGCTG
GCGCGCGGTA CGAGTGACGA TCAGGGTCTG TGGCGGGTAC CGCTGACCGG CCTGTCAACC
GGGAATAGTA GTTCGCCACT ACCGATAATT GCTTTGGCCG AAGGCAATGG CCTGACGCTG
GCCCGTGCCG AACTCACAAC GACACAACCA CGAGTACAAG CCCTGCTTGC CCCTGATCGG
TTAAGCTATC GTCCAGGCGG CGTCGTGCGG ATCAACGGTG TGGCTCGTGC TCAACAACCT
GATGGTCGGC TGATGCTGCC GACGAGCGGC AATTGTACCA TTCAACTCGA CGGCCCAAAC
CTGAACGACC AACCACCTGC CGTGGACTGT ACGGTGAGTA ATACCGGTAT TGTCAGCGGC
AGTCTCCAAT TGAATGCCCG TACTACACCC GGCCCCTATA CAGCAACCGT CGTTATCGGC
GATAGTGTCT ACCGTCTGCC GATTCGAGTC AGGGGACCGT CTACCGGTAC GGCCGTTCGG
ATCATGCCCG CACGCCCGGC CGGCCTGGCG GTTGATGTGA CCCGTGCCGG CCTACCGGTG
AGCGGTGCAA CCATCAGTTG GACACTCCGC CTCGAAACCC TCTCGGTCGC CGAACTTGAC
GGGGTCAACG GCGTGGTGAT CGGTGGGGTT AGTGAAAACC AGGCCAGTGC CACCACCGAC
GCCAGCGGTC GGGCGATGAT CAATCTGCCG GCCGATGAAA ACCTGCTACG CCCATTACGC
TATCAACTCG CTCTCACCGT CCTCCTCCCC GATGGCGAAC AGCTCTCACG TGAAACAGAG
GGAGTGATCA CCCCGCGCAG TCCGCGGTTG GTGCTCGATG CTCCGGCGAT CGTTGAGCGC
AACGAACGGG CTACCATCAC GATGTGGTTG CGCACAGCCG ATGGTGCAGC GATTGGGAAT
ACGTTGGTTG AACTTGAAGT TCGCCGTAAC CCAACCGACC CGCCACTGAT CGTGCGTCGT
GTCCGTACCA ACAATGACGG CCAGACCAGC GCCGAACTCG TACCACTCGC ACCGGGCCGC
TACGAACTCA CCGCCCGCGC AGGTACGGCT CTCAGCCGTC ATTCCTTGTG GGTCGCCGGA
TTCGGCCTGT CTACTGCCGA ACCACAAATC ATCCCCGATC GCTCGTCCTA TACGGTGGGC
GAAACGGCTC GTGTGTTGAT CACCGGACCG GCCGGCGGTA GAACGCTCCT CCTCGTGATC
GGTCAAGGCG CTACTGCTCA AACCATCATT ACTGCTGCGC AGCCCGGTAC GGTGCTCGAT
ATACCGATAA GCGCCGATCT GGCGCCAATC ACCCTCCTCA CCGCCCTTAT TGACGATGGC
GTGCGGCATT GGATGACCTC TACCACGATC TCCATCGATC CGCCACCGCC ACCAGAACTG
AACATCAGCC AACCCGATGT CTTGCCCGGT GCGACGGTCA CGTTCACGGT AACGGCAGCA
ACCGATACCC TGCTGGTTGT CCTCAGTTTG CTCCATTCGC CGCCGGTCGA CCTGACACCC
TGGAATCAAC CTCTCGCACC TTTAACCCGC GCGCCAGGGC AACAGACCGG TCTTGACGGG
ATTATCCTGC CGGCGAGTAT TCAATCTCAC CAAAACCAGC ACACGATCAG TGTGACTATG
CCAAACCAAC TCGGTCGCTG GCGCCTCAGC GTGATTGCGG TTTATCCGAA CGGTATCGCT
ACCATCGCCG GTGCCCTGCT CGACACTAAC CAACCCATTG AAGCGATTGC CATTCCTCTC
CCGGCCCCAC GCCCACCCGA CACGGTGACG GCGACGCTCA TCCTCCGCAA TCTCAGTGGA
CAAGACCGCA CGGTACGCAG TCGGCTTTGG CTCAGTGATG GCATCCTCCT CGACCCGCTT
GAACAGACCA CCACCGTACC AGCCGGCGCA ACCGTCCCGA TTGCCTGGCG CTTACAACCA
CAACCCAATG CCAATCTCGT CGGTTTGCGC TATGAGGTGA TCGATACGAT CAGCTTGCCA
CCGATTGAAT ACGCCATACC GGTATGGCGT GACCCGCCAC TGCCTACTAC CGACCAGACA
TACGTCGCCA CCGACCCGAT TACCATCACC CTCCCCGACG GAAACAACGA AGTCGTCATT
GCAGCGAGTG TCCGTGCCGC TCTGGCCGAC CAGGCCCAAC GGTTGTTGCA AACTACGCCC
CCAACCGCGG AAACGCTCGC CGCTGCCATT GTGATCGGTC GTGAACTCGA ACGTACTGCT
ACCACCACTG CCGAAGCCGA CCGGTGGCGA ACAGCTATCG AGAATGTGTT ACCACCACTG
CGTAGCCTTC GTAACCCCGA TGGCGGATGG GGCTGGTGGC CCAATACGGC CTCCGATCCC
TTCATAACCG CCTTTGTGCT CGAAGCAATA GGCCGACTTC CTTCACCGTC TGCCGAGCGT
CGTGAATTGA GCGAACCGGC CCTAAGCTAT CTACGCCGTA CACGGTTGAC GCAACCGGCC
GATGCTCAGG CTTATATTAA TTATGTCATG TCACTCTACG GTGCAAACCC GACGCCGCCG
ACACTCCCCA CCGGCGCCGG ACCGGCCGGA CGTGCGTTTA CCGCCCTGCA ATTGCCGAAC
GAGCGTGATA CACTGCTCAA TCCGTTGCGC GTTTCTGCCA GCAGTCGTTT ACCGTGGGCC
GGTGCCGAAG GATTGCCACC GAGTACACTC GCCGTCAGTG CGAGTGTCAT ACAGGCCTTA
GCGCAAGATC GTCCCTCTGA TCCACGTCTT ACCCACTGGC GCACGACCCT TATGCGAACC
TGGCAGATCG ATGGTCGGTC AACGCCGTAT GAAGCAGCAC GGGTTGCACT TGCCCTTAAC
ACCACATTAT TGGCCGAAGA TGGCGAGGTG CAGGTCTTCC ACAACGATAC GTTGATAACC
GACCGGCCAT TGGGTGATGT GGCGCGTTTC CGGTTCAACG GTGGCACCCT TCGGATCGAA
CCGGCCCAGA CAGCCGCACT CATCACGGTG CGGAGTCCGG CATCACCTAC GCAGCCAAAC
AACGTGCGTG CCCGCCTGCA ATACCTCACC AATACCAACT CGCTCACAGT CGACCGGCCG
GTACAGATCG AATTAATTGT GATCACCACT CAGCCACTTT TCCGGCTTGA TGTTGCCGTA
CCGCTCCCGG CCGGTCTGAC TCCGCTCGCA GTTGACGCCG GTACTGAACT TACCGTTCAA
CAGATAGATC GTGAACGGCG GCAAGTGCGG CTCGGCGGTG TACGATTAGC GCGTGGCGTG
TATCGTATTC TCATCACGGC ACAGACCACC GCGACCGGTA GCTTTACTGT GCCATCGGCA
TTTATCAGCA TGCCGGGAAG TGATTTAGCT CCGGTCGTAG CAGAATGGCA AACGATGATT
GCGATTACGC CAAAAATTGA CGAATAA
 
Protein sequence
MLFGVVVSLL GGLAIRWFAP ALPWAETPTV QLRDPLPQSQ VVPPRSTITL VFSTPMNPFT 
VVRALRIDPP IAGKLEWSED RRSVRLIPDQ PLQPATTYRV QVMTNAQSQW WRPLARSLDV
EFTTAAQPTV TAALAPQSRT APIALIFSRP MVDPDTIGQP ASLEQIRISP PLQLNGEWLD
RQTLLLQPAT AFTAITTYTL TLDANLRDAR GIELGQPFQW QFTTPWPILL EQTPLPDEQW
VNPQQPLTLV FDAPIDTRLL SSALTITPAV SGNFSSFTDG DRYITIFTPH NGWSPGQTYQ
IRLQPDPSGE PVAAWRFTVE PEPLLIASFP GQGQTLAPGQ AIRLIFSTPM DEAELRAGLR
IDPPVVKLEL EVDENRITLQ PLLQASTTYT ITIAAGTRDR SGVPLAHDVS LTIRTASASP
QLQVIGEIVL SFPANVQPVV TIERMNLSVI DGQLYQLDPS TLIRALSLRP NEWASFVPER
YGQPLVRAWR ELLNDPADTL VRSLLPITAN SAGDPLPPGA YYLRMTASAG LRADRLLLIS
SLNLSLLVNN DELLLWVTTG GTGTPAGNIP LTVYTGETVL ARGTSDDQGL WRVPLTGLST
GNSSSPLPII ALAEGNGLTL ARAELTTTQP RVQALLAPDR LSYRPGGVVR INGVARAQQP
DGRLMLPTSG NCTIQLDGPN LNDQPPAVDC TVSNTGIVSG SLQLNARTTP GPYTATVVIG
DSVYRLPIRV RGPSTGTAVR IMPARPAGLA VDVTRAGLPV SGATISWTLR LETLSVAELD
GVNGVVIGGV SENQASATTD ASGRAMINLP ADENLLRPLR YQLALTVLLP DGEQLSRETE
GVITPRSPRL VLDAPAIVER NERATITMWL RTADGAAIGN TLVELEVRRN PTDPPLIVRR
VRTNNDGQTS AELVPLAPGR YELTARAGTA LSRHSLWVAG FGLSTAEPQI IPDRSSYTVG
ETARVLITGP AGGRTLLLVI GQGATAQTII TAAQPGTVLD IPISADLAPI TLLTALIDDG
VRHWMTSTTI SIDPPPPPEL NISQPDVLPG ATVTFTVTAA TDTLLVVLSL LHSPPVDLTP
WNQPLAPLTR APGQQTGLDG IILPASIQSH QNQHTISVTM PNQLGRWRLS VIAVYPNGIA
TIAGALLDTN QPIEAIAIPL PAPRPPDTVT ATLILRNLSG QDRTVRSRLW LSDGILLDPL
EQTTTVPAGA TVPIAWRLQP QPNANLVGLR YEVIDTISLP PIEYAIPVWR DPPLPTTDQT
YVATDPITIT LPDGNNEVVI AASVRAALAD QAQRLLQTTP PTAETLAAAI VIGRELERTA
TTTAEADRWR TAIENVLPPL RSLRNPDGGW GWWPNTASDP FITAFVLEAI GRLPSPSAER
RELSEPALSY LRRTRLTQPA DAQAYINYVM SLYGANPTPP TLPTGAGPAG RAFTALQLPN
ERDTLLNPLR VSASSRLPWA GAEGLPPSTL AVSASVIQAL AQDRPSDPRL THWRTTLMRT
WQIDGRSTPY EAARVALALN TTLLAEDGEV QVFHNDTLIT DRPLGDVARF RFNGGTLRIE
PAQTAALITV RSPASPTQPN NVRARLQYLT NTNSLTVDRP VQIELIVITT QPLFRLDVAV
PLPAGLTPLA VDAGTELTVQ QIDRERRQVR LGGVRLARGV YRILITAQTT ATGSFTVPSA
FISMPGSDLA PVVAEWQTMI AITPKIDE