Gene Cagg_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2604 
Symbol 
ID7267195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3184372 
End bp3186513 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content58% 
IMG OID643567430 
Producthypothetical protein 
Protein accessionYP_002463909 
Protein GI219849476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0826455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0858615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGC TCATCATCGT CTTGATCGGG ATCGTTCTGC TCGTCAGTTG CAGCCCGCCC 
CCTCCCAACC AAACCGACCA ACTTACCGAA CCATTTCGCA GCGCATGGCA AGCCGCAGGT
GGTCCGAGAA TCGGCCCACC CCTGGGCGAA CCACGGTGGG TTGATGATAC GCTCGTTCAG
TATTTTGCAA CCATCAGGAT CGTTGCGCTC CCGGACGGTG GGGCTATCGC CGAACGGTTG
CCCGCCGATT GGCGCTCGTC TGTACCGCGT CCGGTCATTG AACTGCCGAC GGTTCCACAA
CGCGCTTCAT TAGCCCTTGC TACGCCAACG GATGTTATGC AACCGCTCCA ACCGGTACGT
ATCACCCTTC GCATCCCGGC TTACAGCGGC CCGACGACAG TTCAGCTCTA CGATGTCGCC
GGCCATCTGA CAACACAAGG TGTCACGACG GTGAGTGATG GCCTCGGCGA GATAACCCTG
TTCGCCGGTG GCACTCTTGG CCCCCAGTGG GCCGTCGCCC TGATCGATGG TCGGTTGGCC
GGTGCCCATA GCCGCTTGTT TACCCTCGAT GCTGAAACCT TTTTGAGCAG TGGTTCAAGT
GACATCGACT CGCTCTATCC GCGTATTCGC CGTCTCATGG CCGAGGCAAG GGTAAGCTAC
GAGTTGAACG GTCGGTTGAT CGGCGGTTAT CGCTCACCCG ATAATCCGCT GCTCTGGCTC
CGCGATCACG TCTATCAAGG ACGAGGGTTT CGCTATTTTG AAACCGATGT CACCAGCCTC
CTCGACGCCT TCCGCGATGC ACAATTGCCC GATGGTAGCC TGCCTGATGT GATCGATTAC
CCTGAGCGGT ATGTGCAGGC CTTCCGTAAA GAGGTTGAGT CCGATGTTGA GTTTTTGTAC
GTGCAGGGTG TGTACGAAGC GTGGCAGATG ACCGGTGATG ATGAGTGGTT GCGTTCCCAT
TTGCCCGCAT TGCGACGGGC TATCGAGTAT ATAACAACTA ACCCGCTACG TTGGAACGCT
GAGCGTGGTT TGGTCAGACG CCCGTATACG ATTGATATGT GGGATTTTGC CTACGGCCCA
ACCACAATGA GTCCTGATGG TAAACCGGCT CCACGCCACT GGATCGGGCC GGATACGATT
TGGGGCATGT TTCACGGGGA TAATACCGGC TTGGCGTATG CCCTCTTTTT ACTTGCCCGA
ATTGAAAATC GGGTCGGTGA CCCGACACGG GCTAAACGCT ATTTCGATCT GTCTGACCAG
ATTATGCAGC GCTTGAATGC GTTGGCGTGG AATGGCCGCT TTTTCACCCA TTTCATCCCC
GAAGATGCAA CGTTTGTCCC GGCAGGGGTT GATGCGGCTG CACAACTGAG CCTCTCCAAT
GCCTATGCCC TCAACCGGCG GGTGCTGTCG GTGGGTCAAG CGCAGGCGAT TGTCGAGACG
TACTATGCCC GGCGCGATTT CACCCGTGCC TTCGCCGAAT GGTACAGCAT CGACCCGCCG
TTTCCGCCGG GTAGTTTTGG CATGGCCGGT GGTAAGGGTG AACAACCCGG TGAGTATGTC
AACGGAGGGA TTATGCCGCT CGTCGGGGGT GAACTGGCGC GGGGCGCATT CGCCTTCGGC
TTCGAGCCGT ATGGTCTCGA TATTCTGCGG CGCTATGCCA ATCTGCTGCG CCTGACCAAC
GCTTCGTATT TGTGGTATTA CCCCGATGGA CGACCCGGCA TTTCCGGCCC TGATACCATC
CCCACCGATG GCTGGGGGGC CAGTGCAATG CTCGGTGCGC TGTTCGAGGG CTTGGCCGGT
GTGTTCGATG ATGCATCCCG TTACGAAGAG GTGATCATCA GCCCGCGCTG GCCGGTCGAA
CCGACGGTCA AGCAGGCTTA TGTCGTGACG CGCTACCCTG CCTCGACCGG CTATGTCGCC
TACCGTTGGC AGCGCGATGA TCGGTCGCTC CGTTTGTTGA TCACCGGGAG CGGACGACAG
GCGACGGTGC GCTTCCTTCT GCCCACTGAT GTCGGTGATC GGCTGACGAT GATGGTAGAC
AACCAACCGG TCACACCGCT GATTGAGGTG ATCGGCAATA GTCGGTATGC TAGCGTTACG
CTTGACCGGC TCAATGTTGA GGTGATAGTG ACGTGGCCGT GA
 
Protein sequence
MKQLIIVLIG IVLLVSCSPP PPNQTDQLTE PFRSAWQAAG GPRIGPPLGE PRWVDDTLVQ 
YFATIRIVAL PDGGAIAERL PADWRSSVPR PVIELPTVPQ RASLALATPT DVMQPLQPVR
ITLRIPAYSG PTTVQLYDVA GHLTTQGVTT VSDGLGEITL FAGGTLGPQW AVALIDGRLA
GAHSRLFTLD AETFLSSGSS DIDSLYPRIR RLMAEARVSY ELNGRLIGGY RSPDNPLLWL
RDHVYQGRGF RYFETDVTSL LDAFRDAQLP DGSLPDVIDY PERYVQAFRK EVESDVEFLY
VQGVYEAWQM TGDDEWLRSH LPALRRAIEY ITTNPLRWNA ERGLVRRPYT IDMWDFAYGP
TTMSPDGKPA PRHWIGPDTI WGMFHGDNTG LAYALFLLAR IENRVGDPTR AKRYFDLSDQ
IMQRLNALAW NGRFFTHFIP EDATFVPAGV DAAAQLSLSN AYALNRRVLS VGQAQAIVET
YYARRDFTRA FAEWYSIDPP FPPGSFGMAG GKGEQPGEYV NGGIMPLVGG ELARGAFAFG
FEPYGLDILR RYANLLRLTN ASYLWYYPDG RPGISGPDTI PTDGWGASAM LGALFEGLAG
VFDDASRYEE VIISPRWPVE PTVKQAYVVT RYPASTGYVA YRWQRDDRSL RLLITGSGRQ
ATVRFLLPTD VGDRLTMMVD NQPVTPLIEV IGNSRYASVT LDRLNVEVIV TWP