Gene Cagg_1829 details

Gene Information

Locus tag: Cagg_1829
Symbol:
ID: 7267741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2241914
End bp: 2243962
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 58%
IMG OID: 643566667
Product: cellulose synthase subunit B
Protein accession: YP_002463162
Protein GI: 219848729
COG category:
COG ID:
TIGRFAM ID:
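
The coordinates and lengths in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated on this page) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2241914
END_BP = 2243962


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, not counting the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2049 682
```

Both results match the Gene Length and Protein Length fields listed above.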


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTGT GGCGCTGGTG GATGGTAAGT ATTCTTGCAT TTTGGTCGAT CTGCTTGCCG 
GCGCCGGTTG TTGCTCAAGG AACGGCATTA CGCTTTGCCG ATCTCGGCTA TGGTGATCGT
ACAGCCCGTG GCATTGATGC GGTGCTTGAC TACTATTTTC CGATACCACT TGGCTTACAA
CCGGCAAGTG ACGGTGTGTT GACCTTGCGT TTCACGCATT CATCGTTGCT CAAGCCCGAT
CGTTCGACGC TCAGTGTTGC GTTGAATGGG CAATCGCTGG CGAGTATCCG TCTCACCGCC
GATAATGCTG AAAACGGCCA GTTGACCGTC TCTTTGCCGA TTACCGGCTT TAACGGACCG
GGTTTGTTCC TCCAAGTGCA ATTTCATATG CGATTGACCG ATGACCCGTG TGAGGAGGTG
CAAAATCCGG CGCTATGGGC GGTCGTGAGC GGTGATTCAA CCCTCCGCCT GGATGTACAG
CCAGTGACGG TAGGGACCCT CGCCAATGTA ACTGCGCTCT TCGCACCTTT ACCGTTGAGC
GCGCCGGCAA CTCGCTTACC GCCGACTATG GTGTTGTCTC CACCAACCGA CCCGGCAACC
CTGAACGCTG CCGGTACAGT GGCGTTCGCC GTGGGCCGGT GGGCAGCATT GGCCGGACAA
GACCCCGTGA TCAACGTTGC CGAGACGATC CCACCGCAGG TACCGGCCAT TGTTGTCGCT
TACGGTGCAT TACCGGATGG CGATTGGGGA ACGGTACGCT GGAACGGAAA CACGTATGAA
GTTGACGGAG TACCCCTACC GGTTGACCAT GGGGTACTAG CGTTGGCCCT GGTCACACCA
CCGCGCTTAT TGGTGGCCGG CGCCACACCG ACTGCGCTTA CCTATGCAGC GCAAGCACTC
ACCCACGTCT TGCCGGCGGC ATCCGTGTTA GCCGTCACCC AGCCACCACC GACTACCGTT
GCAGCAGCGT GGCGTGAGGG AGCTGCTAGC TTTGCCCAAC TTGGAGTAGA GCGACGGCAG
GTGGTTGGCG CCGGTGAACA TCGCATTGAT ATTGCCTTCG AGCGACCACC TGCTTGGGAT
CTGCGGGTTG GGGCGACGCT AGAACTGCAC GTTGTCACGG CTGCCGGCTT AATGCCCGAC
TCGTCGTGGC TGGCGGTAGC GGTCAATGGA ATAACGATCG GTTCACAACG CCTCCGCGTC
AATTCTACGG CCATCGAACG GTACCGGTTT GAGCTTCCCG CCGATCTTCT CAACAGCGAT
TTGAACGGTA CTCCATTACG TCGCATCGAT CTACAGGTAC GGCTCTACCT TGATCTCCCC
AATATCGGTT GCGAAGAGGT GGATACGACA GCAGCATGGG CGATCATCGA ACCAACATCG
GTGTGGCGCT TACCGAACGA TCCGGCAGCA AGCGATGATC TCGGTCGGTT TCCGGCAATG
TTGGATAGTG AGCAGCCGGC GCGGCTGATC CTCCCGCCAC AACCGGACCT GAGCGAGGTA
CAGGCCGGTC TCGAATTGGG GGCTGCAATC GGGCGATGGA GAGTCTTACC CGATCTACCG
CCACCAATGG TACTGACAGC CGACACCCTT GGCGATGATC GCGGTGGGCC GCTGGCCGTA
TTGGGTGACC GCAACCGCAA TCCGCTCGCA ACTGCTCTGA ACTCTCCTAC GAATCCACCG
TTTGTCTACC AACCCGGGCG TAGTACCCAA GCGACGCTGA GTGTCGTTCG CTCGCCGTGG
CAAGCCCAGG CACGAGTATT ATTGATCGAA GCGACCGACG GTAAAGGGCT GCAATTGGGG
GTACGCAGTC TGCGGGAACG GGATTTACTA CAGGTCTTAC GCGGGTCACA AGCCCAGATC
AGCAGTGATC TTGACATCAC CGTCGTACCG TTGACGACAC CGTTAGCTCC GCCGCCGCAA
ACCTTGACCC CAAAGATTGA GGTAGCGCTG CTCGAACGAT TCCCTGTCTG GCAAGTGATC
GGTGCCATTG TCTTCATTGC CCTGCTGGCT ACGGCAATTC TCGTGATACG CATCCGGTGG
TGGCGGTAA
 
Protein sequence
MIVWRWWMVS ILAFWSICLP APVVAQGTAL RFADLGYGDR TARGIDAVLD YYFPIPLGLQ 
PASDGVLTLR FTHSSLLKPD RSTLSVALNG QSLASIRLTA DNAENGQLTV SLPITGFNGP
GLFLQVQFHM RLTDDPCEEV QNPALWAVVS GDSTLRLDVQ PVTVGTLANV TALFAPLPLS
APATRLPPTM VLSPPTDPAT LNAAGTVAFA VGRWAALAGQ DPVINVAETI PPQVPAIVVA
YGALPDGDWG TVRWNGNTYE VDGVPLPVDH GVLALALVTP PRLLVAGATP TALTYAAQAL
THVLPAASVL AVTQPPPTTV AAAWREGAAS FAQLGVERRQ VVGAGEHRID IAFERPPAWD
LRVGATLELH VVTAAGLMPD SSWLAVAVNG ITIGSQRLRV NSTAIERYRF ELPADLLNSD
LNGTPLRRID LQVRLYLDLP NIGCEEVDTT AAWAIIEPTS VWRLPNDPAA SDDLGRFPAM
LDSEQPARLI LPPQPDLSEV QAGLELGAAI GRWRVLPDLP PPMVLTADTL GDDRGGPLAV
LGDRNRNPLA TALNSPTNPP FVYQPGRSTQ ATLSVVRSPW QAQARVLLIE ATDGKGLQLG
VRSLRERDLL QVLRGSQAQI SSDLDITVVP LTTPLAPPPQ TLTPKIEVAL LERFPVWQVI
GAIVFIALLA TAILVIRIRW WR
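
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool that produced this record: it strips the 10-character block spacing and applies the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in permitting alternative start codons, which does not matter here because the gene starts with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate("ATGATCGTGT GGCGCTGGTG GATGGTAAGT"))  # prints: MIVWRWWMVS
# Final nine bases of the gene sequence, ending in the TAA stop codon.
print(translate("TGGCGGTAA"))  # prints: WR
```

The two outputs agree with the first block (MIVWRWWMVS) and the last two residues (WR) of the protein sequence. Passing the full gene sequence to `translate` applies the same check to the whole protein.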