Gene Cagg_0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0204 
Symbol 
ID7269118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp254921 
End bp256819 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content56% 
IMG OID643565073 
Producthypothetical protein 
Protein accessionYP_002461588 
Protein GI219847155 
COG category 
COG ID 
TIGRFAM ID[TIGR02226] N-terminal double-transmembrane domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTCC TCACACCGTT AGCTCTGCTT GGTGCAGCGA TTGTTGGCCC GATTATCGTG 
GCGATGTATC TGCTCAAACT GCGTCGCGAA GAGCGGGTGA TTTCATCAAC GTTTCTCTGG
CAACGCATGG TGCGGGATGT TGAGGCAAAT GCACCATGGC AGAAATTGCG CCGTAATCTC
CTTCTCTTGT TACAGTTGTT GCTGATGTTG TTGCTGGTGT TGGCATTAGC CCGCCCGTTC
CTGCCGGTCA CCGGTATTAG TGGTACCAAC CTGATCATTA TCTTCGACCG TTCGACCAGT
ATGCTGGCAA CCGACGAACC ACCGACCCGC TTGGAAGCTG CGCGCCGGCA GGCACTTGCA
TTGATCGACC AACTACCCGA TAACGGCCGG GCAACGATTA TCACGATCGG AGGACAGATG
GAAGCGCCTA TTGCCGCTTC CAGTGACCGC CGCGAGCTAC GTCGCATTAT TGCGAATATC
CAACCGAGTT ATAGTACCCA GTCCGATCTA ACGCAAGCAC TGACCCTCGC TTCGGCGCTG
GCTGCTCGCG AACCTGATAG TGAGGTTGCA ATTATCTCTG ATGGAAATGT CACGATCCCC
GATGGGTTGC GGGTACCGGC ACGAGTACGT TACTTTCCGA TCGGACGTGC CTCAGATAAC
GTAGCCATTA ACGCGATTGC GCTGCAACCC GGTCCGTTCG AGCAGACCCT CTTTGTACAG
GCGATCAATT ACGGGAATAG TCCGGTGACT CGTCGCCTCT CACTCTATTA CGATGGGATA
TTAGCGAATG CTGTTGATTT GACCATCGAT CCAGGGCGTG AGCAGAGTTG GACGGAGACG
CTCCCGACCA CGGTGGCGGT GGTCGAAGCG CGACTTGACG AAAATGGTGA TGCGCTGCCG
GTAGATGATC GGGCGTATGC GGTTAGTCCG CAACGTGAAA CGGTAAAGGT GCGACTGGTC
AGCGATGGTA ATCGCTTCCT CGAGACCGGA CTGTCGCTGT TGCCGGGGCT TGAAGTTACG
CGCGTGCCCA CTACTACCCT TACATTCACG GAGAGTGCTG CGGAGATTCC GCTCACTATT
TTCGATGGCG TCACCCCAAC CGAGCTACCA CCCGGTAACC TGCTCTTTAT CGGCCCGTTG
CGGAGTACCG AACTCTTTAC GATCACCGGT GAATTTGATT TTCCCCTCAT CCGACCGGTG
GCGCTCGAAG ATCCACTGCT GCGCAATGTG CGCTTATCCG ATGTCAATAT TCTGCGCGCA
CTACGCATTG TTCCGGGATC GTGGGCGCGT GTCATTGTCG ATAGCGACGG TGGCCCGCTG
CTGTTGGCCG GCGAGCGTGA AGGGCGTCGG ATTATCGTAC TGGCGTTTGA CTTACACCTC
TCCGATTTTC CACTCACCAT CGGATTCCCA TTGTTACTCT CCAATATGAT TGACTATCTC
CTGCCGGTGA GTAGTGTGCA ACTGACGACC GGTCAGCCGA TTGTGGCCCC GGTTGACAGC
AGTATCGAAG AGGTGCGTGT CATTCGTCCT GACGGTCGGG TGGCGAGTTC ACGTGATGGT
CAGGTACAGG TCCAGGCCAA TCAAACCTTC TACACCGACA CCGAACTGCC CGGTATTTAC
ACGCTCGAAG AACGGCGAGG TAATGAGGTC ATCACCAGTC GTCGCTTCGC CATCAACCTA
TTTGCCCCCA ACGAATCACA AATCACGCCA CAACGCGATC TTGTTATCCC ACAAATCAGC
GGTGCGCAGA GCACGGTTGC CCGTGAGCGT GATGGTCGCC AAGAAATCTG GCGTTGGTTG
GCACTCGCGG CGTTGCTCGT CTTGCTTATC GAATGGCTGT ACTATCAGCG GAATACACTG
ACGCTGCTAC GAGAACGTTG GCGACGGCGC ACGGCGTGA
 
Protein sequence
MSFLTPLALL GAAIVGPIIV AMYLLKLRRE ERVISSTFLW QRMVRDVEAN APWQKLRRNL 
LLLLQLLLML LLVLALARPF LPVTGISGTN LIIIFDRSTS MLATDEPPTR LEAARRQALA
LIDQLPDNGR ATIITIGGQM EAPIAASSDR RELRRIIANI QPSYSTQSDL TQALTLASAL
AAREPDSEVA IISDGNVTIP DGLRVPARVR YFPIGRASDN VAINAIALQP GPFEQTLFVQ
AINYGNSPVT RRLSLYYDGI LANAVDLTID PGREQSWTET LPTTVAVVEA RLDENGDALP
VDDRAYAVSP QRETVKVRLV SDGNRFLETG LSLLPGLEVT RVPTTTLTFT ESAAEIPLTI
FDGVTPTELP PGNLLFIGPL RSTELFTITG EFDFPLIRPV ALEDPLLRNV RLSDVNILRA
LRIVPGSWAR VIVDSDGGPL LLAGEREGRR IIVLAFDLHL SDFPLTIGFP LLLSNMIDYL
LPVSSVQLTT GQPIVAPVDS SIEEVRVIRP DGRVASSRDG QVQVQANQTF YTDTELPGIY
TLEERRGNEV ITSRRFAINL FAPNESQITP QRDLVIPQIS GAQSTVARER DGRQEIWRWL
ALAALLVLLI EWLYYQRNTL TLLRERWRRR TA