Gene Cagg_1468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1468 
Symbol 
ID7269301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1802249 
End bp1803592 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content55% 
IMG OID643566310 
ProductCBS domain containing protein 
Protein accessionYP_002462809 
Protein GI219848376 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0356908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGGACC CCGGCCCTAG TTCGATAATC ATCGGCATCG GCCTCTGTCT TATTATGCTC 
GGTATTACCT CGGCTGCCGA TACGGTCATG ATGATGGTTA GCCGTCCTCG TCTGCACGCC
CTTCTGGCTT CGGCCGGTCT TGGTAGCCAA CGCTTCACTG CCCATTTTCT CGATGAGCCA
TACCGGATCA AGTCAGCCAT CATCTTTCTC AATACGGCGC TGACGATTAT GGTAACAGCG
CTGACCATTC ACCTCACGAT GCCGTATGGT TTGACCGTCG TCATCGGTGG TATGGCAATC
CTCTTGTTTT CCATCCTGCT CCTGAGTGAG GTTATCGCAA AAGCGCTCGC TCGTCGCAAC
CCGGATACGA CTATTCTTGT ACTGGCCCGC CCACTGGTTG CAGTCGCAAC GATTTTGTGG
CCGTTAATGG CTATCATCAA TGTTATCACC CGACCGATCT TTACCCTCGT GAGTGGTCAG
CCGGCGCCAC CGGCGCCGCT CGTAACCGAA GAAGAGCTGC GCTTGATGAT GAGTGCCGGT
GAAGAGGCCG GCTGGATCGA ACACGAAGAG CGCGAAATGA TCGAGGGGGT GATGGACTTT
GGCGACACCT TGGTGCGCGA GATTATGATC CCGCGGGTCG ATGTCGTAGC GCTCGAAGTC
AATAGTTCGC TCGATCGAGC ACTCGATGTA GCGATTACAC GCGGTCATTC ACGGATTCCG
GTCTATGAAG AGACTATTGA TAATGTGGTT GGTATTTTGT ATGCCAAAGA CCTGATCCCC
GTGTTGCGCG ATGGCCGGCG TGATACGCCG CTACGCGATC TGATCCGTCC GGCCTACTTC
GTCCCAATGA CGATGAAGGT CACTGCACTG TTAGAGGATC TCCAACGCCG GCGCGTCCAT
ATGGCAATTG TCGTTGATGA ATACGGTGGC ACCGCCGGAA TTGTCACGCT GGAAGATCTG
CTCGAGCAGA TTGTCGGTGA AATTCGTGAT GAATATGATA CAGAAGAACC GGCGATTGTC
GAGGTAGGGC CACACGAGTT CATTGTCGAT GCGCGCGTCC CGATTGATGA CATCGCCGAG
TTGCTTGAGG TCGAATTCCC GGCTACTACT GCCGATCGGA TCGGCGGCCT CGTTTACGAG
CAACTAGGTC GTATTCCGCG GGTGGGGGAT GAAGTAACGT GTGGTGATGT TACCATCACG
GTATTGTCCA TCAAAGGCAT TCGCGCTGAA CGCTTACGTG TCATTCGCCA ACAGCCGGCC
CAGAACCAAG CGGCAGCACC GGCTGAGGCA GACAAACCGC TGTTGCCGCT CCCGCAAGAA
GTGCATGGAT CCAGTGGACC TTGA
 
Protein sequence
MEDPGPSSII IGIGLCLIML GITSAADTVM MMVSRPRLHA LLASAGLGSQ RFTAHFLDEP 
YRIKSAIIFL NTALTIMVTA LTIHLTMPYG LTVVIGGMAI LLFSILLLSE VIAKALARRN
PDTTILVLAR PLVAVATILW PLMAIINVIT RPIFTLVSGQ PAPPAPLVTE EELRLMMSAG
EEAGWIEHEE REMIEGVMDF GDTLVREIMI PRVDVVALEV NSSLDRALDV AITRGHSRIP
VYEETIDNVV GILYAKDLIP VLRDGRRDTP LRDLIRPAYF VPMTMKVTAL LEDLQRRRVH
MAIVVDEYGG TAGIVTLEDL LEQIVGEIRD EYDTEEPAIV EVGPHEFIVD ARVPIDDIAE
LLEVEFPATT ADRIGGLVYE QLGRIPRVGD EVTCGDVTIT VLSIKGIRAE RLRVIRQQPA
QNQAAAPAEA DKPLLPLPQE VHGSSGP