Gene Cagg_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3357 
Symbol 
ID7267097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4068950 
End bp4070950 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content56% 
IMG OID643568166 
Producthypothetical protein 
Protein accessionYP_002464637 
Protein GI219850204 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00235574 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000266176 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGACGAT TCCTGCTTAT TATTGCTCTC ACCCTGATCA CCGGCTGTAT GCCATCGGTC 
GAGAGACCAC CGGAAGACTA CCGACCTACG GCGACACCAC GATCCGCTAC CGGTACCGCC
CACGCTCCGC CACCACCACC GACTCCGGTA GCGATGGTTA CTCCCGCACC GAACAACGAC
GTGAATGAAA TCGCGCTGAT CGATGCGGCT GCTCGTCTGC CGCGCGATCA AGTTGAGCTA
GCCCGCCAAC TCGGCGCGTG TCGCCCGGCA CCAGAGGAAT GCCTGTATGT AGCGCGTACT
ATCCCCCCTG ACGTGCAGCT TGGCGAACGC CGATCTTTTT CTGTAACCGA TTTTAGTAAC
GATAGCCAAT ACGAAATCAC TGCTGATCTA CGCTACATTG GGCCGGTGGT GTTGATGTAT
GTCGAAACCG GCGTACCTTT TGATCAGGGC GCATTGGAAC GTGCTGCGCG TACCTTTGAA
CAAGAGATTT ACCCGCGTAC CCGCGAGATC TTCGGCAGTG AGGCACAACC CGGGGTTGAC
GGTGACAACC GGATTACCAT TCTGAACGCG GTAGAGCGCA GTCGTCAGAT CCTCGGTTAT
TATTCGTCGA GCGACTCATT ACCGAAACAG GTCAACCGCT ATAGCAATGA GCGTGAGATG
TTCTTTATGA ACATCGAGCT GATGCCTTTC GATAGCGATA CCTACCTCGA CGTGCTGGCC
CACGAATTTC AACACATGAT CCATCAGCAC GAACAGCCGG GCAGTGCCTT GTGGCTCAAC
GAGGGAATGT CACAATTGGC CGAAGACCTC AACGGCTTTC AGAGCGAAGG CTTCATTCCG
CTCTATCTGC GCAATACCGA CATTCAATTG ACCGGGTGGG GCTTTGCGCC CGGCCAGTCA
GGCGTGCATT ACGGCGCTGC TCACCTCTTT ATGCGCTACA TCTATGCCCA ATACGCCGGC
AAAGACCAAT TACGCTCGTT GATTCGGGCC AACGCCGGTA ACAATCTCGA AGCGTTCGTC
GAGTTAGCCG CCCGTGTTCG ACCCGATATT ACGCACTTTC GGCAGATCAT GGCCGATTGG
GCCGTCGCCA ACTTACTTAA CGACCCACGG GTAGGCGATG GTCGTTACAC CTACGATACC
GGTACCGAAT TGAGGAATCT CCTGCCGCAC ACAGTACGCC CAACGCCGGT CGAGCGGCGG
CATCAAGACG ATATTGTGCA GTTCGGCGTT GATTACCTTG CATTACCGGC GAACGCGCGC
TCGATAACCT TCCGTGGCGA CACTACCGTG CGCATTGCCG GACAGATGCC ACAGGGACGC
TACGCCTGGT GGAGCAACCG TAGTGATGAT AGCATTGCAA CGCTCACGCG CAAGATTGAT
CTACGTGGGG TCAGTTCAGC GACACTCACC TTCGACACGT GGTTCGAGAT CGAAGACGAT
TATGACTACG CTTTCGTCAC TGTTTCGACT GACGGCGGGC GGACGTGGGA GACCCTACCC
GGCAAGTGGA CGACCGACTA TGATCCACAA GGCGTGAATT ATGGTCACGG TCTGACCGGT
GTTTCGGGGA GACCGGAGGC CGACGTTGAA GACGGCTTGC GCGGGCGTTG GGTCAACGAG
CGGATGGATT TAACCCAGTT TGTCGGCCAA GAAGTGTTGT TGCGATTTTG GTCGATTAAT
GATCAGGGAG TACATGCACC TGGTATCTTG ATTGATAACA TTACCATCCC CGAAATCGGT
TTTCGTGATA CCGTCGAAGA GGGGGAGAAT GGCTGGGAAG CAGCAGGATT TGTGCGGGTC
GATGGCGATC TGCCGCAGCA ATGGGATTTG CACCTGGTGC GCACGGCGGC CAATGGACAG
ATCACCGTTG AGGCATTGCC GGTTGATGAA GACGGTATCG CAACGGCAAC GCTGAATGAC
GGTGAGCGGG GGGTATTGGT GGTGATTGCC GTCACGCCGC ACACGAGTGA ACGGGTGCAA
TACGAGGTTA TCAGCGAATA G
 
Protein sequence
MRRFLLIIAL TLITGCMPSV ERPPEDYRPT ATPRSATGTA HAPPPPPTPV AMVTPAPNND 
VNEIALIDAA ARLPRDQVEL ARQLGACRPA PEECLYVART IPPDVQLGER RSFSVTDFSN
DSQYEITADL RYIGPVVLMY VETGVPFDQG ALERAARTFE QEIYPRTREI FGSEAQPGVD
GDNRITILNA VERSRQILGY YSSSDSLPKQ VNRYSNEREM FFMNIELMPF DSDTYLDVLA
HEFQHMIHQH EQPGSALWLN EGMSQLAEDL NGFQSEGFIP LYLRNTDIQL TGWGFAPGQS
GVHYGAAHLF MRYIYAQYAG KDQLRSLIRA NAGNNLEAFV ELAARVRPDI THFRQIMADW
AVANLLNDPR VGDGRYTYDT GTELRNLLPH TVRPTPVERR HQDDIVQFGV DYLALPANAR
SITFRGDTTV RIAGQMPQGR YAWWSNRSDD SIATLTRKID LRGVSSATLT FDTWFEIEDD
YDYAFVTVST DGGRTWETLP GKWTTDYDPQ GVNYGHGLTG VSGRPEADVE DGLRGRWVNE
RMDLTQFVGQ EVLLRFWSIN DQGVHAPGIL IDNITIPEIG FRDTVEEGEN GWEAAGFVRV
DGDLPQQWDL HLVRTAANGQ ITVEALPVDE DGIATATLND GERGVLVVIA VTPHTSERVQ
YEVISE