Gene Cagg_2367 details

Gene Information

Locus tag: Cagg_2367
Symbol:
ID: 7268717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2877019
End bp: 2878245
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 56%
IMG OID: 643567196
Product: nuclease SbcCD, D subunit
Protein accession: YP_002463681
Protein GI: 219849248
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
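
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2877019
end_bp = 2878245

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1227 0 408
```

Under these assumptions the computed values agree with the listed Gene Length (1227 bp) and Protein Length (408 aa).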


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0953909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.605774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCA TGTTACACCT TGCCGACTTG CATCTCGGGA TCGAAAATTA CGGCGCGCTC 
GATCCGCGGC GCGGCTTGCA TTCCCGTCTG ATAGATTATC TTGACCGACT TGACGAGGCA
ATTACGGTTG GGCTTGATCA TCAGATCGAT CTTTGCCTGA TTGCCGGTGA TGTGTATAAA
AACCGTTCAC CTAACCCTAC GGTGCAGCGC GAGTTTGCGA CGCGCATCCG CCGTTTACGC
GACGCCAGCG TAGCGGTGGT GATCCTTACC GGTAATCACG ACATCTCACC GGCTCAAGGG
CGCGCTCACT CGGTAGAGAT TTTTGCCACA TTGGCCCTCG AAGGGGTGAC GGTGGCCGAC
CGCCTACGTC GGTATCGGAT TCCCACGCGC AGTGGCGATC TGCAACTGAT TGCGGTGCCG
TGGGTGACAC GCCAAATGTT GCTTACCCGC GACGAAATGG TCGGTGCGTC ATTCGCGACG
ATTGAATATG AATTACGTCG TCGATTGGAG CAGTTTATTG AACAGGCGGT GGCTGCGTGC
GATACAACGA AACCGACGGT GGTCGCGTTT CATGGCACAG TTGAAGGTGC GCAATTGGGG
TCGGAGCGGG CAATGATCTT GGGTCGTGAT CTCAGCTTAC CGCGTTCGAC CTTGGCTCTG
CCCGGTGTCG ATTATGTCGC CCTTGGTCAC ATTCATCGTC ATCAGGTGCT TGGTGAACAG
CCACCGGTTG TCTATCCCGG CAGTATCGAG CGGATCGATT TTGGTGAACG TGATGAACCG
AAAGGATGTG TGCTGGTTGA GCTGGAGCCG GGACAGGCAC GCTGGCAATT TGTGCAACTG
TCGGCACGCC CGTTCGTCAG TATCGAACGT GATCTGCGTC AAAGTAGCGA TCCGGTGGGG
GCGTTGCGTG CCGCCATCAA TCGTCACGAT CTGCGCGAAG CGGTGGTTCG CGTTGAAGTG
CAACTGTCAC GCGAACAGGC AACGTTGCTG CGCGAAGACC ACGTGCGTGA ATGGTTACGT
GAAGCCGATG CGGCAGTGAT TGCGGCAATT GTGTTTGATA TCGAACGTCC GGTTCGTCAG
CGATTCGCCG GGGTTGCTGA AGCGTTACGT GCCGGTCTTA CACCACGACA CGCGCTCGAA
CTCTATCTCA AGAGCAAAAA TACGCCTCCA GAACGGATCG CACAATTGTT GGCTGCTGCG
GATGAACTGA TTGGAGGGGA TACGTAG
 
Protein sequence
MIRMLHLADL HLGIENYGAL DPRRGLHSRL IDYLDRLDEA ITVGLDHQID LCLIAGDVYK 
NRSPNPTVQR EFATRIRRLR DASVAVVILT GNHDISPAQG RAHSVEIFAT LALEGVTVAD
RLRRYRIPTR SGDLQLIAVP WVTRQMLLTR DEMVGASFAT IEYELRRRLE QFIEQAVAAC
DTTKPTVVAF HGTVEGAQLG SERAMILGRD LSLPRSTLAL PGVDYVALGH IHRHQVLGEQ
PPVVYPGSIE RIDFGERDEP KGCVLVELEP GQARWQFVQL SARPFVSIER DLRQSSDPVG
ALRAAINRHD LREAVVRVEV QLSREQATLL REDHVREWLR EADAAVIAAI VFDIERPVRQ
RFAGVAEALR AGLTPRHALE LYLKSKNTPP ERIAQLLAAA DELIGGDT
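
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a hedged sketch of how such a block could be cleaned and checked against the listed protein. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of the record. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not handle.

```python
# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Assumption: sufficient for table 11 when the gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate(clean_sequence("ATGATCCGCA TGTTACACCT TGCCGACTTG")))  # MIRMLHLADL

# Last line of the gene sequence; it begins on a codon boundary (base 1201).
print(translate(clean_sequence("GATGAACTGA TTGGAGGGGA TACGTAG")))  # DELIGGDT
```

Both fragments translate to the corresponding start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.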