Gene Cagg_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0002 
Symbol 
ID7268998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1767 
End bp3164 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content55% 
IMG OID643564874 
Productpeptidase S41 
Protein accessionYP_002461391 
Protein GI219846958 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000027007 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCATA GACATATTCT CCTCATGCTA TTGGCCGCGA TCATTATGAG CGGCTGTACC 
TTTGGGCAGA ACCCCGCAGA ACCTGCCCTT ACCGGTGCTC GACCGGCAGC ACCGGCAACT
CGTATTCCTT TACCAAGCCC TTTACCAAGT GCCACGCCGG CTCCTACAGC ATCAGATCTC
GCGCGGACAC CAACCCCTAC ACCTCCGATT GGAACGATTG TTCCTACACC AACACTGGCT
CCACTCAGTC GTGAACAACG TCTGCAAATC TTTCAGCAAG TCTGGGAAAA GGTACGCGAC
AACTATGTGT ACCCTGACTA CAACGGACTA GACTGGCAGG CAATTCGGGA AGAGCTACGA
CCAAAAGTTG CTGCTGCTGT AACCCCGGAA GAGTTCTACA GCATCATGCG CGAGATGATT
GCCCGGTTGG GTGACGACCA CTCGCGCTTC GAGTCTCCGC AAGAAGTTGC TGCCCAGCTT
GCCGAGGCCA GTGGTCAACT ACAATACGGC GGCATTGGTG TGAGTGTGCG AACCATTGAT
GAAGGCGGCC TCATCACCCG CGTGGTGCCC GGTGGGCCGG CAGACCAGGC TGGTATTCTT
ACGCGCGATA TTATTGTGGC CGTTAATGGC ATTCCGTTCA ACGATCCCAA CGCATTCGGT
CCAGATGGTG CAATTGGGGC AGTCCGTGGC ATTCCCGGTA CGAGTGTCCG GCTGACGATA
AAGCGCGGCA ACGAGCCGTT ACGAGAGATT GAAGTTGTCC GAGCGGTAAT CGACATTGCC
GTGTTCAATC GAGTTACTGT CGAACGGCTC GCCGGTGACG TTGGCTTGCT CACCATCCCT
AGCTTCTACG TCGACAATGC CGACAGCCAG GCACGTGACG CCCTGACTAA TCTGTTAGCA
GCGGGGCCGG TACGCGGGAT GATTATTGAT GTCCGTGATA ACAGTGGCGG CTATATTCAT
ATCATGCGCA ACATTATCGC CCTCTTTCAC GACGGAGGCA GTATCGGCAC GTCGGTAGGT
CGTAACGAGC GTGAAGAACA GCGCATTCCA CGTGGAAAAA CGATCGCGGG TCTGATCGAC
ATCCCGATTG TAGTACTTAT CAGTGAAGAG ACGGCCAGTG CCGCCGAGAT GTTTGCCGCC
GGGATGCGGG TTTTGTGTCA AGCGACGATT GTTGGTGTAC CATCGGCGGG GAATACCGAA
AACCTGTACG GCTACAACTT CGATGATGGC TCACGGCTTC TCCTGGCTGA AGTCGCCTAC
CAACTTCCCG ACGGTACCCT GATCGAGGGA ACCGGTGTTG TCCCTGATGT CCTGATCGAA
GCGGAATGGT GGCGCTTCCC GCGTGAACAA GACCCACAAC TGCAAGCCGC ACTAGCCATT
ATTCAAAAAC CATCGTAG
 
Protein sequence
MRHRHILLML LAAIIMSGCT FGQNPAEPAL TGARPAAPAT RIPLPSPLPS ATPAPTASDL 
ARTPTPTPPI GTIVPTPTLA PLSREQRLQI FQQVWEKVRD NYVYPDYNGL DWQAIREELR
PKVAAAVTPE EFYSIMREMI ARLGDDHSRF ESPQEVAAQL AEASGQLQYG GIGVSVRTID
EGGLITRVVP GGPADQAGIL TRDIIVAVNG IPFNDPNAFG PDGAIGAVRG IPGTSVRLTI
KRGNEPLREI EVVRAVIDIA VFNRVTVERL AGDVGLLTIP SFYVDNADSQ ARDALTNLLA
AGPVRGMIID VRDNSGGYIH IMRNIIALFH DGGSIGTSVG RNEREEQRIP RGKTIAGLID
IPIVVLISEE TASAAEMFAA GMRVLCQATI VGVPSAGNTE NLYGYNFDDG SRLLLAEVAY
QLPDGTLIEG TGVVPDVLIE AEWWRFPREQ DPQLQAALAI IQKPS