Gene Cagg_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1887 
Symbol 
ID7266378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2314601 
End bp2316313 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content58% 
IMG OID643566724 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_002463218 
Protein GI219848785 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTC GCCGTCAGCG CTGGGAGGTA CGTCCACCGG CTCCCGCCAC GTTCATCGCC 
CAACTAAACC TCCATCCCGT ACTCGCGACC CTGCTGTACC AGCGCGGGCT TCACGAACCG
GAAGCAGCGC ATGCGTTTCT GGCCGCCGAT TACACCTCCG GCTTGCACGA TCCACTGCGC
ATGCGTGGTA TGGCAGAAGC AGCAGCTCGG ATCGCCACCG CCATCGATCG CGGTGAACGC
ATGGCCGTCT ATGGCGACTT TGATGTTGAT GGGGTCACGG CGGTTGCGCT GCTCACCCAG
GCTATCCGGG CGATGGGAGG GAAGATCCGA CCATACATTC CTCACCGGGC CCGCGAAGGT
TATGGTCTCA ACAATGTGGC AATCGGTCAG TTGGCTGCCG ACGGTGTTCG ACTGCTCATT
ACCGTCGATT GTGGTATCTC GAACGTGGCT GAAGTCGTCG AGGCCAAACG ACTGGGGATG
GATGTGATCG TGACCGACCA TCACCACCCC CCAGACGAGT TGCCATCGGC TGATGCCGTC
ATCAATCCTA AACAACCCGA TTGCGCGTAT CCTTTCAAAG GGCTGGTAGG CGTCGGCATT
GCGTTCAAAT TGGTGCAGGC ACTGGCCCGT TACGGAAAAC GTCCGGCTCA CTTGCGTGGG
CGTGATTTGC TCGATTTGGT TGCATTGGGA ACAATAGCCG ACATGGGTCC GCTCGTTGAC
GAGAATCGGG TATTGGTACG GGCAGGCTTA CTTGCGCTGA ACGAGACGAA CCGACCCGGC
GTGCGCTCAT TGATCACGGT AGCCGGTCTG ACACCGGGTG CCGTCGATAG CGGAGGTGTT
ACCTTTAGTC TGGCTCCTCG CCTCAATGCT GCCGGTCGGC TCGACGATGC GCGCCGCGCC
TACGAATTGT TGCTCGCCGA CGATCAGGCC ACCGCCGATG CGATTGCCGC CGACTTGCAT
GCTACCAACC GTGAGCGACA GAGCATGACC CGTCAGTTAC AGACGATTGC CGAAGAATTA
ATCAATGCGA GCGGTCGGGC TGAGCACCCC TTGATCGTAC TAACCAACCC GAACTTTAAC
GCCGGTCTCC TCGGTTTGGT CGCTGCGCGG TTGGTCGAAC GTTATCATCG CCCGGTGGTC
GTCATCGAAC AAGGTAGCGA AACGTCACGC GGCTCGGCAC GGTCGATCCC CGGTTTCAAT
ATCATCGAGC TGCTCGATCA GTGTGCCGAT CTCTTCGTTC GCTACGGGGG ACATACAGCC
GCAGCCGGCT TTACCATCCA TACCGCCAAC ATTCCGGTGC TCGAACAACG CCTGTTGGCG
CTAGGGAAAC AGTATCTGAA AGAAGAACTT CTAACGCCAA GGTTACTGAT CGACGCGAAA
CTACCCCTCG ATGAACTGTC GTGGGATGTC TACTACGCTA TCCAACAACT CGAACCGTTC
GGGCATTGCA ATCCAACACC GATGTTTATG GCTCCTAACG TGACGGTGAT CGATCCGCAG
ACAACCACTA CCGGCGATCA TCTGCGTATG CGAGTACGCG CCGGTACTAA CGTCTATGAA
GCGATTGGGT TTAACTTCGG TCACTTCGCT GCTGCCCTCC AGCGTCATCC CACCGTCGAC
CTTGCTTACC AACTCGCGGT CGATGAATGG AACGGTCAGC GCCGGATGCG TCTGTTAGTA
CGTGATTTTC GGCGGGCAGG GCAAGGAGGG TAA
 
Protein sequence
MSARRQRWEV RPPAPATFIA QLNLHPVLAT LLYQRGLHEP EAAHAFLAAD YTSGLHDPLR 
MRGMAEAAAR IATAIDRGER MAVYGDFDVD GVTAVALLTQ AIRAMGGKIR PYIPHRAREG
YGLNNVAIGQ LAADGVRLLI TVDCGISNVA EVVEAKRLGM DVIVTDHHHP PDELPSADAV
INPKQPDCAY PFKGLVGVGI AFKLVQALAR YGKRPAHLRG RDLLDLVALG TIADMGPLVD
ENRVLVRAGL LALNETNRPG VRSLITVAGL TPGAVDSGGV TFSLAPRLNA AGRLDDARRA
YELLLADDQA TADAIAADLH ATNRERQSMT RQLQTIAEEL INASGRAEHP LIVLTNPNFN
AGLLGLVAAR LVERYHRPVV VIEQGSETSR GSARSIPGFN IIELLDQCAD LFVRYGGHTA
AAGFTIHTAN IPVLEQRLLA LGKQYLKEEL LTPRLLIDAK LPLDELSWDV YYAIQQLEPF
GHCNPTPMFM APNVTVIDPQ TTTTGDHLRM RVRAGTNVYE AIGFNFGHFA AALQRHPTVD
LAYQLAVDEW NGQRRMRLLV RDFRRAGQGG