Gene Cagg_3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3397 
Symbol 
ID7267137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4125779 
End bp4127539 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content53% 
IMG OID643568206 
ProductDNA repair protein RecN 
Protein accessionYP_002464677 
Protein GI219850244 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00306133 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATCG AGTTGCAGAT TCAGGATTTC GCTATTATTG ATCGTCTGCA TCTGCGCTTT 
GAGCAGGGCT TTACTGTGCT CACCGGTGAA ACTGGTGCCG GTAAGTCGAT CATTATTGAC
GCTCTCGGCA CCTTGCGCGG TGAGCGAATT GATCCTACCT TTGTCCGCGC CGGATGTTCA
CGAGCAAGGG TTGAAGGTAT CTTCAGCCTC GATGATTGCC CGCATATACT CCCGCTACTC
ACCGAACTCG ATTTACTGAC CGAAGGCGAC GATCAAGTTA TTCTGGTTCG TGAGATCTCT
GCCGAGTCGG GGCGTAGCAT TGCTCGGATC AATGGGCGTG CAGTCAGTAG CGCCACTTTA
CGCGAGATTG GGAGCCGGCT GATCGATATT CATGGTCAGC ACGAAGGTCA GTCACTGTTT
AACCCCCGTA CCCATCTTGA ACTGCTCGAC CGGTTTGGTG ATTTGTTACC GATACGGCAA
CGGGTTGCCG ATCAGTTGAC GACATTACGA GCGGTACAAA CCCAATTAGC CGAACTCCGT
ACCGGAGAAG CTCATCGACA AGCGCGGATC GAAGAGCTAG AGATGCTGCG CGACGATGTC
ACGGCAGCGA AGCTTAAACC CGGCGAAGAG GAAGCGTTGT TGCGGGAACG GACCATTGCT
CAGAATGCAG CTCGGATTGC AACCTTAACC GATGATGCGT ACCGTGCGTT GTATGCCGGT
AGTGAGGGGC GAAGTGGGCG ATCAGCGATT GAGGCAATGG TATTTGCGGT CAACGCCTTG
AACGAATTAT CCCGCTTTGA TCAGCGCACA ATTCCGTTGG CGCAGCGAGC TACCGATATT
CGTTATCAAC TGGAAGACTT GGTAACGGAC CTGCGCAAAT ATCGTGCTGA TCTCGATGTC
GATCCACGTC GCCTCGACAT GATCGAAGAT CGGCTAACAG TACTGCGGGA TTTGCAGCGT
AAGTACGGGG TTGATTTGAA TACGCTCATC GAGCGGGCTA CCCAGGCTGA AGCAGAGCTT
GAGAGTCTGC GTAATCGCAC CGGCCAAATT GCCGATCTTG AGAAGCAAGA GCAGGCTTTA
CAGGCTGAAT TGGCCCGTCT CGCCCTTGAA TTATCGCAGC GCCGGCGTCA AGTCGGTGAA
GAGTTGAGTC ACCAGATTAT CCAGGCAATG CACGATTTGG CGATGCCTAA CGTTCATTTC
GCCGTTCAGA TAACATACGA TGATGATCCA CAGGGGTTAC CGGTAAACGG ACGTCGGGTT
GCTTGCGATC GGACAGGTAT TGATCGGGTA GAGTTTTTGA TAGCCCCTAA CCCCGGTGAG
CCACTCAAAC CGCTGGCCCG TATCGCCTCA GGTGGTGAGA GTGCTCGTTT GTTACTGGCT
CTGAAGTCGA TTCTTTCACA GGTAGATGAG GTGCCAACCC TGATCTTCGA CGAGATCGAT
ACCGGTGTTG GTGGTCGAGC AGGTCACGTG GTTGGACAGA AGCTCTGGGC TATTAGTCAA
CGTCATCAGG TACTGTGTAT TACCCACTTG CCGCAAGTGG CAGCGTTCGC TAACGCGCAT
TACCATATCC GTAAAGAGGT TCATGCCGGG CGCACGCGCA CCAATGTTGA GGTGTTGTCC
GCCGAACAAC GGATCGATGA GCTTGCGGCA ATGCTCGATG GTGTTCCCAA CGACCATAGT
CGGGCTAATG CTCGTCAGAT TCTCGAACGA GCACAGACTT GGAAATCGCA CCGCCAGACC
GAACTGATGA CAAAATCCTA G
 
Protein sequence
MLIELQIQDF AIIDRLHLRF EQGFTVLTGE TGAGKSIIID ALGTLRGERI DPTFVRAGCS 
RARVEGIFSL DDCPHILPLL TELDLLTEGD DQVILVREIS AESGRSIARI NGRAVSSATL
REIGSRLIDI HGQHEGQSLF NPRTHLELLD RFGDLLPIRQ RVADQLTTLR AVQTQLAELR
TGEAHRQARI EELEMLRDDV TAAKLKPGEE EALLRERTIA QNAARIATLT DDAYRALYAG
SEGRSGRSAI EAMVFAVNAL NELSRFDQRT IPLAQRATDI RYQLEDLVTD LRKYRADLDV
DPRRLDMIED RLTVLRDLQR KYGVDLNTLI ERATQAEAEL ESLRNRTGQI ADLEKQEQAL
QAELARLALE LSQRRRQVGE ELSHQIIQAM HDLAMPNVHF AVQITYDDDP QGLPVNGRRV
ACDRTGIDRV EFLIAPNPGE PLKPLARIAS GGESARLLLA LKSILSQVDE VPTLIFDEID
TGVGGRAGHV VGQKLWAISQ RHQVLCITHL PQVAAFANAH YHIRKEVHAG RTRTNVEVLS
AEQRIDELAA MLDGVPNDHS RANARQILER AQTWKSHRQT ELMTKS