Gene Cagg_1071 details

Gene Information

Locus tag: Cagg_1071
Symbol:
ID: 7268523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1322209
End bp: 1324545
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 57%
IMG OID: 643565916
Product: protein of unknown function DUF324
Protein accession: YP_002462421
Protein GI: 219847988
COG category: [L] Replication, recombination and repair
COG ID: [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily)
TIGRFAM ID: [TIGR02674] CRISPR-associated RAMP protein, Csx10 family
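
The start coordinate, end coordinate, gene length, and protein length listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1322209
end_bp = 1324545

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2337
print(protein_length)  # 778
```

Under these assumptions the computed values agree with the listed 2337 bp and 778 aa.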


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATCC GTCTCGAATT CGAGATCGAG TTCAAGAGCG ATTACCACAT CGGCGCCGGT 
TATGGTCTCG GTCTGCAAGT TGACTCGGCC CTACTCCGCG ATGCCGACGG CGTACCGGTA
ATCCGCGGCA CGGTGCTGGC CGGTCTCCTG CGCGAAAGTT TAACCAACCT GCTGACACTG
CAAGTGTTCG CGTCCAATCA CCATGTGGTG GATACGATCT TCGGTTCGCC CGCCCGCCAA
AAGCGTTGGC GTATCTCTTC TGCACGACCT GCGGGTATGA TGACACCGCT CGTGCCATCG
GACGTGTGGA GTGCGGGCGA AACGGCGGCG CAGATCACAA CGCGCGTGCG GGTTAATCCT
CGCACACGTC GGGCCGAGAA AAATAAACTG TTCACCCGTG AGGAGGGTGA TGGCAGCTTG
CACTTCCGCT TCGTGGCCGA GTGCCAGAAT GATGATGCCG ATGCGCAGCG CGAGGCTGAA
TGGCTCGTCG CCGCTGCACG TATGCTGCGT AATCTTGGCG CCGGGAAGCG GCGTGGCTAT
GGTGAGTGCG AGATTCATCT TGTTGATCGA GCGCAGGAAA CCGCGATTCT GGATCGATTG
GCCAAGCGGT TAAGAGATGA ACAAGTCGCA GAACCCGCCT TGCCTGCCGG CGCAATCAAC
ATCAGTCCGC TTCAGTTGCC CCCCTATCCC AACCATCATA CCTATCGCTT GCGCGTTCTC
ATGCGGCTTG ACGAACCGTT ACTGATCGCT CGTCGGGTCG AAGCCGGTAA TCAGTACGAG
ACGCTCGACA TCATTCCCGG CAGTGTGCTG CGCGGCGCGC TGGCGTGGCG TGCGGCGAAG
CATCTGGGAA AGCAACTTCA GGGGTCGGTA TACCAAGATT TTGTCGATCT GTTCTACCGT
GATACAGTGC GCTTCTCGAT GTTGACCCCG GTAGAAGTGT TTCAGAAAGC TAATGGTTAT
CCCACTATAA TTGCCCCGCG CGATCTGCTA ACGTGTGAAT TGCATCCGGG GCATGCCGAT
CCTAGCCGAG ATAAAGGCCA TGGAGTCTGG AGTCTGTTCG ATCCGAGTGC GCCAGAGGAA
TGCCCGCGAT GTAAGCGTAC CGGCCAGTTA ACAAGCTTGG AAGGGTTTAT TTCATTAGTC
AGTGGTATGC CGCGTTCCAA GCACAAACCG TCCACAATGG TGGAAATGCA CATTCGCATC
GATCCTGACA GTGGGCGGGT GCGTACCGGC GATCTGTACG GATACGTAGC TTTAGAGCCG
GGTCAGTATT TTGTCGGTGA GGTAACCTGT ACCGATCAAA CGGTCTGGGA CCTTTTACAG
AGCATGGCCA ATCTTCAACC AAATGGAGCG GTAAACGAAC TGCACCTCGG TAAAGCCACC
CAACGCGGCT ATGGTAAAGT CAGCGCCGTG TTTCAGAAAA TCGACGAACC GCTGTTCAGC
TTGCAGTCAC TCACAGGACG CCTCACGTCA ACGGAGCATG TCACGATGCT ATTGTTGAGT
GATGCGATTA TCGTCGATCC GTGGGGACGT TTCTGGCGTG GGTTCGATAC CGCATGGCTA
AAGCGCGAGT TGCAGTTGCC GAATGGCGCA GCAGTCTCGA TTGATTGCAA CCAGAATGGC
GAGGCGCTGG CGTTTTCAGC GGTGCGGACG GTGGACTCCT TCAATGCCAC GTTAGGGTTG
CCGCGCGCGC GAGATATTGC CATCGTTGCC GGTTCCAGTG TACGCCTGAG TTTCAAGGGA
ATCCCGTTGG ATGACCTGCG ACAGCGTCTG GGGGAGGTAG AGGCGCAGGG TATCGGTCTC
CGCCGCAATG AGGGCTTCGG TCGTGTTGCG TTCAATCATC CGGTCTACCG TCAACTTCAA
GGGATCACCA GTCCGATGCT CGATCTCACC CCGTTGCAGA TGGCGTCTGC CGAGCAGTCG
CCCCCGTTTG TGGCCGTGCT TGCTTTCACC CGCGAGTGGG AGGAAAAGCT CGAAGCGGAA
GCAACATCCT TCGCTTGTTT CAACGATGGG CGTTTTGAGA CGATTGCTCG GCTGTTACAT
GTTTCTCAGC ACACCTCGGT TGATGCCATC AAGCAGGATT TACAACGGTC AGGGAATGCA
GAGAACCTCC TCGGCAAATC GTTGTCCGGC CGCGACAAAC CGAATTTCTT CACGGCTGAC
GGGAAGCGCG GTATGGATGT GATCGATAAG CTGCTGGATG CGTTGGTAGA TAAACTCAAG
TCACACAAGT TGGACCACAA TCCGCTGGCA TGGCGCATCG GCTTGCAGAT CGGCTTGCAG
ATGCTGGCTG CCCGGATCGC TGCGCCTGCC CGCCAGAAGG CTGAAGAGAG GAGATAG
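
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence printed in 10-base blocks like the one above. The helper below is a hypothetical sketch, not the database's own method: `gc_fraction` is an illustrative name, and it assumes the input contains only the letters A, C, G, T plus whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small demonstration.
print(gc_fraction("ATGAGTATCC"))  # 0.4
```

Passing the full gene sequence as one string (spaces and line breaks included) would give the fraction for the whole gene.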
 
Protein sequence
MSIRLEFEIE FKSDYHIGAG YGLGLQVDSA LLRDADGVPV IRGTVLAGLL RESLTNLLTL 
QVFASNHHVV DTIFGSPARQ KRWRISSARP AGMMTPLVPS DVWSAGETAA QITTRVRVNP
RTRRAEKNKL FTREEGDGSL HFRFVAECQN DDADAQREAE WLVAAARMLR NLGAGKRRGY
GECEIHLVDR AQETAILDRL AKRLRDEQVA EPALPAGAIN ISPLQLPPYP NHHTYRLRVL
MRLDEPLLIA RRVEAGNQYE TLDIIPGSVL RGALAWRAAK HLGKQLQGSV YQDFVDLFYR
DTVRFSMLTP VEVFQKANGY PTIIAPRDLL TCELHPGHAD PSRDKGHGVW SLFDPSAPEE
CPRCKRTGQL TSLEGFISLV SGMPRSKHKP STMVEMHIRI DPDSGRVRTG DLYGYVALEP
GQYFVGEVTC TDQTVWDLLQ SMANLQPNGA VNELHLGKAT QRGYGKVSAV FQKIDEPLFS
LQSLTGRLTS TEHVTMLLLS DAIIVDPWGR FWRGFDTAWL KRELQLPNGA AVSIDCNQNG
EALAFSAVRT VDSFNATLGL PRARDIAIVA GSSVRLSFKG IPLDDLRQRL GEVEAQGIGL
RRNEGFGRVA FNHPVYRQLQ GITSPMLDLT PLQMASAEQS PPFVAVLAFT REWEEKLEAE
ATSFACFNDG RFETIARLLH VSQHTSVDAI KQDLQRSGNA ENLLGKSLSG RDKPNFFTAD
GKRGMDVIDK LLDALVDKLK SHKLDHNPLA WRIGLQIGLQ MLAARIAAPA RQKAEERR
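
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first three 10-base blocks of the gene. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two differ in which codons may serve as start codons). `CODON_TABLE` and `translate` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display formatting.
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGTATCC GTCTCGAATT CGAGATCGAG"))  # MSIRLEFEIE
```

The result matches the first ten residues of the protein sequence. The last nine bases of the gene, `AGGAGATAG`, translate to `RR` followed by a stop codon, consistent with the protein's final two residues.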