Gene Cagg_2939 details

Gene Information

Locus tag: Cagg_2939
Symbol:
ID: 7268812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3602355
End bp: 3603977
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 54%
IMG OID: 643567761
Product: single-stranded nucleic acid binding R3H domain protein
Protein accession: YP_002464235
Protein GI: 219849802
COG category: [S] Function unknown
COG ID: [COG3854] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
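
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (the record does not state the convention, but this reading reproduces the listed 1623 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3602355, 3603977)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Both values agree with the Gene Length and Protein Length fields of this record.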


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.305311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.269188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTGA CGGACACTGT CAATGACATT CAGCTTCTAT TAGCAACATT GCCACCAGCG 
ATACGTGACG CGATCCAGAA AGCTAACGAT CAGGATAATT TGCTCGAGAT CGTGATGGAT
CTGGGCCGTC TGCCTGAAGC ACGCTATCGT GGTCACGAAC TATTCTTGAG CGATCGGGAG
GTGACCGCTG AAGATATCAA TTATGTCATT GCCCGCATCG GTGAATTTGG TGAAGATAAT
CGCGCCGGTA TTCCTCGTAC ACTGCATCGT ATATCGGGCA TTCGTAATCG CCGTGGTGTG
GTTATTGGCT TGACCTGCCG CGTTGGCCGG GCAGTCTACG GTACGGTTGA TATTATTCGC
GATTTGGTTG AAACCGGTCA GAGTATTTTG CTGCTCGGTA AGCCGGGTAC CGGTAAGACG
ACGCTGTTGC GTGAGACGGC GCGAGTGCTC GGTGATGAGT TGCGTAAGCG GGTGGTGATA
GTCGATACCT CAAATGAGAT TGCCGGCGAT GGTGATATTC CCCATCCCGG TATCGGTCGT
GCCCGCCGTA TGCAGGTACC GCGACCGTCC GAACAGCATA ATGTGATGAT CGAGGCGGTT
GAGAATCACA TGCCTGAAGT GATTGTGATC GATGAGATCG GTACCGAATT GGAAGCTGCT
GCCGCTCGTA CTATTGCCGA ACGTGGGGTA CAGTTGATCG GGACGGCGCA CGGTAATACC
CTCGATAATC TGATGATTAA CCCCACGCTG TCGGATCTGG TGGGAGGGAT CCAGGCGGTG
ACGCTTGGTG ATGAAGAGGC GCGTCGGCGT GGAACACAAA AGACGGTGCT CGAGCGCAAA
GCGCCACCGA CGTTTAGCAT TCTGGTTGAG ATTCAGTCGT GGGATAGCGT GACCGTCTAC
CCGGATGTAG CAGCGGCGGT TGATGCCATT TTGCGCGGTG AGGAGCCGCC ATGTGAGCAA
CGCATCCGCG AGCCGGACGG TACGGTGCGG CGTGAGCCGG TACGGCGCGC GCTGATCGAT
GCCCCGGCGT TTGGTTTCCG ACGTAGTCGG GGTGGGCGTG AACAGTCGCA GATGGGGACA
AACGGTCCAC GTTTGCGTGA TCGGAATGGT AGTATGACCG GCTCGGTTAC TACCGTGCCA
CCCCAGCGTA TCTTCCCGTT TGGTGTGAGT CGCAACCGAT TGCAAAATGC AATTGAACGA
CTGCGGGTGC CCGCCGTTAT TGTGCGTGAC TTGAAAGATG CAACCTTAGT GATGACCCTG
AAAAACTACT ACCGGCAGAG TTCACATCAG TTGCGGCAAG CTGAGGAACA GGGGGTGCCG
GTGTATGTGT TGCGCAACAA TACGATCACG CAGATGGAAC GTCAATTAGC CCAAGTCTTT
CAGTTGCGCG AGATGTTTGA TGATGAAGCA GAGTATTCGC GCAGCGATTC GGTGATCGAA
GAGGCATTGC TCGAGACTGA ACAGGCGATT GCGCAAGTTA TCAACGGTGA ACGCAATGCG
GTAGAATTGA CGCCACGTAG TAGTTATATT CGCCGCTTAC AACATCAGAT GGCCGATCGG
TACAATCTAC GTTCAGAGAG CCGTGGCGAT GATCCAAACC GGCGGGTGAA GATCTTTCGG
TAA
 
Protein sequence
MAVTDTVNDI QLLLATLPPA IRDAIQKAND QDNLLEIVMD LGRLPEARYR GHELFLSDRE 
VTAEDINYVI ARIGEFGEDN RAGIPRTLHR ISGIRNRRGV VIGLTCRVGR AVYGTVDIIR
DLVETGQSIL LLGKPGTGKT TLLRETARVL GDELRKRVVI VDTSNEIAGD GDIPHPGIGR
ARRMQVPRPS EQHNVMIEAV ENHMPEVIVI DEIGTELEAA AARTIAERGV QLIGTAHGNT
LDNLMINPTL SDLVGGIQAV TLGDEEARRR GTQKTVLERK APPTFSILVE IQSWDSVTVY
PDVAAAVDAI LRGEEPPCEQ RIREPDGTVR REPVRRALID APAFGFRRSR GGREQSQMGT
NGPRLRDRNG SMTGSVTTVP PQRIFPFGVS RNRLQNAIER LRVPAVIVRD LKDATLVMTL
KNYYRQSSHQ LRQAEEQGVP VYVLRNNTIT QMERQLAQVF QLREMFDDEA EYSRSDSVIE
EALLETEQAI AQVINGERNA VELTPRSSYI RRLQHQMADR YNLRSESRGD DPNRRVKIFR
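
The relationship between the two sequences above can be illustrated with a short translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so a standard codon table built inline is enough here. Only the first line of the gene sequence is used as input, and the names `CODON_TABLE` and `translate` are illustrative.

```python
# Build the codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases, 20 codons).
first_line = "ATGGCTGTGA CGGACACTGT CAATGACATT CAGCTTCTAT TAGCAACATT GCCACCAGCG"
print(translate(first_line))  # MAVTDTVNDIQLLLATLPPA
```

The output matches the first 20 residues of the protein sequence (MAVTDTVNDI QLLLATLPPA). The final codon of the gene sequence, TAA, is a stop codon and does not contribute a residue.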