Gene Cagg_2208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2208 
Symbol 
ID7266781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2704091 
End bp2706418 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content56% 
IMG OID643567039 
ProductSpoVR family protein 
Protein accessionYP_002463527 
Protein GI219849094 
COG category[S] Function unknown 
COG ID[COG2719] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGTA CCCGCCTTCC CGCTCATCTC GAACAGGCGC GTCAGGAAAT CGAGCAGATC 
GCCCGCTCCT ATGGCCTCGA TTTCTTTCCG ATTATTTACG AAGTGCTCGA TTACCGCACG
CTCTATGAGA CGGCGGCATT CGGCGGCTTC CCGACGCGCT ACCCACATTG GCGCTTTGGT
ATGGAGTATG ACCAATTACT GAAGGGCCAT ATTTGGGCGG GCAGTACCAT CTACGAGATG
GTGATTAACA ATAACCCGGC GTATGCCTAT CTGCTGGAAG GGAACGAAGA TGTGACCCAG
AAAATGGTGA TGGCTCACGT CACCGGTCAC GTTGATTTCT TCAAGAACAA TATGTGGTTT
GCCCACACCA ATCGTAAGAT GCTCGATACG ATGGCAAACC ACGCCGCCCG GATCCAGCGG
ATTATCGACC GAATTGGCTA CGATCAGGTT GAGGAGTTTA TCGATACGTG CTTGTCGTTG
GAAAATCTGA TCGATTATCA CGCTCCATAT ATCCGCCGGC CTGAAGCCGT CACCCCACGC
CCGCTGATTA ACGATGAGGA ACCACCGGTG GTTGAGGGTC TACCGGTAGA GCGTGAGTAT
ATGCGCGATT ATATCAATCC ACCGGAGTTT CTCGAAGCCC AACGCCGCAA ACTCGAAGCG
GAGCGTGCCC GTCAGCGCCG GTTCCCTGAA AACCCGCAGC GCGATGTGCT GCTCTTCCTC
ATGAATTTTG CACCGCTTGA GAACTGGCAA CATACTGTTC TCGAAATTAT CCGCGATGAG
GCGTATTATT TTGCCCCACA AGGTATGACC AAAATCCTCA ATGAGGGGTG GGCATGCCTT
CACGGCGAGA GTCTGATCGT GACTGACCGT GGCCTGATCC CGATCCGCGA CGTAGTCACG
CAACGCCTCA CGGCGCGTGT GAGCGATGGT CAGCGCCGGC AGACAGTTTA CGACTGGAAT
GTCTTTACGG CGTATCCGAC GGTTACGATC CGTACCCGCA CCGGTTTGTC CCTGACCGGT
TCACACAATC ATCGTATCTT GCTTGCCAAC GGTGAATGGC GCCGGCTCGA TGAACTGCAC
TGCGGTGATC AGGTGCGCAT CGCCGGGGGC ACCGAGCTGT GGGCGACGAC ACCGGCTACG
CTGCGCCGCC GACCAAAACT CAAGCGGGTG TTGGCCTACG TGCCGGCGAA TACCGCGGCA
CAGTCGGTTC TTGCGCGTCA CCGCTCCTAC CGACGCACTG CTGAGGTGGT GGTCGACGAG
ACACTAGCCG CTCAGCTCGG TCGATCTTGC ACACGCCGTG CCGATCATCG TGCGCTCGCT
GCCATCTGTC GTTCACCACG ACCACAGGTG GTCGCATTTT TGCGTACCTT CTGCGCCGGC
ACTCCACGCT TGGCCGACAA CAGCGTGACG ATCGCCTGCC CCACCGACAG CGTTGCTGCA
ACGGTTCAGT TGCTGCTCCT CAATCTGGGT GTACTGGTAA CACGACGAGA TACCAGCTTG
CATATTGCTG ATGCTGCTGC ACTTGAGCGC GCTCTGCTGG CCCCTTCCAC GCCCGTCGCA
TGGACCGATG ACATCGTTGC GATTGAACAC GGCACTGCCG ATGTCTACGA CATCTCAGTC
ACTGAAACGC ATCGCTACGC GGCGCAGGGC TTCATCAATC ACAATAGCTA CTGGCACAGT
ACCATCATGA CCAAGCATGT GGCCTCGGCA GCCGAGATTA TTGCGTTTGC CGACCTGCAC
TCTGGAGTCG TCGCCACGAG CGGTGGTCGG TTGAACCCAT ACAAGCTCGG GTTAGAATTA
CTACGCGACA TTGAGGATCG TTGGAATAAA GGGAAGTTCG GCAAAGAGTA CGAAGAGTGC
GACGACATCG CGCTCAAACG CTCGTGGGAT AAGCAGCTAG GGCTTGGCCG TCAGAAGATC
TTTGAGGTAC GCCGGCTCTA CAACGATGTC ACGTTTATTG ATGAGTTCCT CACGCCGGAA
TTTGTGATGG AGCAGAAGCT CTTCACCTTC CGCTACAATC GCGATACCGA CCTGTACGAG
ATTGCGTCAC GCGAATTCAA GGAAGTGAAG GAGAAATTGC TCTTCCGCCT GACGAACTTC
GGGCAGCCGG TGATTTTGGT TGAGGATGGC AACTACGGCA ACCGCGGCGA ACTCTACTTG
CGCCACCGTC ACGAAGGGGT TGATCTCAAG ATGGACTATG CGCGTGAGAC CATGCGCAAC
CTCTACAAGA TTTGGACTCG TCCGGTGCAT TTGGAGACGG TGATCGAAGA AAAGCGCCGG
TTACTGTCGT TCGATGGGCG TGACTTCAGC GAACGACGGA TGGATTAA
 
Protein sequence
MKSTRLPAHL EQARQEIEQI ARSYGLDFFP IIYEVLDYRT LYETAAFGGF PTRYPHWRFG 
MEYDQLLKGH IWAGSTIYEM VINNNPAYAY LLEGNEDVTQ KMVMAHVTGH VDFFKNNMWF
AHTNRKMLDT MANHAARIQR IIDRIGYDQV EEFIDTCLSL ENLIDYHAPY IRRPEAVTPR
PLINDEEPPV VEGLPVEREY MRDYINPPEF LEAQRRKLEA ERARQRRFPE NPQRDVLLFL
MNFAPLENWQ HTVLEIIRDE AYYFAPQGMT KILNEGWACL HGESLIVTDR GLIPIRDVVT
QRLTARVSDG QRRQTVYDWN VFTAYPTVTI RTRTGLSLTG SHNHRILLAN GEWRRLDELH
CGDQVRIAGG TELWATTPAT LRRRPKLKRV LAYVPANTAA QSVLARHRSY RRTAEVVVDE
TLAAQLGRSC TRRADHRALA AICRSPRPQV VAFLRTFCAG TPRLADNSVT IACPTDSVAA
TVQLLLLNLG VLVTRRDTSL HIADAAALER ALLAPSTPVA WTDDIVAIEH GTADVYDISV
TETHRYAAQG FINHNSYWHS TIMTKHVASA AEIIAFADLH SGVVATSGGR LNPYKLGLEL
LRDIEDRWNK GKFGKEYEEC DDIALKRSWD KQLGLGRQKI FEVRRLYNDV TFIDEFLTPE
FVMEQKLFTF RYNRDTDLYE IASREFKEVK EKLLFRLTNF GQPVILVEDG NYGNRGELYL
RHRHEGVDLK MDYARETMRN LYKIWTRPVH LETVIEEKRR LLSFDGRDFS ERRMD