Gene Cagg_1072 details

Gene Information

Locus tag: Cagg_1072
Symbol:
ID: 7268524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1324542
End bp: 1326170
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 57%
IMG OID: 643565917
Product: hypothetical protein
Protein accession: YP_002462422
Protein GI: 219847989
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID:
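
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1324542, 1326170)
print(length)                     # 1629
print(protein_length_aa(length))  # 542
```

Both values agree with the Gene Length and Protein Length fields of this record.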


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTACC TCTTGGCCGC CGAGGCCGAT AAGATTCAGG ATTTCATCTT TCGCTCTTCG 
CGCTTGCGCG AAGTGGTTGG GGCGAGTCAG TTGCTGACTC GCTTCTGCCG TAGCGTCGAA
GATACCTTGG CGAAGCAGTA CAACGGTCAG GTTGTGGTCA ACGATGGTGG TAGTTTTCGG
GTGATCTTTG ACGATCGAAA TGACGCGGTT GCTTTCGGCG CCGATCTGGC TGAACGCTAC
CGGCTGGCGT TGGGTGGCAG TCTGACAGTT GCTGAGCCTG TAGCGATGAA CAGTGATTTC
CGCACGGCCA ATGATGAAGC CGGTACAAAG TTGCGCTGGG CGAAGAGTCA TCGGCAGGGA
GTAGTAGCCG AAGTGCATAT GCCGTATGTC GCATTCTGCG AGTCGTGTGG CGTGGGGTTG
GCGGAACGGC GTGATCGGCT GGCGGGAAGG AACGATTCCC GCCACAGGTA TCTGTGTGCG
ACCTGTCAGA TCAAAGCAAC TGAGCGTGAT CGTGGTCTGC GCGAATTCCT TGGTGGAGTG
TACGATCCTT ATGCTAAGAA AGCGGCAATT CCTGCCCACA TTGAACCCGA TTGGCCCGAA
GACGCCGATG CCATTGCCGT TTTTGACCTG AGTAAACGAA ACTACGTAGC CTATTTGGTG
GCCGACGGCA ACGGTATGGG TCAATTATTC GGCAATTGTG ACCAGGGGCA GCTCCAGAAC
CTTTCGCAAG GTCTATCAAC GGTGCTGAGT GAGAGTCTGG CCGTTCCGAT GATTGAGTTC
CGCAAGCAAG TTCCGGCACA GGCGACGATG ATGCCGATGC TCCCGCTCAT TCTCGGTGGT
GATGATCTCT TTGCACTTGT GCCGGCGTCG TATGCGCTCG ATATTGCCCG TCGCTTCTGC
CTCGAATGGG AAGAGCGTAT GCAGATGCTG GTAAATAAGA TAGGTCTGCA CAATGTGCCT
CGCCCGACGA TTGCCGCAGC AGTGGTGATT TGCAAGCGTA CCTATCCGTA TGCACTGGCC
CATCGCCGGG CCGAAGCTTT GCTGGAGGAT GCCAAGCGCC AGAGCAAATT GCTGGCTGCC
AAGACGAACG GGCATCTATC GGCGGTCAAT TTCGAGGTCA TTTTGGGCAA TCGGTTGGCG
GGTATGGCCG AGGCAGACGG TGATCAGGTC ATCCGGCGCT CGTTACGTCC GTATTGGGTC
GCAGAGCACG ATCTCTCGAA AGACGCCTTG CTGCGCGGGA TCGACCTCAA GCATCTGCTG
GCGCAGCGCT ATGCCCTGAA AGATCTTCCC CGGAAGCGTC TGGCCGAATT GCGCCGTTGT
TTTGCCGAGG TGCAGACGGA TATTCCTGTG CAGCAGCGTA CCCAAAACTT AGAACGGTGG
ACGCAGCATC GGCTCGAATG GATTTTGGAG CGATTGAGTG CAGCTTCACG TTCGGCGGTA
GTCGATGCGC TTGCGGTGCT GGGCAAGCCC AAGAACGACG GGAATGGCGC TCACTATTGG
CGCAGTATCA CGCGCGATAA CCGCGATGTG GTCGTTCACG GCATGCTCGA TCTGCTGGAA
GTTTGGGAGT TTGCGCAGGA GTTGAGTCAT AACCCCGACG ATTATGAACC GCAGGAGGAC
GAGGCATGA
 
Protein sequence
MPYLLAAEAD KIQDFIFRSS RLREVVGASQ LLTRFCRSVE DTLAKQYNGQ VVVNDGGSFR 
VIFDDRNDAV AFGADLAERY RLALGGSLTV AEPVAMNSDF RTANDEAGTK LRWAKSHRQG
VVAEVHMPYV AFCESCGVGL AERRDRLAGR NDSRHRYLCA TCQIKATERD RGLREFLGGV
YDPYAKKAAI PAHIEPDWPE DADAIAVFDL SKRNYVAYLV ADGNGMGQLF GNCDQGQLQN
LSQGLSTVLS ESLAVPMIEF RKQVPAQATM MPMLPLILGG DDLFALVPAS YALDIARRFC
LEWEERMQML VNKIGLHNVP RPTIAAAVVI CKRTYPYALA HRRAEALLED AKRQSKLLAA
KTNGHLSAVN FEVILGNRLA GMAEADGDQV IRRSLRPYWV AEHDLSKDAL LRGIDLKHLL
AQRYALKDLP RKRLAELRRC FAEVQTDIPV QQRTQNLERW TQHRLEWILE RLSAASRSAV
VDALAVLGKP KNDGNGAHYW RSITRDNRDV VVHGMLDLLE VWEFAQELSH NPDDYEPQED
EA
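
The sequences above are shown in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared against the protein sequence. It assumes plain A/C/G/T input with no ambiguity codes, and all function names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code; it differs in the set of permitted start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here).

```python
# Codon table built in the conventional TCAG order; the amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final line of the gene sequence listed above.
first = clean_sequence("ATGCCCTACC TCTTGGCCGC CGAGGCCGAT")
print(translate(first))        # MPYLLAAEAD
print(translate("GAGGCATGA"))  # EA*
```

The two translated fragments match the start (MPYLLAAEAD) and the end (EA, followed by the stop codon) of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.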