Gene Cagg_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0154 
Symbol 
ID7266893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp204504 
End bp205559 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content57% 
IMG OID643565026 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_002461541 
Protein GI219847108 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.386223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0859714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC AAATAACCTT CACCGAAATT GCGCAGCTCC CAACAGCGTA CCATGTTGCG 
TACATCAAAA TAGAACCGAA TATCGTTCTG GCCCCGATGG CCGGCGTCAC CGACAGCATC
TTCCGCCGCA TGATATTGCG GTTGGGTGGG TGTGGTTTGG TGAGTACAGA GATGACCAAC
GCTGCGAGCG TCAGTCCAAA AGCATTGCGC CGCCATCGCT TGCTCGACTA CCTACCAGAA
GAACGACCGT TGACGATGCA GATTTCGGGC AACGATCCCG ATCTCGTAGC GAATGCAGCA
CGGGTCGTCG AGCAATTAGG GGCCGATATT ATCGATATTA ACTGCGGTTG TCCGTCCCCG
AAAGTCACCG GTGGCGGGCA TGGTTCGGCG TTGCTGCGTG ATCTGCCGAA GATGGAACGA
CTCTTGCGAG CAGTGCGGGC TGCCGTCCAG ATTCCGGTGA CGCTCAAGCT GCGTGCCGGC
TGGGACGAAG CGAGCCTCAA TTTTATTGAA GCCGGCCAGC GCGCCGAAGC TGCCGGCGTC
GCTGCACTAA CGCTACATCC GCGTACCCGT GAGCAAGGGT ACAAAGGGCA AGCCGATTGG
TCACGAGTGG CAGCGCTCAA GCGTGCCGTC TCAATCCCGG TGATCGGGAG TGGTGATGTC
GTCACCGCGC AAGATGCGCT CATCCGATTA CGCGATAGTG GCGCCGATGG CGTGATGATC
GGACGCGGCG CAATCGCTAA TCCGTGGATT TTCCGCCAAG TCGCCGATCT GCGTCAAGGC
CGCACACCGT TTGAGCCAAC TCCTGCCGAT AAGTACCATC TCTTGCTGGA GTACATGGCG
ATCTACGCCG AAGAATTACC CGAACGGTTG GCGCTCAATA AGATCAAACA ACTGATCGGT
CAGTTTTACA TCGGCTTACC CGGCAGTAAC CATCTGCGTG TCGCCGTTCA TACTTCCACC
AGTCTTGCCG CAGCGCAAGA AGCAATTGAG CGATTCTTCG CACCGTATCT CGAAACAGAT
GCAGCGGTGC CTGAACCGGC AATTGCCGCC GATTAG
 
Protein sequence
MSDQITFTEI AQLPTAYHVA YIKIEPNIVL APMAGVTDSI FRRMILRLGG CGLVSTEMTN 
AASVSPKALR RHRLLDYLPE ERPLTMQISG NDPDLVANAA RVVEQLGADI IDINCGCPSP
KVTGGGHGSA LLRDLPKMER LLRAVRAAVQ IPVTLKLRAG WDEASLNFIE AGQRAEAAGV
AALTLHPRTR EQGYKGQADW SRVAALKRAV SIPVIGSGDV VTAQDALIRL RDSGADGVMI
GRGAIANPWI FRQVADLRQG RTPFEPTPAD KYHLLLEYMA IYAEELPERL ALNKIKQLIG
QFYIGLPGSN HLRVAVHTST SLAAAQEAIE RFFAPYLETD AAVPEPAIAA D