Gene Cagg_3835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3835 
Symbol 
ID7266315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4674174 
End bp4675244 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content55% 
IMG OID643568646 
ProductDRTGG domain protein 
Protein accessionYP_002465106 
Protein GI219850673 
COG category[R] General function prediction only 
COG ID[COG0857] BioD-like N-terminal domain of phosphotransacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000379504 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAACAC TGTATGTCGC CTCAACTGAG ACCTACGTTG GTAAGAGTGC GGTGTGTGTG 
GGTTTGCTGC GCCGAATGCA ACGAGATGGC TACCGTGTCG GGTATATGAA ACCGGTAAGC
GTTTCGGTTA CTCACACGCC CGACGCGGTG CTTGATGAGG ATGCCGCCTT TATTCGCCAG
ACCATTGGTC TTGACGCACC TATGGAGCAG GTTGCGCCGG TGCTCATTAC ACCGGGCGTT
ATCGAGTCGA TCTTGCGCGG GCAACCCCAT TCGTTTGCGA AGACCTTGCG CGATGCCTAT
CTGGCCGTAT CACGCCAGAA AGATGTGATG GTGTTAGAGG GGACTAATAC GTGGGCCGAG
GGCGCGCTGG TCGATCTGAC GGCCGATCAA GTGACCGATA TGTTGCAAGC ACCCGGCTTG
CTCGTGTGTC GCTACACTTC GACACTGTCG GTTGATACCA TTCTCAGTGT CCAACGGTAC
GTTGGGGATC GTTTGTTGGG GGTGTTGATT AATCAGGTTG AAGAGCCGCA CCGTGAGTTT
GTGCGGAACC GCGTTACTCC GTTTCTAGAG GGGCGTGGTA TTCCGGTGTT GGGTGTCCTT
CCTCGCGATC GTTTGCTGTC GGGGGTGACG GTAAACGAAC TGGCTCAGCA TCTCGGCGGG
CAAGTAATCG GTCGCCCTGA ATGGGGTGAG AAGATGCTCG ATTCCTTGAT GATCGGTGCA
ATGGGTGCAG ATGCCAGTCT CTCGTTCTTC CGCCGGCGGG CAAATAAAGC GGTGATTACC
GGCGGTGATC GGAGCGATTT GCAGTTGATT GCCCTGCAAA CGAGTACGAA TGCGCTGGTC
CTTACCGGCA ATATCCGACC AACGATGCAG GTGATGGATC GTGCCGCCGA ATTGGAGGTG
CCGATTATTC TCGTCGCCGA TGATACACTC AGCACCGTTG ATCGGGCTGA AAAGTTGTTT
GGTCGGGTCC GGTTTCACCA AGAAGCCAAG TTGCGCCGTT TCACTGAGTT ACTTGATACA
CACTTTGATT TTGATCGTTT GTACCGACTG CTAGGACTGA AGATTCATTA G
 
Protein sequence
MATLYVASTE TYVGKSAVCV GLLRRMQRDG YRVGYMKPVS VSVTHTPDAV LDEDAAFIRQ 
TIGLDAPMEQ VAPVLITPGV IESILRGQPH SFAKTLRDAY LAVSRQKDVM VLEGTNTWAE
GALVDLTADQ VTDMLQAPGL LVCRYTSTLS VDTILSVQRY VGDRLLGVLI NQVEEPHREF
VRNRVTPFLE GRGIPVLGVL PRDRLLSGVT VNELAQHLGG QVIGRPEWGE KMLDSLMIGA
MGADASLSFF RRRANKAVIT GGDRSDLQLI ALQTSTNALV LTGNIRPTMQ VMDRAAELEV
PIILVADDTL STVDRAEKLF GRVRFHQEAK LRRFTELLDT HFDFDRLYRL LGLKIH