Gene Cagg_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1122 
Symbol 
ID7268576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1384596 
End bp1385846 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content49% 
IMG OID643565965 
ProductDNA methyltransferase 
Protein accessionYP_002462468 
Protein GI219848035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.909173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0173084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGACA ATACCATTCA ACAACGGCCA GAGTTCACAT TTCGAGAAAA CCACAAAAAC 
GGTCGCCATG GATGGGTGCG TTTAACACCT GCCTACTCTG TGACACTCGT GACCGATATT
CTTAACCGAG AACGAGGAGC ATTCCGCATT CTCGATCCAT TCGCAGGGAC CGGAACCACC
GTACTTTGTG CAGCAGAACA GGGAATGTAC GGTGTTGGTA TCGACATCAA CCCCTTCCTC
GTCTGGCTGG GAAATGCCAA GCTTCGTCGA TACACTCTTG CAGAGATTGC CGAATTCGAG
CACACGGTTT GCCGTATCAT ATCGGCACTC AGATCGGAGG GACCAAGCGT ACCCCCACCA
CCAATCCACA ACGTTGCGCG CTGGTGGTCT CCTCCAGCAC TTGACTTTCT CTGTCGACTG
AAGTATGAAA TTGACACTAG GATCAATAAA GCGCAAGCAA TAAGCGACCT ATGGTACATC
ACATTCTGTC GAGTTCTTAT CATGATCGCT CGTGTCGCAT TTAATCATCA ATCAATGTCG
TTCCAAGATG AGTCGTTCCG GCAAGGGTAT CTTTTTGTAG GAGAAGATCA GTACATCACT
ATTTTGTCAG AAGTAGCGCA TCTTGTGAGC CGATCCGCGA TGAGCAATCC AAGCGGAACG
GGAACGATCA TGTTAGGTGA TTCACGCATG TTGGTATGTC TCGCCGACGA AGAACGTTTC
GATCTCGTCA TTACTTCTCC ACCATACGTT AATCGGATGT CGTATATCCG TGAACTACGC
CCATATATGT ATTGGCTTGG CTATATCACA GCGGCGAGAG AAGCAGGAGA ACTCGATTGG
CAGACGATTG GCGGAACGTG GGGGGTCGCG ACGAGTCGGC TTGCCGAATG GCGATTAGCG
AGCGACACCT TTCTGCCAAG AGACCTGCAC GACACCATCA AACGGATACG TTCCGCAGAA
CACAAGCACA GCTTCTTAAT GGCTCAGTAC GTTGCAAAAT ACTTCGAGGA TATGTGGATG
CACGTAAAGG CGGTGAAGCG TTGGGTTCAA CCCGGCGGGC ATCTGTACTA CATCATCGGT
AATGCTAAGT TCTACGACGT AGTTGTTCCG GTCGAACGAG TATTGGCCGA TATGATGCTT
GAGTCAGGGT ACGAACAGGT CTCCATCGAA ACCGTGCGAA AGCGTAACTC GAAGAAGGAG
CTTTTTGAAT ACATTGTATC AGCCCGTAAA CCAAAAGAAG ATGATCGGTA A
 
Protein sequence
MRDNTIQQRP EFTFRENHKN GRHGWVRLTP AYSVTLVTDI LNRERGAFRI LDPFAGTGTT 
VLCAAEQGMY GVGIDINPFL VWLGNAKLRR YTLAEIAEFE HTVCRIISAL RSEGPSVPPP
PIHNVARWWS PPALDFLCRL KYEIDTRINK AQAISDLWYI TFCRVLIMIA RVAFNHQSMS
FQDESFRQGY LFVGEDQYIT ILSEVAHLVS RSAMSNPSGT GTIMLGDSRM LVCLADEERF
DLVITSPPYV NRMSYIRELR PYMYWLGYIT AAREAGELDW QTIGGTWGVA TSRLAEWRLA
SDTFLPRDLH DTIKRIRSAE HKHSFLMAQY VAKYFEDMWM HVKAVKRWVQ PGGHLYYIIG
NAKFYDVVVP VERVLADMML ESGYEQVSIE TVRKRNSKKE LFEYIVSARK PKEDDR