Gene Cagg_3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3470 
Symbol 
ID7269695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4228291 
End bp4229484 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content54% 
IMG OID643568278 
Producthypothetical protein 
Protein accessionYP_002464746 
Protein GI219850313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.250629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG ATTTCTTTAC CCAGCTTCCC TTGATTACTC GCTTTCGCGA GATTACCGAT 
TTGAACGCCT ACCGGCCATT ACCCGCAGAT TGGACGATTT TTGTTAGTGA TGTCCGTGGG
TCGACACGTG CCGTCGCCGA GGGTCGTTAT AAAGAAGTTA ATATGGTTGG TGCAGCGACG
ATTACCGCAG CGTTGAATGT AGCCGACGAG ATAGAAATAC CGTTTGTGTT CGGTGGTGAT
GGGGCGGTTC TGGCAGTGCC GCCACCATTG GCTGAACCGA CGAAACAAGC CTTGAGTGCG
GTAAGTGGGC TGGCCCGTGA GGCTTTCAAC CTTGAGTTGC GGGTTGGCGC CGTACCGGTG
CAGACGATTT TAGATGGCGG GTATCAGGTG TTTGTCGCAC GTTTAGCACT CAATATGCAG
GTGGCACAGG CGGTATTTAG CGGTGGCGGC ATCCGCTACG CTGAACAATT GGTCAAAGAT
GCCGTCACGG GTGCAGACTT CAACATTGCA CCGACTGATC CCGCTGCGGC CAATCTGAGC
GGCCTCGAAT GTCGGTGGGA TACTATCCAA CCTGCGCATG GTACGGCGCT CTGCGTGATC
GTGCAGACAC CACCACAACC TGATCCAGCG ATAACGATGG CGATCTACCG TGACGTGATC
GATGAGATTG AACAAATCTA CGGTGGTGAT CAAGCGTACC ATCCCCTGCA CTACGACTTG
ATGCAGATAA GCACGAGACC GCAGGCTCTT TGGGCAGAGG CGCGGTTGCG TGGCGGCGAG
AGTCGCTTAT CGCAATTGGC CTATTTGATG CAAGTCTACG CGCTCAATCT CGGTGTGTAC
GGGTATCGGT GGCTGCAACA GTTACGAGGT GAGAATCCGT GGTGGGATCA GTATCGCAAA
CATGTGGTCA CCGCTGCCGA TTATCGCAAG TACGATGATG TGTTGCGTAT GATCATTGCC
GGTACCGACG CACAACACGA AGCATTAATC ACCCACCTGA CAGCCCGTTT TGCTGCGGGT
GAGTTGATCT TTGGGGTTCA TCGCTCGCCT GAGGTTATGC TGACGTGTCT GGTGTTCGAG
CGGATGGAGC GGCAAATTCA TTTTGTCGAT GGTGCTGATG GCGGTTTTAC CCTTGCTGCT
CAAGATTTGA AACAGCGTCA GCAGCAGTAC ACGTTCGTTA ACAGCCGGGA ATAA
 
Protein sequence
MTADFFTQLP LITRFREITD LNAYRPLPAD WTIFVSDVRG STRAVAEGRY KEVNMVGAAT 
ITAALNVADE IEIPFVFGGD GAVLAVPPPL AEPTKQALSA VSGLAREAFN LELRVGAVPV
QTILDGGYQV FVARLALNMQ VAQAVFSGGG IRYAEQLVKD AVTGADFNIA PTDPAAANLS
GLECRWDTIQ PAHGTALCVI VQTPPQPDPA ITMAIYRDVI DEIEQIYGGD QAYHPLHYDL
MQISTRPQAL WAEARLRGGE SRLSQLAYLM QVYALNLGVY GYRWLQQLRG ENPWWDQYRK
HVVTAADYRK YDDVLRMIIA GTDAQHEALI THLTARFAAG ELIFGVHRSP EVMLTCLVFE
RMERQIHFVD GADGGFTLAA QDLKQRQQQY TFVNSRE