Gene Cagg_3626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3626 
Symbol 
ID7269770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4405902 
End bp4406852 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content55% 
IMG OID643568433 
Productprotein of unknown function DUF124 
Protein accessionYP_002464899 
Protein GI219850466 
COG category[S] Function unknown 
COG ID[COG2013] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00266] conserved hypothetical protein TIGR00266 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.56616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGTC CGAATTGTGG TGCATCAGTT ACCGCCGGTG CGCGCTTTTG TACCAATTGT 
GGCTTTCGCC TATCGACGCC GGTGCAAAGC GCACCACCGC CGCTCGTCCC GCCATCGGGT
GAAGCCAGCA GTATGGCCGA CGTCTACGAT AATCGTCCGG GTGAACGACT CGATCTCCCC
GAACCGCCGG TTGTCGGTTC GGGCGTGGGA GCTAGCGGTC TCCGCTTTAA GATCATCGGA
ACAACCATGC AGGCGGTGGT GCTTGAGGTA CCACCTGGTC AGACGGTCTT TTCCGAGCGC
GGTGGGATGA GCTGGATGAG CGCCAATGTC CAGATGCAGA CCAATATGGA AGGCGGTCTC
GGTGGCGCGT TTAAGCGCAT GTTCTCCGGC GAGTCGATCT TTATGGTCAA CTTTACACCA
CAAGGCGGAC CAGGAATCAT CGGCTTTTCG GCAGAGTTTC CGGGCAAGAT CGTACCGCTC
AACCTTGCAC CGGGGCAGGT CATGATCTGC CAGAAAGATG CCTTTATGTG CGCCGAGCGT
AGCGTTTCGC TCGACATTCA CTTCCGACGT AGGCTCGGTG CTGGTTTGTT TGGTGGTGAA
GGCTTTATCA TGCAGAAATT GACCGGGCCG GGACTAGCGT TTGTCGAGCT TGATGGAGAG
ATTATCGAAT ACACGCTCGA AGCCAATCAG ATGCTGAAAG TCGATACCGG CCATGTCGCA
ATGTACGAGC CAACGGTGCA ACTCGACATC GAGATGGTGC GTGGGTTTAA GAACATTCTG
TTCGGTGGTG AAGGACTGTT CTTGACAACC CTCCGTGGGC CAGGGCGAGT CTGGTTGCAG
ACGATGCCGG CGATGAATTT AGCGAAGAAG ATCGCCCAAT ACTTGCCAAC ATCGAGTAGT
TCGAGCAGTG GGGGTGGTAT TAACTTGGGA AGCCTATTTA CCAACGATTA G
 
Protein sequence
MNCPNCGASV TAGARFCTNC GFRLSTPVQS APPPLVPPSG EASSMADVYD NRPGERLDLP 
EPPVVGSGVG ASGLRFKIIG TTMQAVVLEV PPGQTVFSER GGMSWMSANV QMQTNMEGGL
GGAFKRMFSG ESIFMVNFTP QGGPGIIGFS AEFPGKIVPL NLAPGQVMIC QKDAFMCAER
SVSLDIHFRR RLGAGLFGGE GFIMQKLTGP GLAFVELDGE IIEYTLEANQ MLKVDTGHVA
MYEPTVQLDI EMVRGFKNIL FGGEGLFLTT LRGPGRVWLQ TMPAMNLAKK IAQYLPTSSS
SSSGGGINLG SLFTND