Gene Cagg_3705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3705 
Symbol 
ID7268241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4502120 
End bp4503922 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content56% 
IMG OID643568512 
Productprotein of unknown function DUF88 
Protein accessionYP_002464977 
Protein GI219850544 
COG category[S] Function unknown 
COG ID[COG1432] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00288] conserved hypothetical protein TIGR00288 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000821311 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000382798 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACGACC AAAAGCGACC TGATGTCGCG GTCTTCATCG ACTTCGAGAA CATCTATGTG 
AGCGTGCGCG ATAAGCTCAA CGCCACACCG AACTTCGAAG CGATTATGGA CCGCTGCAAC
GATCTCGGCC GCGTTGTGAT TTCACGCGCC TACGCCGATT GGTATCGTTA CCCACGTATT
ACGAGCGCAC TGTACGCCAA CGCTATCGAA CCGATTTACG TAGCAACGTA CTACTACGAC
AAAGACGCCG GACGCACCGG ACGTGCTATC AAGAATAGCG TCGATATGAA CCTATGTATC
GACGCTATGA AGACCCTCTA CACCAACCCG AATATTTCAC GCTTTGTCCT TGTTACCGGC
GACCGTGACT TTATTCCGCT CGTCCACAGC ATTCGCCAAC ACGGCAAAGA AGTGTATATC
ATCGGGATCG GTGGAGCGGC TAGTACGCAC CTTGCACAAA GCGCCGACGA ATTTGTTTTC
TACGAGCAAC TGATCGGGCG TCAGCCGAAT GCGAGCGCCG CCGCCACGGC GATTGCTAAT
CGTGCTACTG AGCTAAACCG ACTACCCGAA CCGATAGAAG AGCCGATTGC GCCGGTTGCT
CCCCCACCAC CCCCTCCCCC AGAACCAGAC ATCTACGATG TGCTGGTGCA GGCGATTCAT
CTTGCCCGTA AGCGCGGATA CGTCACGACC CTTGGCTCGC TCAAGATGCT GATGAAAGAG
CTTATGGGGG GCGATTTCAA AGAAAGCCGC TACCGCGATC TGAATGGACG CCCATTCACG
AAGTTTAAGG ATCTGGTGCT CGATGCTGAG CAACGTGGTA AAGTGCAAAT CTTTACCAAA
GGTTCGGTCA ACGAGGTCTT CTTGCCGGGC GAGGATCCGA TGAAGCTCTC GCGGTTTGCG
CCGCTACTGA CCGAAGAACC ACCACCTGAG CCGTTGGTGC TTGATCCGCC AATTGGTAGC
AACGGTGCGG TGCAGCTTGC AGAGATTGAA CCGGTGATTA TTGAAGAGGT GAAAACAACG
GTTGCTGAGA TACCCGCGGC AACCCCATCA AGCAGTAATC GCCGTCGGCG CCGCCGTTCA
CGACGCAATA ATCGCAACCG CGAAGAGGTC ACCGCGAATG GCCTTGGAAC GGAGCCTGCC
ATCACCGACG ATCAGGAAGA TCGCGCGATT GATGAACTCG ATTACACCCC ACCACAGTTA
GAAGAACCGC CAGTGGCAAC GGTGGCAATA GCTAGCAGTG ACGAAGGCGC CGTGATAGAG
ACGGTCGTGC CGTCTGAGCA TCTGCCGCCC TCGCTTGCCG ATGCTGCCGA ACCGGTAGCG
GTCGTTGTCG AGAGCGAGCC AACGCCTAAG ACCGGCCGTA GCCGTCGCCG CCGTTCACGT
AAGGCGACAG CAGAATCAAC ACCGACCGGT GAATCGTCGC TGCCTGAGAC AACCGTTGCC
GAACCATCGG TTGCCGAGCC GGTCGTGGCT GCTACCCCAC CCGCCGAGCC GGTCGTGGCC
GCTACCCCAC CCGCCGAGCC AACACCATCG ACCACGACCA ATGGTAGCAG TGATCACGTC
GATCTGACGC TGGAATTCAG CACAGAAGAG TGGCAATTAT TTCGCACTAC CATCCGCGAA
CTCGGTAAAC CGGCTACGTT TCAGCAACTG CTGAGTGCGT TACAAACAGC ACGTCGCAAG
CATGGCTTGC CGCGGACAAT GGAGGAGAAC CGCACAATGC TCAAGCAAGC CATTCATCAC
GGTCTGCTTG AGCGCACCAC TCGGAACCGC TACGTCTACT ACACGTTGAA AGAACTTGAG
TAG
 
Protein sequence
MYDQKRPDVA VFIDFENIYV SVRDKLNATP NFEAIMDRCN DLGRVVISRA YADWYRYPRI 
TSALYANAIE PIYVATYYYD KDAGRTGRAI KNSVDMNLCI DAMKTLYTNP NISRFVLVTG
DRDFIPLVHS IRQHGKEVYI IGIGGAASTH LAQSADEFVF YEQLIGRQPN ASAAATAIAN
RATELNRLPE PIEEPIAPVA PPPPPPPEPD IYDVLVQAIH LARKRGYVTT LGSLKMLMKE
LMGGDFKESR YRDLNGRPFT KFKDLVLDAE QRGKVQIFTK GSVNEVFLPG EDPMKLSRFA
PLLTEEPPPE PLVLDPPIGS NGAVQLAEIE PVIIEEVKTT VAEIPAATPS SSNRRRRRRS
RRNNRNREEV TANGLGTEPA ITDDQEDRAI DELDYTPPQL EEPPVATVAI ASSDEGAVIE
TVVPSEHLPP SLADAAEPVA VVVESEPTPK TGRSRRRRSR KATAESTPTG ESSLPETTVA
EPSVAEPVVA ATPPAEPVVA ATPPAEPTPS TTTNGSSDHV DLTLEFSTEE WQLFRTTIRE
LGKPATFQQL LSALQTARRK HGLPRTMEEN RTMLKQAIHH GLLERTTRNR YVYYTLKELE