Gene Cagg_0926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0926 
Symbol 
ID7267999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1155037 
End bp1156362 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID643565774 
Productprotein of unknown function DUF21 
Protein accessionYP_002462280 
Protein GI219847847 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.731568 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAT TATTAATCAT TGTTGCGCTG GCATTGGCGA ATGGAATGTT TGCCGCCACC 
GAGCTGGCAG TTGTTTCTGC CCGCCGAGGC CGGCTCGAAC AGCGCGCTGA AGAGGGAAGT
CGCGGGGCGG CGGTTGCCCT CCAATTGCAA GAAGATCCCG ATCGCTTTTT AGCCGCAGTG
CAGATCGGTA TTACCCTGAT CGGAACACTG AACGGTGTCT TTGCCGGCGC AACCCTGACC
GGTCAACTCG CACCATGGCT AGCCCGCAAT GAGTGGCTAC GACCGTATGC CGACCAGTTG
GCCCAATTTT TGGTCGTGCT GCTGGTTACG TACCTGTCGC TAGTGTTGGG CGAGTTGGTA
CCGAAGCGCA TCGCTTTGCA AAGCGCCGAG ACTATTGCGA CGCTGATGGC TCGGCCAATG
TTAGGGCTGG CGCGGATCAG TACACCGTTC ATTGCGTTAC TCAGTGCTTC CACTCGTTTG
ATTCTTACCC TGATCGGGCG TGCGAATGTC GAGGAAGAGC GGGTCACCGA AGAAGATATT
CGGGCGCTCG TTCGGGAAGG TGCCGAGACC GGTGAGGTCG AACCGCAAGA GCAGCAATTC
ATCGATCGTG TCTTTAGGTT CAGCGACCGG GCAGTGCGCC ACATTATGAC CCCGCGCCAT
GAAGTTGAGA TGGTAGAAGC CAACCGCACG CTCGGAGAAG TGATCGATGA GTTGTTGGCG
AGTGGCTACT CGCGCTTTCC GGTGTATGAA GAGACACCAG ATCAGATTGT CGGGATTGTC
CATGTGCGTG ATTTGCTCCT ACTCTACCGA AAAAAGGGGG AGCAAGCGTT AGTACGGGAA
GCCGTCTCGC CACCGCTCTA CGTACCGGAA AATAGTCGGG CATCGGCGCT GCTGACCACA
TTTCGTCGCA GCCGTCGCCA TATGGCGTTG GTGGTGGGTG AGCTAGGTGG GATCGAGGGT
GTCGTGACGC TAGAAGATGT ATTGGAAGAG ATCGTGGGCG AGATTGATGA CGAATACGAC
GATGCTACTC CACCACCAAT CGTTCGTCGC GAAGATGGTT CATACCTCGT TGAAGGTTCA
TTACCGGTTG ATGAGGTACG CGCGTTGCTT GAAGTCGATG AGCTACCCGA CGAAGACACA
TTTCGTTACG AGACGTTGGC CGGGCTGGTG ATCAGTCTGA TCGGTCATAT CCCAACTGCC
GGTGATGTCG TGCGGTGGAG CGGATGGCGG TTTGAAGTGG TCGATATGGA CGGGTTGCGC
GTTGATAAAG TGTTGATCGC GCGCGATTCA ACCACGAATC ATCCATCGAC ATCCCCTTCG
CGGTAG
 
Protein sequence
MQELLIIVAL ALANGMFAAT ELAVVSARRG RLEQRAEEGS RGAAVALQLQ EDPDRFLAAV 
QIGITLIGTL NGVFAGATLT GQLAPWLARN EWLRPYADQL AQFLVVLLVT YLSLVLGELV
PKRIALQSAE TIATLMARPM LGLARISTPF IALLSASTRL ILTLIGRANV EEERVTEEDI
RALVREGAET GEVEPQEQQF IDRVFRFSDR AVRHIMTPRH EVEMVEANRT LGEVIDELLA
SGYSRFPVYE ETPDQIVGIV HVRDLLLLYR KKGEQALVRE AVSPPLYVPE NSRASALLTT
FRRSRRHMAL VVGELGGIEG VVTLEDVLEE IVGEIDDEYD DATPPPIVRR EDGSYLVEGS
LPVDEVRALL EVDELPDEDT FRYETLAGLV ISLIGHIPTA GDVVRWSGWR FEVVDMDGLR
VDKVLIARDS TTNHPSTSPS R