Gene Cagg_0111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0111 
Symbol 
ID7266849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp155528 
End bp156916 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content56% 
IMG OID643564983 
Productprotein of unknown function DUF407 
Protein accessionYP_002461499 
Protein GI219847066 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.500585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.341052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTTG TTCAAGCAAT CGCCGACTAC CATGCCCTAC TCGATCCGAA ACTGGCGGCT 
GCCTCGTGGC AATGTTTGAC CGAGGAGATG CGGGCAAGAC GTTTGTATTT TGGTGAGCGT
CCGCTGGCTA CCGTCCTCCG TCCGCGTCTC ATGACGATGA CGCAGTATGA GCTGTTACGA
CACGGCACGA AGCAGGTTGC CGAAGCCGCC CGCCTGATCG TCGCAGCGGC GTTGGCCGAT
GATGAAGTCG GGCGGGCTGT GCGTGAGGTG CTGATGCTGA CCCCGCTTGA AGAACGGTTG
ATCGCGATGC ACCCCGGCTA CCTCGAGCCG AGCGCACATT CACGAATGGA TACGTTCTTG
ACCGTTGATG GTTCGTCGTT GCAGTTTGTT GAATATAACG CCGAAAGTCC GGCGGCGATT
GCTTACGAAG ATCTGTTGGC ACAGGCCTTT TTGGCGATGC CGGTGATGCA AGAGTTTATC
AAGCGCTACC CGCTGCTGCC GTTACCGGCA CGTCAATATA TGTTGCGTAC TCTGCTCAAT
TGTTGGCGGA CTGCCGGTAG CCCCGGCCAC GAGCCGCGAG TGGCGATTGT AGATTGGTAT
GGAGTGCCGA CGGCTACCGA GTTTGAGATG TTCCGACAGT ATTTTAGTGA GCACGGCTTG
CCCACCGTTA TCTGTTCCCC GCACGATCTC GTCTTCCGCG ACGGACAATT AATTGCCAAG
ACTGCTGACG GTGGCGAGAT GCCGGTAACG ATTGTCTTCA AGCGTGTGCT GACCAGCGAA
TTCTTGAGTC ATTATGGTGA TGATGCGCTC TCGCATCCGC TGGTGCAGGC TTATGCTGCC
GGTGCGTGTG TCATTGTGAA CTCGTTCCGG GCCAAGCTAC TCCACAAAAA GTCTCTCTTT
GCACTTCTCT CTGATGAACG GTTCCACGAA CCGCTCAGCG CCGAGCAACG GTCAGCAGTA
GCGGCCCACG TCCCATGGAC ACGAGTGGTA CGGCCCGGGC AGACAACGTA TCAGGGAGAA
ACGATTGACT TACTGGCGTT TGCCCGCGCT AACCGCGAGC GATTGGTGTT AAAACCGAAC
GATGAATACG GCGGCAAGGG AATTACTATT GGCTGGGAAG TCAGTGCCGA TGCGTGGGAT
GCTGCTCTCC AAGCTGCGCT CGAGACGCCG TTTGTCGTGC AAGAACGAGT GACGATTGCC
TATGAGCCGT ATCCGGCGTT TGTCGATGGG CAAGTCGTGA TCGCCGATCG ATTGGTTGAT
AGCGATCCCT ATTTGTTTGG CACCACCGTG CATAGCTGCT TGTGTCGCCT CTCTACGGTC
ACGTTACTCA ACGTGACGGC GGGTGGTGGA AGTACCGTGC CGGTGTTTGT GATTGAAGAT
CGACCATAA
 
Protein sequence
MTLVQAIADY HALLDPKLAA ASWQCLTEEM RARRLYFGER PLATVLRPRL MTMTQYELLR 
HGTKQVAEAA RLIVAAALAD DEVGRAVREV LMLTPLEERL IAMHPGYLEP SAHSRMDTFL
TVDGSSLQFV EYNAESPAAI AYEDLLAQAF LAMPVMQEFI KRYPLLPLPA RQYMLRTLLN
CWRTAGSPGH EPRVAIVDWY GVPTATEFEM FRQYFSEHGL PTVICSPHDL VFRDGQLIAK
TADGGEMPVT IVFKRVLTSE FLSHYGDDAL SHPLVQAYAA GACVIVNSFR AKLLHKKSLF
ALLSDERFHE PLSAEQRSAV AAHVPWTRVV RPGQTTYQGE TIDLLAFARA NRERLVLKPN
DEYGGKGITI GWEVSADAWD AALQAALETP FVVQERVTIA YEPYPAFVDG QVVIADRLVD
SDPYLFGTTV HSCLCRLSTV TLLNVTAGGG STVPVFVIED RP