Gene Cagg_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0749 
Symbol 
ID7268068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp926213 
End bp927466 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content54% 
IMG OID643565600 
Productprotein of unknown function UPF0118 
Protein accessionYP_002462109 
Protein GI219847676 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.104647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCT TCACCCCAAC TCAAATCCGT CGGGTTGCGC GTTGGTTGCT TGTTGCGGTG 
TCAATTTATC TTGTGGGCTG GCTGATTAGC CATACCGGTT CGGCGATCAC GCCGTTTGTG
TTTGGCGGTG TGCTCGCCTA TTTATTTTTA CCGTTAGTCA ATTTCTTCGA GCGCTGGATG
CCGCGTTGGT TGGCAATTCT GGTCGTGTAT CTGCTGACCT TCGGCGCATT GGTGGCTGCG
ATTGCGTTTG TCATTCCACC GCTGATTGCT CAGATCGTCG AACTGATCCG TACCCTACCT
GATATTGCCA CGATCCAACG CGAGGCTAAT CGGTTGCTCG ATGAGTATGA GCAGTTGCTT
GCCAGCTTGC CACCTGCTAT ACAGTCTGAG GTGCAAAGCG CGATTGCGTC GGCAGCTTCA
GAGGGGTTGA GTACCCTGCG GGCTAATTTC GTTAGCTATT TGCAAGGGAT CGGCCAGTTT
CTGATTACGA GTGTTTTGTC GGTTGTTAAT ACGGTCACCT TCCTGTTGGG TTTCTTTCTG
GTGCCATTTT GGCTCTTCTA CGTGCTGATG GATCAGCGTG CCGGACGCGA TTATCTTAAT
CGCTTGATCC ATCCCCGCTT ACGGGCCGAT TTTTGGGCAA TGGTATCAAT TATCGATTAC
GACCTGAGCG GTTATCTGCG CGGTCAGTTG ATTCTGGGTA CGTCCGTTGG CTTAGCCGCG
TGGATCGGCC TCACGGCACT GAATATGGCG GGGATGAAGG TGCCATATAC GGTACTGTTA
GCGGTTGTGG CCGGTGTTAC CGAGGTGGTA CCGGTGATCG GACCGATTAT TGGTGCCATC
CCGGCAATCT TGTTGGGTCT AGCCGATTCG CCGACGACTG CGCTGGCCGT TACTATTCTC
TACATTGCTA TCCAGCAGCT CGAGAATCAT ATCCTCGTGC CACGCATTAT CGGCGAAAGC
GTGGGAGTCC ATCCGGCGAT TCTCATGGTT GTGCTGGTCG TGTGTTCGCA GGTTTTTGGT
TTGTTGGGAG CGATCCTTTC GGCGCCACTG AGTGCAATGG CCCGCGATCT GTTTCTCTAT
CTCTACGGGC GTTTGAGTGA TCCGCCCCGT CCGGCAGGTG TTCTGCCCGA ACGGTTGCGT
CCGATAGCAG CTCTTACCGA AGTAGTGGCC CAATCGACGA CCGATCAATC GGCACCGTCT
CCGCCCACCT CGGAAGACGT TCCCCGAACG CGCCCACTTG ATGAACCTCG ATGA
 
Protein sequence
MTIFTPTQIR RVARWLLVAV SIYLVGWLIS HTGSAITPFV FGGVLAYLFL PLVNFFERWM 
PRWLAILVVY LLTFGALVAA IAFVIPPLIA QIVELIRTLP DIATIQREAN RLLDEYEQLL
ASLPPAIQSE VQSAIASAAS EGLSTLRANF VSYLQGIGQF LITSVLSVVN TVTFLLGFFL
VPFWLFYVLM DQRAGRDYLN RLIHPRLRAD FWAMVSIIDY DLSGYLRGQL ILGTSVGLAA
WIGLTALNMA GMKVPYTVLL AVVAGVTEVV PVIGPIIGAI PAILLGLADS PTTALAVTIL
YIAIQQLENH ILVPRIIGES VGVHPAILMV VLVVCSQVFG LLGAILSAPL SAMARDLFLY
LYGRLSDPPR PAGVLPERLR PIAALTEVVA QSTTDQSAPS PPTSEDVPRT RPLDEPR