Gene Cagg_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1201 
Symbol 
ID7267950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1480010 
End bp1481857 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content52% 
IMG OID643566044 
Productamino acid permease-associated region 
Protein accessionYP_002462546 
Protein GI219848113 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0215506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCC AGCTCAAGCG GATTCTCGTC GGTCGTCCAA TTGCTACCGA ACACCAGCAT 
CAAGAACGAC TAAACAAGGT GACGGCGTTA GCCGTCTTTT CTTCTGATGC GCTCTCGTCA
GTCTCGTATG CGACCGAAGC CATTCTCACA ATTCTCGTCT TGGGTGGCAG TGTAGCACTT
GGTCTTTCAC TACCGATTGC CATCGCTATT GCGATATTGC TGCTGATTGT GGGCTTCTCA
TATCGCCAAA CCATTCACGC CTATCCACAA GGTGGTGGCA GCTATATTGT GACCCGTGAT
AATTTGGGCG ATTTGCCGGG ATTAATCGCA GCAAGCGCAT TACTCATTGA TTATGTGCTG
ACGGTAGCGG TGAGTATTTC TGCCGGAGTG GCTGCAATTA CCTCACTCGC TACCAACTGG
GGCTTCCCAA TTGTGCGCGA TTACGCGGTT GAGATTGCGC TGTTATGTAT TCTCTTAGTC
ACGATTGCCA ATCTGCGGGG GGCTAAAGAG AGCGGATTGA TCTTCTCGGT GCCTACCTAT
GCGTTTATTG CAAGTATCCT TTCGATGATT GTGGTAGGTG TCGGCCAAGA TATGCTGTTT
GGTGCCGAAC CGGTGCGGCA TAGTATTGAT CCAGATATTC CACCGGTCGG TGAGACGTTA
TCGCTTTGGT TGATTTTACG TGCTTTTGCC GCCGGGTGTA CCGCATTAAC CGGGATCGAG
GCCATCAGCG ATGGTGTGCA GGCCTTTAAG CCACCGGAGG CACGTAATGC CGCAATGACG
CTGACATGGA TGATCAGCTT GTTGGTGACG ATGTTTCTCG GGATCACATG GTTGGCCTAT
GTACATCAAG CTGTCCCCAA CGAGTTTACC CACGAAACGA TTGTTTCCCA AATTGCTCGT
ACCATTTTTG GAATAGGGCC GGTCTATGGG TTTATTCAAA TTGCTACGGC CTTGATCTTG
GTGTTAGCTG CCAATACCGC TTTTGCCGAC TTTCCCCGTT TAGCGTCGTT TCTGGCGCGT
GATCGTTTCC TGCCTCGCCA GTTCGCCTCG CGCGGTGATC GGTTGGTCTT TTCAAATGGT
ATTCTGGTGT TGGGGTTGTT TTCGGCATTG CTTGTGGTTA TCTTTCAGGC CAACGAGATA
GCGATGCTTC CGCTGTACGC GGTTGGTGTC TTTACCTCGT TTACGTTTTC GCAGTCGGGT
ATGGTGCGGC GTCACCTACG GTTGCGGCAA CCGGGGTGGG CACGGAGTGC TATCATTAAT
GGGTTTGGGG CGACCCTTAC CGCGATTGTG TTGGTGATCT TGATGATTAC CAAGTTTGTC
CACGGTGCGT GGATGGTAAT CCTCACGATC CCGGTTTTGG TGATGATGTT TCGCGGGATT
AATCTCCATT ACCGACGGGT GGCTGAACAG CTTTCATTGA GTGGCGCAGT GGTGCCACCT
GAACTGCGAC GCCACACGGC GATTGTGTTG ATTAGCGGTA TTCATCGGGG GGTGTTGCCG
GCGTTGCAGT ATGCACGCTC GATCGCGCCC GATAATGTTA CAGCCGTCTA CGTTGATCTC
GATCCTGAAG CAACTGAGAA GTTACGTAAG CGCTGGCAAG ATTGGGGATG CGGCATCCCT
CTGGTAGTGC TTGAGTCGCC GTTCCGCTCG TTGATCAATC CCATCGTGCG CTATATCGAA
GAGGTTGAGA CGCGCTACGG CGATGATGTG ATCACGGTTA TTTTGCCTGA ATTTGTTCCG
GCCCGCTGGT GGGAGCATTT ACTGCATAAT CAGACCGGTA TTTTGATCAA GACGGCATTG
CGCCTACGAG GAACAGTGGT GACAAGCGTG CCGTATCGGT TGCGCTAG
 
Protein sequence
MFAQLKRILV GRPIATEHQH QERLNKVTAL AVFSSDALSS VSYATEAILT ILVLGGSVAL 
GLSLPIAIAI AILLLIVGFS YRQTIHAYPQ GGGSYIVTRD NLGDLPGLIA ASALLIDYVL
TVAVSISAGV AAITSLATNW GFPIVRDYAV EIALLCILLV TIANLRGAKE SGLIFSVPTY
AFIASILSMI VVGVGQDMLF GAEPVRHSID PDIPPVGETL SLWLILRAFA AGCTALTGIE
AISDGVQAFK PPEARNAAMT LTWMISLLVT MFLGITWLAY VHQAVPNEFT HETIVSQIAR
TIFGIGPVYG FIQIATALIL VLAANTAFAD FPRLASFLAR DRFLPRQFAS RGDRLVFSNG
ILVLGLFSAL LVVIFQANEI AMLPLYAVGV FTSFTFSQSG MVRRHLRLRQ PGWARSAIIN
GFGATLTAIV LVILMITKFV HGAWMVILTI PVLVMMFRGI NLHYRRVAEQ LSLSGAVVPP
ELRRHTAIVL ISGIHRGVLP ALQYARSIAP DNVTAVYVDL DPEATEKLRK RWQDWGCGIP
LVVLESPFRS LINPIVRYIE EVETRYGDDV ITVILPEFVP ARWWEHLLHN QTGILIKTAL
RLRGTVVTSV PYRLR