Gene Cagg_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1703 
Symbol 
ID7269409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2079740 
End bp2080936 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content50% 
IMG OID643566545 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002463040 
Protein GI219848607 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACTG CCGAAGAACA TCTTTGCAAG GCCACCATAC TTCTTCCAGC TTCCAAGGTT 
GCCTACGGAA ATATCCTGGT ACCATGGATA TGGTACGCTT TGTTATGTCT AAATAACGGT
GGTAAAAACA AGGTAAAAAT GCTCAGCCAA CTCTTTGTAA CCATCTGTCT CATCTGCCTA
ACAGCGTGTA CTTTCATACC GGCGAGTAAT CCTTCGCCTC AGCCACCGAC ACTCACCATC
ACCGGCTGGG CCGGTTATAT GCCGCCCGCA CTACTCGACG CTTTTAAAAA CGAAACCGGT
ATTGAAGTTA CATACATTGG CTATGACAAT ACTGAAGAAG CAATTTCTCA GTTGCAACAA
GGGGCACAGT ACGATTTATT GATAGTAAGC TATCAGTTCA TCCCCGAATT GATACACAAC
GGCACTCTTG CGCCGATTAA TCGAGCAAAT ATTCCCAATA TTCGCAACCT CAGTGCAACC
TTTCGCGATT TGGCGTTCGA TCCCGGTGAA CGGTATTCGG TGATTTACCA GTGGGGCATT
ACCGGCTTGC TCGTGCGCAC CGATCTGCTC GCCCGCCCAA TCAAGCGTTG GTCTGATCTC
TGGAATCCGA CACTTGACGG TAAAATCTTA CTGTGGCCGA TCCCACGTGA TACGATCAAC
GTGCTGCTCA AATCGCTCGG CTACTCGATT AACACCACAG ACACTGATCA GCTCGCTACC
GCCCTGGCCC GCGCACCCGC ACTGGCGCAA CGAGTTGGTT GGGTAGACAG CGGGGTAGCG
ACGGCCACCC AGTACCTTGT ATCCGGTGAA TACGCCGTCG CCCTCGCTTG GGCGTATGAT
GCACGCGACG CTCAACAACA GGATGAACGT ATCACGTTTA TCATTCCTGA AGATGGTACC
GTCATTTGGC TGGACAGTTT CGTCATTCCT ACGACTAGTA CCCACCCAGA ACTCGCCGAA
CAGTTTATCA ATTTTTTCTT ACGACCAGAC ATGAGCGCAC TCGTGACGAA TGAGTTGGTC
ATAGCTACGG CAAATGAAGC GTCATGGCCG TTGGTCAAGC CGGAGCTACT GGCTAATACC
AGCATCTTCC CAAGCAATGA CATCCTCGAG CGCGCAGAAT TGGAGGTACC ACTTGACCCG
GCAACCCAAA AACTTCATCA TGAAATTTGG CGTGTATTCA CCTTGACGAG ATCGTAA
 
Protein sequence
MPTAEEHLCK ATILLPASKV AYGNILVPWI WYALLCLNNG GKNKVKMLSQ LFVTICLICL 
TACTFIPASN PSPQPPTLTI TGWAGYMPPA LLDAFKNETG IEVTYIGYDN TEEAISQLQQ
GAQYDLLIVS YQFIPELIHN GTLAPINRAN IPNIRNLSAT FRDLAFDPGE RYSVIYQWGI
TGLLVRTDLL ARPIKRWSDL WNPTLDGKIL LWPIPRDTIN VLLKSLGYSI NTTDTDQLAT
ALARAPALAQ RVGWVDSGVA TATQYLVSGE YAVALAWAYD ARDAQQQDER ITFIIPEDGT
VIWLDSFVIP TTSTHPELAE QFINFFLRPD MSALVTNELV IATANEASWP LVKPELLANT
SIFPSNDILE RAELEVPLDP ATQKLHHEIW RVFTLTRS