Gene Cagg_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1704 
Symbol 
ID7269410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2080940 
End bp2082004 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID643566546 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002463041 
Protein GI219848608 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCGTG CTACCTTCAT TATCTTACCC ATCATACTCG CCGCATGTAG TGGTGGATCA 
TTCGTCTCGC CACCGGTTAC CCCAACTGCC ACCAAGCTGG TGATCGCCGG CTGGGCCGGA
TATGTGCCGC AAACGATCCT CGATGCGTTT AGCGCCGAAA CAGGTATTGC TGTTGAGTAT
GTGATCTACG AAGAACAATC CGCAGCTATT GCCCAACTTC GCGCCGGGGC CGATTACGAT
CTCGTCGTTA TGGGAAGCTC TTTCGTACCG CGGTTGATCG GCGATGGCTT GCTCGCACCG
CTCGATTACG GCCAAATTCC AAACCATCGC AATATTAGCA TCAATTTTCA CGATCTGAGT
TATGATCCGG GTAACCGCTA TTCCGTCGTG TACCAGTGGG GTGTCGGTGG GTTGATCGTT
CGCCCTGACC TGCTCGACCG GCCGATCACA CGTTGGGCCG ATCTGTGGGA TCCAGCGCTA
GCCGGTAAAA TTGCGATGTG GGTGACGGAA GAAGACCTGT TTGCTATTAC CCTGAAAGCG
ATGGGTCAGC CAGTGAACAC AACCGATCGC AGTGTTCTTG CGGAGGCAGC CGAGCGCATC
AGCACGCTTT TACCTAACAT CGTTGCGCTC GATCCGATAA TGCCGAACGC TGCCGATCTC
CTTGCCAACG GCACCTATCC AATCGTATAT GGATGGTCGT TTGACGCGAT AGCCGGTCAT
GCTTTGAATC CGGCGGTAGC GTTCGTTTTT CCTGAAGAAG GGCCGATTTT CTGGATCGAC
ACCTTGATCG TGCCTAAGGC TAGTACACGC CAAGCAGCCG CCTTTCAGTT TATCAATTTT
GTCTTGCGTC CAGAGATAAG TGCGCAAATT ACCAATGAAA TCTACGTCGC AACAGCCAAT
GAACGAGCAA TGTCGTTGAT CGATCCAGCC TTGCGAGACC ATCCGTGGAT CTTTCCTGGG
CGTATAATGT TGAAAGCGGA GTATTTGAGT GAACCGCCGG TGGACATCAA GGCATACCGC
CACCAGCTTT GGGAACAAAT CGCAACCACA CAACGTGTGA GGTGA
 
Protein sequence
MYRATFIILP IILAACSGGS FVSPPVTPTA TKLVIAGWAG YVPQTILDAF SAETGIAVEY 
VIYEEQSAAI AQLRAGADYD LVVMGSSFVP RLIGDGLLAP LDYGQIPNHR NISINFHDLS
YDPGNRYSVV YQWGVGGLIV RPDLLDRPIT RWADLWDPAL AGKIAMWVTE EDLFAITLKA
MGQPVNTTDR SVLAEAAERI STLLPNIVAL DPIMPNAADL LANGTYPIVY GWSFDAIAGH
ALNPAVAFVF PEEGPIFWID TLIVPKASTR QAAAFQFINF VLRPEISAQI TNEIYVATAN
ERAMSLIDPA LRDHPWIFPG RIMLKAEYLS EPPVDIKAYR HQLWEQIATT QRVR