Gene Cagg_0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0338 
Symbol 
ID7268439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp420647 
End bp421816 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content59% 
IMG OID643565206 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002461720 
Protein GI219847287 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01096] lysine-arginine-ornithine-binding periplasmic protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000810551 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.021176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAC AGATCGTCTT CGTGCTCGCA GGGTTACTCA GTTTCATCTT AGCCGCGTGT 
GGCGGTGGTG GTCAGCAGGC CCCCACCACG GCACCCGCCC AACCGGCTAC AACCGCCCCT
GAGCAGCCAA CTACCGCGCC GGCTGCTCCG GCACCGACAG CAGCTCCGGC TGCAACGTCG
GCGCCGGCCG GTGGTACCAC CAACCAGCCG GCAAGTGGTG GTCTGATCCA GCAGATTTTG
GCCCGTGGCC GGTTGATCTG CGGTGTCAAC AACAACCCGC TACCCGGTTT TGCCTCGGTT
GATGCGTCCG GCGTCTATTC CGGCTTTGAC ATCGACTTCT GTCGGGCAGT GGCGGCTGCG
CTGTTTGATG ACCCCACGAA AGTTGACTTC CGTCCGCTCA GCGCTCAAGA GCGATTCACC
GCTCTGCAAA CCGGTGAAAT CGATGTGCTG ATCCGCAATA GCACGTGGAC GCTGGGCCGC
GATGGTAACC TGGGTCTCGA TTGGGCACCG ACGACTTTCT ACGACGGCCA AGGCATGATG
GTACGCAAGG ACAGCGGAAT CAACACGCTG GAAGATATGG ACGGCGCAAC CATCTGTGTG
CAGACCGGTA CCACGACCGA GTTGAACCTG GCCGACCAAT TCCGCGCCCG TGGTCTGACC
TTCACGCCGG TCGTCTTCCC CGACGGTGAC TCGACACGCG CCGCCTACGA CGCCGGTCAG
TGTGATGGCT TCACCACCGA CAAATCGGGG TTGATCTCGA GCTTAACCCT GCTCTCCAAC
CCGGCCGACC ACAAGATTCT CGAAGTCACG ATGTCGAAAG AGCCACTTGG GCCGGCAGTT
AAGCAGGGTG ATCCGCAATG GTTTGATGCA GTACGCTGGA TTGTCTTCGC TACCTTCCAA
GCCGAAGAGT ACGGAATTAC GTCGCAGAAC GTGAACGATT TCTTGAACAG TGACGTTCCT
GAGATCCGCC GCTTCCTCGG CATCGAAGGT GATCTGGCGG CCGGTATCGG GTTGCCCAAT
GATTTCGCAG TACGCATCAT CCGTCACGTC GGTAACTATG CGGAGATTTA TAACCGCAAC
CTTGGCCCCG ATACCCCCTT CAACTTGCCT CGTGGTTTGA ATGCGCTGTA TACGGACGGT
GGTCTGCTCT ACTCACCACC CTTCCGCTAA
 
Protein sequence
MRKQIVFVLA GLLSFILAAC GGGGQQAPTT APAQPATTAP EQPTTAPAAP APTAAPAATS 
APAGGTTNQP ASGGLIQQIL ARGRLICGVN NNPLPGFASV DASGVYSGFD IDFCRAVAAA
LFDDPTKVDF RPLSAQERFT ALQTGEIDVL IRNSTWTLGR DGNLGLDWAP TTFYDGQGMM
VRKDSGINTL EDMDGATICV QTGTTTELNL ADQFRARGLT FTPVVFPDGD STRAAYDAGQ
CDGFTTDKSG LISSLTLLSN PADHKILEVT MSKEPLGPAV KQGDPQWFDA VRWIVFATFQ
AEEYGITSQN VNDFLNSDVP EIRRFLGIEG DLAAGIGLPN DFAVRIIRHV GNYAEIYNRN
LGPDTPFNLP RGLNALYTDG GLLYSPPFR