Gene Cagg_2346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2346 
Symbol 
ID7268696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2854467 
End bp2855615 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content51% 
IMG OID643567175 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002463660 
Protein GI219849227 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00216942 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCAGATGA TCCGTGTAAT GCTCGTAGCG CTGTTGCTGG CGCTAACAGC CTGTGGCGGT 
GGCAGTACCG CAACTGGGCC GGGTAATGAG TACGGGGCGA GCGGTGGTGA CACCACAACC
ACCACCGATG GTGTGGATCG CAGCAAGCTG GCAAAGGAAC TCTACTTCTA CAATTGGTCA
GATTATATTG ACCCTGCCAT CCTCGATCAA TTCGAGGCTG AGTACGGCGT GAAGGTGATC
ATCGACACCT ACGATAGTAA CGAAGACATG TTGGCAAAGA TCCGTGCCGG TAATTCGGGC
TACGATCTCG TCGTGCCATC GGATTACGCC GTGCAGATCA TGATTGCCGA AGGAATGGCG
CTGCCGATTG ATAAGTCGCT CGTCCCCAAC ATCGTTCATC TTGATCCAAA CTTGCTCGAT
CAGTATTTCG ATAAGGGGAA TGTCTATTCG CTCCCGTACA TGTACGGTCT GACCGGCATT
GCATACAACA CGAAGTTCTT TCCCAATGGA ATTGATAGTT GGGCAGCCAT CCTTGAACCA
GATCAAGTTG CTGCATTTGC CGGTAAATTC AGCATGCTTG ACGATGCGCG GGAGACACCG
GGTGCAGTAC TGCGTTACAT TGGTCAATCA CTCAATTCGA CCGATCCGAC AGCATTAGCG
CGGGTGAAGG AGATTTTACT GGCTCAAAAG CAGTACTTAG CAGCATACAA CAGCTCGGAT
GTCAACCGCA AACTCGCCAG TGAAGAATAT GTACTGGCGC ATGCGTGGAG TGGCACAGCC
ATTCAAGCCC GCAATGGGTT GGGTGAAGAG TTTAGCGGTA ATCCGAATAT TGCCTTTATC
ATACCGAAAG AGGGTGGCAC TATTTGGATG GACAATTTTG TCATCTTGAA AGACTCACCA
CATGCCTACA CTGCGCACGT GTTCATCAAC TACCTCCTGC GTCCGGAGAT CGCGGCCCAG
AATACCGAAT ACATCGGTTA TCTATCGCCA AACAAGGATG CCCTCGAGTT GCTGTCGGAG
GAGTTGCGCA CGCTCTACGC ACAAGGTTTT GCGCCAAATG AAGAGATGTA TAAGCGGCTA
GAGTGGATTG TGCGTAACGA AGGTACGACG GTTTTCGACG ATCTATGGAC AGAGGTTAAA
GGTCAGTAG
 
Protein sequence
MQMIRVMLVA LLLALTACGG GSTATGPGNE YGASGGDTTT TTDGVDRSKL AKELYFYNWS 
DYIDPAILDQ FEAEYGVKVI IDTYDSNEDM LAKIRAGNSG YDLVVPSDYA VQIMIAEGMA
LPIDKSLVPN IVHLDPNLLD QYFDKGNVYS LPYMYGLTGI AYNTKFFPNG IDSWAAILEP
DQVAAFAGKF SMLDDARETP GAVLRYIGQS LNSTDPTALA RVKEILLAQK QYLAAYNSSD
VNRKLASEEY VLAHAWSGTA IQARNGLGEE FSGNPNIAFI IPKEGGTIWM DNFVILKDSP
HAYTAHVFIN YLLRPEIAAQ NTEYIGYLSP NKDALELLSE ELRTLYAQGF APNEEMYKRL
EWIVRNEGTT VFDDLWTEVK GQ