Gene Cagg_2385 details


Gene Information

Locus tag: Cagg_2385
Symbol:
ID: 7268737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2900519
End bp: 2902303
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 56%
IMG OID: 643567212
Product: extracellular solute-binding protein family 5
Protein accession: YP_002463695
Protein GI: 219849262
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
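
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record.
length_bp = gene_length(2900519, 2902303)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.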


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTCA CGCGCTGGCG TCACTCACTC GCGCTCACGC TGCTGTTTGG GTTATTGTTA 
CCCATCCTGG CAGCATGTAC CGGCGGTTCG ACACCGAGCG CCGAGCCAAC CAGTGCTCCC
GCCGCACAAC CCACCACAGC CCCTGCCGCA CAACCCACCA CAGCCCCTGC TGAACAACCA
ACAGCGGCGC CGGTTAGCGG TACAGCGCAA CCGGGAGTGC TACGACTGGT GACCGGTTCT
GAACCAGATA ACATCGATCC ACAGAAGGCC AGCTTTGTCA ACGAGATCCA GTTCATTATG
ATGGCCTATC AGGCCCTAAT GACGTTTGAT CTCGATATGA ACCCGATTCC GGGCGCTGCC
GAAAAAGTCG ATGTTTCGGC GGATGGAACC GTCTACACCT TCACTCTGCG CCGCGACGCC
AAGTACAGCG ACGGCACGCC GGTCACTGCC CGTAATTTCG AGTATGCGTG GAAGCGGCTG
GCCGATCCGG AAACGGCAGG TGAATATCAG TCGCTGCCCT GTGGCATTAT CAAAGGCTAC
AGCGAATACA GCGCTGCTGC CTGCCAAGGC TTGTCGATGG AAGAGGTGAT GAAGCTCGAT
CTTGAGAAGC TCCGCAACGA CTTCGGTGTC AAAGCACTCG ACGATTACAC GCTCCAGATC
GAGCTGGTCG CGCCAGCACC CTATTTCCTC AGCATGGCTG CGCTCTGGAT CGGCGTCCCA
GTACGCGAAG AGGATGTGGC AAAGGGTGAG GACTGGTGGT ACTACCCCGA AAACTATATC
GGGAATGGTC CCTTTGTTCT GAAAGAGTGG GAACACGACA GCAAAGCGAT CTGGGAACCG
AATCCCAACT ACGTCGGACC ACTGGGGCCG GTCAAACTTA AGCGAGTCGA GTATTACATG
ATCACCGATG GTCAGGTGGC GTTCCAAGCT TACCAGAACG GTGAGATTGA CATGCTCGGC
GTCGGTGCCG AAGATCTTGC GACCGTCAAG TCCGATCCGG TTCTGAGCCA GCAAACAATT
GACGTTCCCG GCTCCTGCAC CTTCTACTTC GGCTTCAATC TCGCCAAGCC GCCGTTCGAC
AACAAGCTGG TGCGGCAAGC TTTTGCCCAA GCCCTCGACC GAGAAGCTTA CGTGCGTGAT
GTCTTCCAAG GCTTGGGAAC CCCCACCTAC AGCTTTATCC CGCCCGGCTT CCCCGGCTAC
CAGCCTGACT TGAAGCTGTG GGAGTTCAAC CCTGAAAAGG CGAAACAGAC ATTGGCCGAG
GCCGGGTATC CGAACGGTGA AGGGCTACCG GAAATCAAGC TGACCTATTC GGGTACGGCC
CGCAACAATG CCCGGTTTGA GTGGGTGGCG AACCAGTTTA AGCAGAACCT TGGTATCGAT
GTGATCCTTG ATCCAATCGA TCCTACAGCT TTTGCTGCGG CCACTAAGGA TAACCCGCCA
CAAATGTTCT CGCTTGGCTG GTGTGCCGAC TACCCCGATC CACAGAACTG GATCTCGCTC
TTCCAGACCA ACGGTCTGCT CAGCGCTCGC GTCAACTACT CGAACCCGCA ACTCGATGAG
TTGATACGAC AGGCAGACGT TGAGCAAGAT CCGGTCCGTC GTGCCGAACT CTACGCCCAA
GCGCAGAAGA TACTGGTAGA AGATGCACCG GTTGTCTTCA TGCTGAACAA TGGCGGTCCG
GTATTAGTGA AGCCGTATGT TAAGGGGGTA ACACCGGATA CGATCACCCC ACTCGACTAC
TGGCTCGGCT TCTTCAACCT ACCGAACGTT GATGTTCAGC CTTAG
 
Protein sequence
MNFTRWRHSL ALTLLFGLLL PILAACTGGS TPSAEPTSAP AAQPTTAPAA QPTTAPAEQP 
TAAPVSGTAQ PGVLRLVTGS EPDNIDPQKA SFVNEIQFIM MAYQALMTFD LDMNPIPGAA
EKVDVSADGT VYTFTLRRDA KYSDGTPVTA RNFEYAWKRL ADPETAGEYQ SLPCGIIKGY
SEYSAAACQG LSMEEVMKLD LEKLRNDFGV KALDDYTLQI ELVAPAPYFL SMAALWIGVP
VREEDVAKGE DWWYYPENYI GNGPFVLKEW EHDSKAIWEP NPNYVGPLGP VKLKRVEYYM
ITDGQVAFQA YQNGEIDMLG VGAEDLATVK SDPVLSQQTI DVPGSCTFYF GFNLAKPPFD
NKLVRQAFAQ ALDREAYVRD VFQGLGTPTY SFIPPGFPGY QPDLKLWEFN PEKAKQTLAE
AGYPNGEGLP EIKLTYSGTA RNNARFEWVA NQFKQNLGID VILDPIDPTA FAAATKDNPP
QMFSLGWCAD YPDPQNWISL FQTNGLLSAR VNYSNPQLDE LIRQADVEQD PVRRAELYAQ
AQKILVEDAP VVFMLNNGGP VLVKPYVKGV TPDTITPLDY WLGFFNLPNV DVQP
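
The sequences above are laid out in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and scanned for GC content. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon table is written out by hand; it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for codons after the start position. Only the first line of the gene sequence is used as input here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAACTTCA CGCGCTGGCG TCACTCACTC GCGCTCACGC TGCTGTTTGG GTTATTGTTA"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MNFTRWRHSLALTLLFGLLL
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would be one way to recompute the GC content field.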