Gene Cagg_3058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3058 
Symbol 
ID7269475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3719554 
End bp3721218 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content56% 
IMG OID643567878 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002464352 
Protein GI219849919 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000104524 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCCGCC GCATCCGTTG GCAGATTGCC ATTGCCCTAT TCGGTATTAG CGTTATCGTC 
GGTTTGTTTG GGCGTTTGGC CGCACTGAGC GCCTCGACCG ATAATCCGGC TATCGGTGGT
ACCTACCGCG AGGCAGTGAT CGGTTCGCCC AGCGTCCCGA TCCCGCTTCT CAACGATCCA
ATTACCGATC CGACCGGTCG TGTGTTGATC AGTCTGCTCT TCGACGGTCT CACTCGCATT
GGGATCGATG GATTGATTGA GCCGGCACTC GCGGAAGGGT ATACGGTTGA TGCAACCGGT
GAGATTTATA CCTTTAAACT ACGCTCTGGA TTACGCTGGC ACGATGGGAC GCCGATTACC
ACCGATGATG TTATCTTTAC CGTCCGCACT TTGCAAGAGT TAGAGACGCC CGGCGAGCCG
GCACTTGCCA ATTTTTGGCG CACGACCCTT GTCGAACGGG TAGATAACCG AACGGTACGG
TTTATTCTAT CCGGCCCTTT TGCACCTTTT CCGGCATTGG CACGTATGCC GATCTTGCCG
GCGCATCTCT TGCGTGGTAT TGCGCCTTCC GAATGGCCAA CCGGTGACTT TGCGCGTCGG
CTGATCGGGA GTGGCCCCTT CCGTTTGGCC GAATTGCGGG CTGATGCCGC CATCCTCACT
GCCAATCCAA CCTATTACAA TGGGCGGCCA TTCCTCGATC AGATTGAGTT GCGCTTTATG
GCTACGCCCG AAGCAGCGGT TGCTGCACTC TTGCGGGGTG AAGTGATGGG TTTTGCCGAA
CGATGGGGAA CTAATCTCAA AACTGTTGAT GTACCCGGTG AAGAACGCCG CATCGTTTTA
CCGATAGATG AATATACGAC GCTAACGTTT AATCTGCGCC TACCTTTATT TCAAGAGATT
CCCCTGCGCC GAGCACTGGC CCTCGGCCTC AACCGTGATG CCCTGATTGA GACGGTACTC
AACGGACTAG CCCAACCGAT CGATACACCG CTCTTGCCCG GTACATGGGC CGATGATCCG
ACAATACGTC GGGTACCTGC CGATCCGACG GTTGCAGCCG AACGACTGGC CGAAATTGAC
TTCGAGCCGG GTAGTGATGG AATCCGCCAG CGTGGTGCTC AGCGCCTGAG CTTTAGTTTG
CTGGTTGATC AAAATGAACG ACGGTTGGCA GTTGCTACGG CCATTGCTGA ACAATGGCGT
GCGATTGGGG TAGAGGTAAC GGTTGAATCG GTTGAGAGTA CTACCCTCGT CGAACGATTG
CGCAAGGGCG ATTTCATGGC AGCAATCCAT ACGTGGACGC GAATCGGCCC TGATCCCGAC
GTGTACAGCC TCTGGCATTC TAGCCAGGCC AAGGGTGGTC TGAATTATGC CGGTCTGAAC
GATGGACGGA TCGACACCTT GCTCGAACAA ACCCGATCGG AACCAGAATT GGCTGCACGG
GCAGAACTCT ACCACGAGTT TCAACGCCGA TGGCTGGAGT TGTCGCCGGC AATAACCCTC
TACCAACCGC AGTACCTCAT CGTCAATAGT GCGAATGTGC AAGGACGAGC CTTTGCCAGT
CCTGATTTTG CCAACCAAAC CCTCTTCGGC GCCGAAGATC GGTTCCGCGA TGTGCAGCGT
TGGTTCGTCA ATAGCTTTCG CCGCCTCGAA GGTGATCTGC CGTAG
 
Protein sequence
MARRIRWQIA IALFGISVIV GLFGRLAALS ASTDNPAIGG TYREAVIGSP SVPIPLLNDP 
ITDPTGRVLI SLLFDGLTRI GIDGLIEPAL AEGYTVDATG EIYTFKLRSG LRWHDGTPIT
TDDVIFTVRT LQELETPGEP ALANFWRTTL VERVDNRTVR FILSGPFAPF PALARMPILP
AHLLRGIAPS EWPTGDFARR LIGSGPFRLA ELRADAAILT ANPTYYNGRP FLDQIELRFM
ATPEAAVAAL LRGEVMGFAE RWGTNLKTVD VPGEERRIVL PIDEYTTLTF NLRLPLFQEI
PLRRALALGL NRDALIETVL NGLAQPIDTP LLPGTWADDP TIRRVPADPT VAAERLAEID
FEPGSDGIRQ RGAQRLSFSL LVDQNERRLA VATAIAEQWR AIGVEVTVES VESTTLVERL
RKGDFMAAIH TWTRIGPDPD VYSLWHSSQA KGGLNYAGLN DGRIDTLLEQ TRSEPELAAR
AELYHEFQRR WLELSPAITL YQPQYLIVNS ANVQGRAFAS PDFANQTLFG AEDRFRDVQR
WFVNSFRRLE GDLP