Gene Cagg_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1749 
Symbol 
ID7269455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2137893 
End bp2139509 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content55% 
IMG OID643566591 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002463086 
Protein GI219848653 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.483604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGGAT CAGATGAACG ATTACAGATA TTGCACATGG CCCGATCTGA GGTATTGGCC 
GGACGCATGA GCCGGCGCGC TTTTTTACGG CTTGGGCTGG CACTTGGTCT CTCGGCAGGA
GCAACGGCAT TAGTAGCATG TAGTAACCCT TCACCCACAC CCACACCTGT TCAATCTGCA
CCGACTCCTA CCGGAGCTGG TGGTCGTTTG CGTGTCGCTA CAGAGATTCC CGTACAACTC
GACCCGGCAT TCGCGTCTTC CGATGCCGAA ATCCTGATTT TGAATCATGT GTATGATTAC
CTGGTCGATA TCGATGCCAA CAATGCCATT ACACCTCGCC TTGCCCGCGA GTGGACGGTT
AGCGATGATG GTTTGCGCTA TCGCTTCACC CTTCACGACG GGGTAACATT TCACGACGGC
AGTCGCCTGA CCGCCGCCGA TGTGGTATGG ACGTTTAATC GCCTGCGCGA TCCCGCTCTC
CAATTGCCAA CTGCCGATCT CTACGCCAAT ATTGCCGATA TCGCTGAGGA AAACGAAACT
ACGGTTGCTT TTACCTTGTC AGAACCCAAC CCCTTCTTCC TCTACGATCT ATCGGATAAT
CACGCCCTCA TTCTGCGTGC CGGAACGACA AATGCAGCAA CGTCGTTCAA TGGGACCGGC
CCCTTCAAGG TCGTCGAATA TCGCACCGAG AATCGGATCG ATCTGGTTGC AGCCACACCC
TATTTTCAGG CCGGTAAGCC TCTCGTCTCT GCGCTAGAGA TTATCTTCTT TGCCGATCAA
GCAGCGGCCG TTGATGCGTT GCGTGGCGGG CAGGTCGATC TGGTGCTGCG GATGCCAACG
CCGTTGTTCC AAACGCTGCA ACAAACCACC GGCATTGTCA CCGTACAAAC GCCAACTAAC
GGTTTTGATC TGGTGCGATT ACGATCTGAC CGTGAACCGG GTAATAAACC AGAGGTGATC
CGTGCGCTCA AACTGGCGAC CGACCGGCAA GCTATCTTCC AACAGGTGAA GGCCGGGTTG
GGTGCTGTTG GACGCGATAG CCCGATCGGA CCGCTCTTTA GCGCGTACTA TTCGGAAGAG
ACGCCGTTGC CACCGCGCGA TCCGGCGGCA GCCCGCGCGT TACTCGAAAC AGCCGGTTAT
CGTGATGGCT TGAAGCTCGA TCTCCATGTA CCCGATTCAG GTGACCGGCC TGATCTGGCA
GTGGTGCTGA AGGAGCAATG GGCCGAAGCC GGGATTGATG TTAACGTGAT CGTCGAACCC
GAAAGTGTCT ACTATGGCGA TGATCGGTGG CTCGAAGTCG ATCTCGGGAT TACCGGCTGG
GGTTCACGTC CGGTACCACA ATTTTATCTC GATGTGATGC TGGTCAGCGG AGCAATTTGG
AACGAGAGCC ACTTTAGCGA TGCCGAATTC GATGCACTCG CAGCTCTTGC CCGCACTACG
CTCAACGAGG ACGAACGTGT CCGGGCGTAT CGGGAGATCC AGCGGATTCT GATCGAGCGC
GGGCCGATTA TTATTCCCTA TTTCTTTGCT CAGCTTAGTG CGCATCGCGA AGGTCTGCGC
GGATATGTTG CGAAAGCTTT TCCCGGTCGC ACCGATTTGG CAAGTATTGC TGTTTAG
 
Protein sequence
MHGSDERLQI LHMARSEVLA GRMSRRAFLR LGLALGLSAG ATALVACSNP SPTPTPVQSA 
PTPTGAGGRL RVATEIPVQL DPAFASSDAE ILILNHVYDY LVDIDANNAI TPRLAREWTV
SDDGLRYRFT LHDGVTFHDG SRLTAADVVW TFNRLRDPAL QLPTADLYAN IADIAEENET
TVAFTLSEPN PFFLYDLSDN HALILRAGTT NAATSFNGTG PFKVVEYRTE NRIDLVAATP
YFQAGKPLVS ALEIIFFADQ AAAVDALRGG QVDLVLRMPT PLFQTLQQTT GIVTVQTPTN
GFDLVRLRSD REPGNKPEVI RALKLATDRQ AIFQQVKAGL GAVGRDSPIG PLFSAYYSEE
TPLPPRDPAA ARALLETAGY RDGLKLDLHV PDSGDRPDLA VVLKEQWAEA GIDVNVIVEP
ESVYYGDDRW LEVDLGITGW GSRPVPQFYL DVMLVSGAIW NESHFSDAEF DALAALARTT
LNEDERVRAY REIQRILIER GPIIIPYFFA QLSAHREGLR GYVAKAFPGR TDLASIAV