Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1749 |
Symbol | |
ID | 7269455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2137893 |
End bp | 2139509 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643566591 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002463086 |
Protein GI | 219848653 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.483604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGGAT CAGATGAACG ATTACAGATA TTGCACATGG CCCGATCTGA GGTATTGGCC GGACGCATGA GCCGGCGCGC TTTTTTACGG CTTGGGCTGG CACTTGGTCT CTCGGCAGGA GCAACGGCAT TAGTAGCATG TAGTAACCCT TCACCCACAC CCACACCTGT TCAATCTGCA CCGACTCCTA CCGGAGCTGG TGGTCGTTTG CGTGTCGCTA CAGAGATTCC CGTACAACTC GACCCGGCAT TCGCGTCTTC CGATGCCGAA ATCCTGATTT TGAATCATGT GTATGATTAC CTGGTCGATA TCGATGCCAA CAATGCCATT ACACCTCGCC TTGCCCGCGA GTGGACGGTT AGCGATGATG GTTTGCGCTA TCGCTTCACC CTTCACGACG GGGTAACATT TCACGACGGC AGTCGCCTGA CCGCCGCCGA TGTGGTATGG ACGTTTAATC GCCTGCGCGA TCCCGCTCTC CAATTGCCAA CTGCCGATCT CTACGCCAAT ATTGCCGATA TCGCTGAGGA AAACGAAACT ACGGTTGCTT TTACCTTGTC AGAACCCAAC CCCTTCTTCC TCTACGATCT ATCGGATAAT CACGCCCTCA TTCTGCGTGC CGGAACGACA AATGCAGCAA CGTCGTTCAA TGGGACCGGC CCCTTCAAGG TCGTCGAATA TCGCACCGAG AATCGGATCG ATCTGGTTGC AGCCACACCC TATTTTCAGG CCGGTAAGCC TCTCGTCTCT GCGCTAGAGA TTATCTTCTT TGCCGATCAA GCAGCGGCCG TTGATGCGTT GCGTGGCGGG CAGGTCGATC TGGTGCTGCG GATGCCAACG CCGTTGTTCC AAACGCTGCA ACAAACCACC GGCATTGTCA CCGTACAAAC GCCAACTAAC GGTTTTGATC TGGTGCGATT ACGATCTGAC CGTGAACCGG GTAATAAACC AGAGGTGATC CGTGCGCTCA AACTGGCGAC CGACCGGCAA GCTATCTTCC AACAGGTGAA GGCCGGGTTG GGTGCTGTTG GACGCGATAG CCCGATCGGA CCGCTCTTTA GCGCGTACTA TTCGGAAGAG ACGCCGTTGC CACCGCGCGA TCCGGCGGCA GCCCGCGCGT TACTCGAAAC AGCCGGTTAT CGTGATGGCT TGAAGCTCGA TCTCCATGTA CCCGATTCAG GTGACCGGCC TGATCTGGCA GTGGTGCTGA AGGAGCAATG GGCCGAAGCC GGGATTGATG TTAACGTGAT CGTCGAACCC GAAAGTGTCT ACTATGGCGA TGATCGGTGG CTCGAAGTCG ATCTCGGGAT TACCGGCTGG GGTTCACGTC CGGTACCACA ATTTTATCTC GATGTGATGC TGGTCAGCGG AGCAATTTGG AACGAGAGCC ACTTTAGCGA TGCCGAATTC GATGCACTCG CAGCTCTTGC CCGCACTACG CTCAACGAGG ACGAACGTGT CCGGGCGTAT CGGGAGATCC AGCGGATTCT GATCGAGCGC GGGCCGATTA TTATTCCCTA TTTCTTTGCT CAGCTTAGTG CGCATCGCGA AGGTCTGCGC GGATATGTTG CGAAAGCTTT TCCCGGTCGC ACCGATTTGG CAAGTATTGC TGTTTAG
|
Protein sequence | MHGSDERLQI LHMARSEVLA GRMSRRAFLR LGLALGLSAG ATALVACSNP SPTPTPVQSA PTPTGAGGRL RVATEIPVQL DPAFASSDAE ILILNHVYDY LVDIDANNAI TPRLAREWTV SDDGLRYRFT LHDGVTFHDG SRLTAADVVW TFNRLRDPAL QLPTADLYAN IADIAEENET TVAFTLSEPN PFFLYDLSDN HALILRAGTT NAATSFNGTG PFKVVEYRTE NRIDLVAATP YFQAGKPLVS ALEIIFFADQ AAAVDALRGG QVDLVLRMPT PLFQTLQQTT GIVTVQTPTN GFDLVRLRSD REPGNKPEVI RALKLATDRQ AIFQQVKAGL GAVGRDSPIG PLFSAYYSEE TPLPPRDPAA ARALLETAGY RDGLKLDLHV PDSGDRPDLA VVLKEQWAEA GIDVNVIVEP ESVYYGDDRW LEVDLGITGW GSRPVPQFYL DVMLVSGAIW NESHFSDAEF DALAALARTT LNEDERVRAY REIQRILIER GPIIIPYFFA QLSAHREGLR GYVAKAFPGR TDLASIAV
|
| |