Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0551 |
Symbol | |
ID | 7267048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 670481 |
End bp | 672517 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643565414 |
Product | Oligopeptidase B |
Protein accession | YP_002461926 |
Protein GI | 219847493 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTG AGGCTGTCCG TCCGCCACAT GCCGAACGCA AACCGGTGAT ACTTCGGCTT CACGGCGATG AGATCGTTGA TGAGTTTGCG TGGCTGGAGA ATCGTGATGA TCCGGCAGTC ATTGCCTATC TCGAAGCCGA AAATACCTAT GCCGAAGCAG TGATGGCACC GGCTGCACCG CTGCGTGAAC AGCTCTACGC TGAAATGCGT GGCCGGATCA AAGAGGAGGA TCGGTCGGTG GCGGTGCCGC GGGGACGGTA TCGTTACTAC TCGCGGACTG AGGCCAGTGC TGAGTATCCG GTGATGTGTC GTACCGAGGG CGACGATGGA CTGGAAGAGG TGGTACTTGA TCTGAATACA TTAGCAATCG GGCATGCGTT TTGTCAGTTA GGAGCTTATG AGCCATCACC GAACCAGCAC TTGCTGGCTT ATGGCTTAGA TACCACCGGT TCAATTATCT TTACGCTGTT CATCAAAGAT CTCACGACCG GTGCCTTACT CGACACGCCT ATCGAGCGGG TGAATGATGT GCAGTGGGCC GATGACCGGA CCCTCTTCTA CACCGTCTTC GATGATGCTC ACCGGGCGTA TCGCCTCTAC CGTCATGTAC TGGGGACCTC ACCTGCCGAC GATGCGTTGA TCTATGAAGA GACCGATGAA CGGTTTAGTC TGAGCCTACG CCGTACCCGT TCGGGAGCCT ATCTCCTCCT TACGAGTTAT AGTCACGGTG GGACAGAAGT ACACTATGTT TCAACTGCCA CCCCATTTGC CGACTGGCAG GTGATCTATC CGCGACGGCC CAAGATCGAC TACTTTGTCG ATCATCACGG CGATTACTTC TACATCCGCA CGAATGACGG TGCCGAGAAT TTTCGCCTGA TCCGCGCACC GATCTCCGAT CCGACGGCAA TGATCGAACT CGTGCCGGGT CGGGTTGATG TGCTGATCGA CCATTTCGAT TGTTTCGCCG ACTATTTGGT GGTCTATGAG CGGCGCGATG GATTACGCCA GATTCGGATC AGCACTCCCG ATGGTGATCA GGTGCGTTAC GTTTCGTTCC CCGAACCGGT CTACACGTGT GGACCGCATG AAAACAAAGA GTTCGCAACT GACCGGTTAC GCCTGAGCTA CAGTTCACTC ATCACACCGC CGTCGGTGGT TGAATACAAT ATGCGCACCG GCTCATGGCA GGTGGTGAAG CAGGAGGAGA TTCCGTCTGG CTACGATCCA TCGCGCTACG TTAGCGAACG GCTCACTGCG ACGGCGCCAG ATGGAGCACG GGTGCCTATT TCACTCGTTT ACCGGCGTGA TCGACCGCGT AACGGTGGGC CTTGTCTTTT GGTTGGGTAT GGCTCGTATG GCTACAGTTA TGAGCCATCA TTCGATAGTA AGCGCCTCAG CCTTCTCGAT CGAGGCTTTG TTGTGGCAAT TGCCCATATT CGCGGCGGTC AAGAACTAGG GCGACGGTGG TATGAGCAGG GGCGTATGCT GCATAAGCCC AATACGTTCA GTGACTTTAT TGCCTGCGCC GAACACCTGA TCGCTGCCGG ATACACTTCA CCTCGTCAAT TGGCGATTAG TGGGCGGAGT GCCGGTGGTT TGCTGATGGC TGCCGTCGTT AATGCTCGTC CCGATCTCTT TCAGGCGGTG GTCGCCGGGG TACCGTTTAC CAACGTGATT ATCGCGATGC TCAAACCCGA TCTGCCGCTC ACCGTCACCG AATGGGAACA GTGGGGTAAT CCGGCTATCG AAGCTGAATA TCGGGTGATG CGTTCATACG ATCCCTATCT GAACGTGAAG CCGGGTCCGT ACCCGCACAT TCTGGCGACT GCCGGTCTCC ACGATTTGCA AGTGCCGTAC TGGGATCCGG CCAAATGGGT GGCTAAGCTG CGTACTGTTA AAACTAATGA TACGATGTTA CTGTTGCGCA CCAATATGCA GGCCGGTCAT AGTGGCCATT CTGGGCGCTT TGCCCGCCTC ACCGAGTTTG CGTGGGAGTA TGCTTTTATC TTGACTGCCT TGGGAATTGC GTCGTAG
|
Protein sequence | MSVEAVRPPH AERKPVILRL HGDEIVDEFA WLENRDDPAV IAYLEAENTY AEAVMAPAAP LREQLYAEMR GRIKEEDRSV AVPRGRYRYY SRTEASAEYP VMCRTEGDDG LEEVVLDLNT LAIGHAFCQL GAYEPSPNQH LLAYGLDTTG SIIFTLFIKD LTTGALLDTP IERVNDVQWA DDRTLFYTVF DDAHRAYRLY RHVLGTSPAD DALIYEETDE RFSLSLRRTR SGAYLLLTSY SHGGTEVHYV STATPFADWQ VIYPRRPKID YFVDHHGDYF YIRTNDGAEN FRLIRAPISD PTAMIELVPG RVDVLIDHFD CFADYLVVYE RRDGLRQIRI STPDGDQVRY VSFPEPVYTC GPHENKEFAT DRLRLSYSSL ITPPSVVEYN MRTGSWQVVK QEEIPSGYDP SRYVSERLTA TAPDGARVPI SLVYRRDRPR NGGPCLLVGY GSYGYSYEPS FDSKRLSLLD RGFVVAIAHI RGGQELGRRW YEQGRMLHKP NTFSDFIACA EHLIAAGYTS PRQLAISGRS AGGLLMAAVV NARPDLFQAV VAGVPFTNVI IAMLKPDLPL TVTEWEQWGN PAIEAEYRVM RSYDPYLNVK PGPYPHILAT AGLHDLQVPY WDPAKWVAKL RTVKTNDTML LLRTNMQAGH SGHSGRFARL TEFAWEYAFI LTALGIAS
|
| |