Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2614 |
Symbol | |
ID | 4897377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2755377 |
End bp | 2758700 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113214 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001044488 |
Protein GI | 126463374 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.95635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCA AGGCGAGCAT TCACCACCTG ACCCACTACC GCTACGACCG TCCGGTGACG CTCGGGCCGC AGCTGATCCG CCTGCGCCCC GCGCCCCATT CCCGCACGCG GGTGATCTCG CATGCGCTGA AGGTCTCGCC GGGGGGGCAT TTCGAGAATC ATCAGCAGGA TCCCTACGGC AACTGGCTGC TGCGCGTGGT CTTTCCCGAG CCGGTGACCG AGTTCCGCAT CGAGGTCGAT CTGGTGGCCG ACATGACCGT CTACAATCCG TTCGACTTCT TCGTCGAGGA GACGGCCGAG CACTGGCCCT TCGACTATCC CGAGGAGATC GTCGAGGATC TCTCGATCTA CCGCAGCCCC GAACCTGCGG GGCCGCATCT TCAGGCCTTC CTTCAGACGA TCCCGCGCGA CCGGCAGCGG ACCGTCGACA TGGTGGTCGG GCTGAACGCG CGCCTCGCCC GCGAGATCGC CTATGTGATC CGAATGGAGC CCGGCGTCTT CAGCCCCGAG GAGACGCTGG CCCAGAGGCG CGGCTCGTGC CGCGACAGCG CCTGGCTTCA GGTGCAGATC CTGCGCCACC TGGGCCTCGC CGCCCGCTTC GTCTCGGGCT ACCTGATCCA GCTCAAGCCC GATCTCGAGG CGCTGGACGG GCCGTCGGGG ACCGATCACG ATTTCACCGA CCTCCATGCC TGGGCCGAGG TCTATCTGCC GGGAGCGGGC TGGATCGGGC TCGACGCCAC CTCGGGGCTT CTGACGGGCG AGAGCCACAT TCCGCTGGCC GCGACCCCGC ATTACCGCAA CGCCGCCCCC ATCGCCGGCA TGGCGAGCTA TGCCGAGGTG GATTTCGCCT TCGACATGAA GGTGACCCGC GTGGCCGAGC ATCCGCGGAT CACGAAACCC TTTTCCGACG AGAGCTGGCA GCGGCTCGAT GCGCTGGGCC ACCGGGTCGA TGCGGCGCTG AGGGCGGGCG ACGTGCGGCT CACCATGGGG GGCGAGCCCA CCTTCGTCTC GATCGACGAT TTCGAATCGG GCGAGTGGAA CACCGATGCC GTCGGCCCCA CCAAGCGCGT CCTCGCCGAC CGGCTGATCC GGCGGCTCCG CGACCGGTTC GCGCCGGGGG GCTTCCTGCA TTACGGGCAG GGGAAATGGT ATCCGGGCGA GACCCTGCCG CGCTGGACCT TCTCGCTCTA CTGGCGCGAG GACGGGGCGC CCATCTGGCG CGACGGCGCG CTGGTGGCGG GCGAGACGGG GCAGGGCGGC GTGGGGCCGG CCGAGGCGGA GCGGCTGATG CAGGGCATCG CGGCCGAGCT CGGGCTCGAG CCCGACCTCG TGGTGCCCGC CTACGAGGAT CCGGGCGAGT GGCTTCTGAA GGAGGCGAAC CTTCCCGAAA ATGTGACGCC CGAGAATTCG GAGCTGAAGG ACCCCGAAGA GCGGCTGCGC ATGGCCCGCG TCTTCGAGCG CGGTTTGACC GAGCCCTCGG GCTTCGTCCT GCCGGTGCAG CGCTGGCAGG CGCAGGCGCC GCGCCCGCGC TGGCGGTCCG AACGGTGGCG GCTGCGGCGC AGGCACCTGT TCCTCGTGCC CGGCGACAGC CCGGTGGGCT ACCGCCTGCC GCTGGGCGCG CTGCCCCACA TCCCGGCCTC GCGCTACCCC TACATCAACC CGACCGATCC CACGGTCGAG CGCGGGCCGC TGCCGCCCGC GGGCGAGGCG CAGATCGTGC CGCTCCAGAC CCCCGAGGCC GCGGTGGCGA GCTTCACCGC CTCGGCGCCG GGCCAGACGA TGGTCGAGCA GATCCTCGGC GACGAGGGCG CCGTGCGCAC CGCGCTCGCC GTCGAGGTGC GGGACGGGCG GCTCTGCATC TTCATGCCGC CGGTGGAGGC GGTCGAGGAT TACCTCGACC TCCTGACCGC CGCCGAGGAG GCCGCGCGCA AGCTGGGCCT GCCGGTCCAT GTCGAAGGCT ATGCGCCGCC GCACGATCCG CGGCTGAACG TGATCCGCGT GGCGCCCGAT CCGGGCGTGA TCGAGGTGAA CATCCATCCC GCCACCAGCT GGGAGGAGTG CGTGTCGATC ACGACGGCGG TCTACGAGGA AGCCCGCCAG TGCCGCCTCG GCGCCGACAA GTTCATGATC GACGGCAAGC ATTGCGGCAC CGGCGGCGGC AACCATGTCG TCGTGGGCGG GCGCACGCCC ATGGACTCGC CCTTCCTGCG GCGGCCGGAC CTGCTGCGCA GCCTGATCCT GCACTGGAAC CGGCATCCGT CGCTCTCCTA CCTCTTCTCG GGCCTCTTCA TCGGCCCGAC CAGTCAGGCG CCGCGCATCG ACGAGGCGCG CCACGACAGC CTCTTTGAGC TGGAAATCGC GCTGTCGCAG ATCCCGGAGC CCGGCGATCC GCGCGCGGCC CTCTGGCTGC CCGACCGGCT TCTGCGCAAC ATCCTGACCG ACGTGACCGG CAACACCCAC CGCGCCGAGA TCTGCATCGA CAAGATGTTC TCGCCCGACG GGCCCACCGG GCGGCTCGGC CTCGTCGAGT TCCGCGGCTT CGAGATGCCG CCCGACCCGC GCATGAGCCT CGCCCAGCAG CTTCTGATCC GCGCCCTCAT CGCGCGCATG TGGCAGAACC CGGTGACGGG GCCGCTCACC CGCTGGGGCA CGGCGCTGCA CGACCGTTTC ATGCTCCAGC ATTACGTCTG GGAGGATTTC CTGGACGTGC TGGCCGATCT GCGGGCACAC GGGTTCGATC TCGACCCGGA ATGGTTCCGG GCGCAGGCCG AGTTCCGCTT CCCCTTCTGC GGCGAGGTGA CCTACGAGGG CGCGCATCTC GAGATCCGGC AGGCGCTCGA GCCATGGCAT GTGCTGGGCG AGACGGGCGC CATCGGGGGG ACGGTGCGCT ACACCGACAG TTCGACCGAG CGGCTGCAGG TGACGCTCTC GGGCGCCGAT CCCGCGCGCT ACCGCGTGGC CTGCAACGGG CGCGAGGTGC CGCTCGTGCC GGTGGCCAAT GGCTGCGCCG TGGCGGGGGT GCGGTTCAAG GCCTGGCAGC CCGCCGCGGC GCTGCATCCG ACCCTGCCCG TCGATGCGCC GCTCACCTTC GACATCTACG ACACTTGGTC GGGCCGGTCG CTCGGCGGCT GCGTCTATCA TGTGGCCCAT CCCGGCGGGC GCAACTACGA GACCTTCCCG GTGAACGGCA ACGAGGCCGA GGCGCGCAGG CTTGCGCGCT TCCAGCCCCA CGGGCACAGT GCCGGCCTCT GGCCGCTCGC GCCCGAGCGG CCGCACCCGG AGTTTCCGAT GACGCTCGAC CTGAGACGGC CCGCGGGGCT CTGA
|
Protein sequence | MSIKASIHHL THYRYDRPVT LGPQLIRLRP APHSRTRVIS HALKVSPGGH FENHQQDPYG NWLLRVVFPE PVTEFRIEVD LVADMTVYNP FDFFVEETAE HWPFDYPEEI VEDLSIYRSP EPAGPHLQAF LQTIPRDRQR TVDMVVGLNA RLAREIAYVI RMEPGVFSPE ETLAQRRGSC RDSAWLQVQI LRHLGLAARF VSGYLIQLKP DLEALDGPSG TDHDFTDLHA WAEVYLPGAG WIGLDATSGL LTGESHIPLA ATPHYRNAAP IAGMASYAEV DFAFDMKVTR VAEHPRITKP FSDESWQRLD ALGHRVDAAL RAGDVRLTMG GEPTFVSIDD FESGEWNTDA VGPTKRVLAD RLIRRLRDRF APGGFLHYGQ GKWYPGETLP RWTFSLYWRE DGAPIWRDGA LVAGETGQGG VGPAEAERLM QGIAAELGLE PDLVVPAYED PGEWLLKEAN LPENVTPENS ELKDPEERLR MARVFERGLT EPSGFVLPVQ RWQAQAPRPR WRSERWRLRR RHLFLVPGDS PVGYRLPLGA LPHIPASRYP YINPTDPTVE RGPLPPAGEA QIVPLQTPEA AVASFTASAP GQTMVEQILG DEGAVRTALA VEVRDGRLCI FMPPVEAVED YLDLLTAAEE AARKLGLPVH VEGYAPPHDP RLNVIRVAPD PGVIEVNIHP ATSWEECVSI TTAVYEEARQ CRLGADKFMI DGKHCGTGGG NHVVVGGRTP MDSPFLRRPD LLRSLILHWN RHPSLSYLFS GLFIGPTSQA PRIDEARHDS LFELEIALSQ IPEPGDPRAA LWLPDRLLRN ILTDVTGNTH RAEICIDKMF SPDGPTGRLG LVEFRGFEMP PDPRMSLAQQ LLIRALIARM WQNPVTGPLT RWGTALHDRF MLQHYVWEDF LDVLADLRAH GFDLDPEWFR AQAEFRFPFC GEVTYEGAHL EIRQALEPWH VLGETGAIGG TVRYTDSSTE RLQVTLSGAD PARYRVACNG REVPLVPVAN GCAVAGVRFK AWQPAAALHP TLPVDAPLTF DIYDTWSGRS LGGCVYHVAH PGGRNYETFP VNGNEAEARR LARFQPHGHS AGLWPLAPER PHPEFPMTLD LRRPAGL
|
| |