Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3833 |
Symbol | |
ID | 8335186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4339265 |
End bp | 4341637 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956970 |
Product | transglutaminase domain protein |
Protein accession | YP_003114573 |
Protein GI | 256393009 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0481829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0321175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGC CTGGTACGCG GCTGAGCCGC GCGACGCCGC CCCTGGCCGC CGATCCCCGA GGCGCGCCGG CGCCGTCGCC ATCGAACCGG GGCCCGCTGG CGGCGTTGCT GTGTATCGCA GGCGCAGCGC TCACGGCAGT ACCGTACGGG TCGTTTTTCG CTTCCGGCAA CGCCGTCATA CGGCTGGCGG AGGGCGCGAT CGCCGGCGGT GTCATCGCCT GGCTGTGTGC GGTGTTTCTT CGACGGCCTC TGGCTATCGC CGCCGTGGGC GCCGCGGGAC TCGCGGTCAG CGCCGTGTAT ATCGTGCTCG GCAACACCCT GACACACGGC CTGCCACAGA CGTCCACGCT GCGCGTCGGC AGGCAGGCGC TGGTCGGCGG CTGGGCGGCG ATGCTCTCGG TGGGCGCCCC CGCCGGGACG ACAGCTCGAT TGGTCATGGA TCCGTATGCC GTCACCTACA CCGCAGCTTT CACAGCACTG ATACTCGCCA GGCGTACCCG GGTGGCCCTG GCGCCTGCCG CGGCGCTGAT GGTGGCCGAG CTGGTCGCCC TGTTCTTCGC CGGGGTCCAG CCCACGACAC ACCTCGCACA GTGCGCAGGG CTATTCGTGC TCGTACTCGT CTTGAGCGCA CTGCGCACGC ATGCTTTGCG AACCGGGCCG ATGCGCATCC GCGCAGGAAA TCCGGCTCCC ATCCAGGCCG TCGCCGTCCT CGTCGTGATG GTGATCGCCA CCCTCGGCGC TCGCGCGCTG CCAATGGACG GCAAGCGTTT CGATCCATCC ACGCTGGTGC GGTCGGCGCT GTCCTTGCAT CCGACGGTCA ACCCGCTCGC CGAGGTGCGT GCGCAACTCC AGCGCCCGCA GGCGGAGGAC CTGTTCAGTG TGAGGATATC CGGCGCGAAC GGCGGCGAAC GGCTCTCCTC ACGCGCCACG AACACCGGCG CCGCCGGCGA TGACGGGCTC GGCGGCGGAA TCGACCTGAT CCAGTGCGCC GCACTCGACA GCTTCAGCGG ATCGCAGTGG AGCAGCTCCG CCGCTTACCT GGTCTCCGGT CCCGACCTGG CGCCCGGCCC CGTCCAGCCC GATGCGACAC GTCTGACGGA GCAGATCTCG CTCACCGGTC TCAGCGGCCC GTTCCTGCCG ACGATCGGCC GTCCGTCCAC CATCACCGGG AGCTTCGGCA CCGACGCCGT GGTCGGCTTC GACACCGTCT CGGGAACGCT GGTCACCGAC GCCGCAAAGC TCTCCGGTGT GTCCTACAAG GTCACCTCCG CAGTAAGTCC ATCCAGTTCC GCGCTGGAGA ACGCCCCGGT CGGCACAGGC TCCGCGTACG CGCCGTATCT GGCTCTGCCT TCCGTACCAC CGGACTTGTC GGCGCTGGCG GCGACGATCA CGTCTAATTA TCAGGGCCCT TATGCAAAGG CCGTGGCGAT CGAACGGTAC TTGCGCCAGC TGCCGTACAA CGTGAACGCG CAGCCAGGTG AGTCCTACGC CGCTCTGGAA CGCATGCTCG ACGCCGAGGA TCCGCAATCC GCCGCAGCAT ACGGCGAGCA GCATGTCTCG GCGTTCGCCG TCCTGGCCAG ATCCGCGGGG CTACCGACAC GCATCGCCGT GGGCTACGCC CTCACCGGCG TGAACGCGTC ACAGTACACG GTGACGACGG CTGATGCCTA CGCATGGGAC CAGGTCTACT TCCAAGGCCA CGGCTGGGTC GATTTCGACC CGACCGATCC GAACCGCGGA TTCAGGCTGC CTGATCAGCA ACCCCTGGAG GTGGCCGTGA CAGTGCCGAC GCCTTCGATA CCGCCTCCAC CGACACCGAT CACGAGCGTC CCGTCGGCGA CACCCACCCC CATCGTGCCC CCGGTGGCTC CGCGACAGCA CTCAGCCTTT CCGTGGCGGG CGGTCGCCGG AACCGGGGTC GGCCTCCTGG TCTGCGCGGC AGCAGCCGTC CCGCTCATCC GCGCGTCCAC GCGACGCCGG CGGCGCCGGG CCCGGTTGAG CGGCGGACCC TCGCACCGCG TGGCCGGCGC GTGGCTGGAG ATCTGCGACC GGCTCGGGCG GGCCGGGGTG CCGATCCCGC CCACGCAGAC CGTCGTCGAA GCGGCACGCA CCGCCGAGGC GGCGGCCGCC CGCGAGCCGC TCGGTCGTGG ACGAACGGCT CGGATACGGC GCCGGGCCGC CGAGTCGCTG GCCACCCTCG CGCCGCTGAC CGATCGCGCG GTGTTCGCAC CGCACCCTGT GTCGGAGTCC GACGCACAAG ACGCGCTGAC CGTGGAGCGG GAGTTCCGCA GCGAGTTCAG CCGGATATCC GGCGCTGCCC GGATGATCCG CCGACGTGAC CGGGCTGCGC GGGGCAAGAA GGGACAACGA TGA
|
Protein sequence | MEAPGTRLSR ATPPLAADPR GAPAPSPSNR GPLAALLCIA GAALTAVPYG SFFASGNAVI RLAEGAIAGG VIAWLCAVFL RRPLAIAAVG AAGLAVSAVY IVLGNTLTHG LPQTSTLRVG RQALVGGWAA MLSVGAPAGT TARLVMDPYA VTYTAAFTAL ILARRTRVAL APAAALMVAE LVALFFAGVQ PTTHLAQCAG LFVLVLVLSA LRTHALRTGP MRIRAGNPAP IQAVAVLVVM VIATLGARAL PMDGKRFDPS TLVRSALSLH PTVNPLAEVR AQLQRPQAED LFSVRISGAN GGERLSSRAT NTGAAGDDGL GGGIDLIQCA ALDSFSGSQW SSSAAYLVSG PDLAPGPVQP DATRLTEQIS LTGLSGPFLP TIGRPSTITG SFGTDAVVGF DTVSGTLVTD AAKLSGVSYK VTSAVSPSSS ALENAPVGTG SAYAPYLALP SVPPDLSALA ATITSNYQGP YAKAVAIERY LRQLPYNVNA QPGESYAALE RMLDAEDPQS AAAYGEQHVS AFAVLARSAG LPTRIAVGYA LTGVNASQYT VTTADAYAWD QVYFQGHGWV DFDPTDPNRG FRLPDQQPLE VAVTVPTPSI PPPPTPITSV PSATPTPIVP PVAPRQHSAF PWRAVAGTGV GLLVCAAAAV PLIRASTRRR RRRARLSGGP SHRVAGAWLE ICDRLGRAGV PIPPTQTVVE AARTAEAAAA REPLGRGRTA RIRRRAAESL ATLAPLTDRA VFAPHPVSES DAQDALTVER EFRSEFSRIS GAARMIRRRD RAARGKKGQR
|
| |