Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_2903 |
Symbol | |
ID | 8884102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3053814 |
End bp | 3055712 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Microbial collagenase |
Protein accession | YP_003511671 |
Protein GI | 291300393 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.439593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGA CCCGAACCCG GCGACTGATC GGCGCGGCCA CCGTCGCCGT CGTCCTGACC GCCGGTATCG CCCTGACCGC CTCGCAAGGC GCGGCGGACC CGGCACCCGA CCGACCCCAC ACCCCACCGA CGGCGGCCAG CCCCGCCGCC ACCCACACCG ATCCCGGCCC GGCCGACATG CGCGACCGGC CCCCGAACCC CGCGCCCGAC CCCGGCACCG ACAACCCTTA CGCCGCCGAG GATCCGAAAC GTCCCTCCGA CAAGCACGGC ACCCGGCAGG CCTGCGACCC GGCGGACTTC ACCTCACGCA CCGGCGACGA ACTGGTGGCG TTCATCCGCG AGACCGAGAC CTCCTGCGTC AACACCCTGT TCGGCCTCAC CGGCACCGAC GCCAACGGCG CCTTCCGTGA GGAACAGATG ATCTCGGTGG CCAACGGCAT GCGCGACAAC GCCGCGTCCT ATCCCGGCGA CAACTCCACC GGCACCGCGC CGCTGGTGCT GTACCTGCGC GCCGGATACT ACGTCCAATG GGGTCACCCC GACGACGTCG GCGAATATGG CCCCGAACTG GACCAGGCCG CGAAGTCCGC ACTGGAGACG TTCTTCGCCA GCGAGCGCGC CTTCGACGTC AACGACGCCA ACGGCGAGAT CCTCGCCGAG ACGGTCACCC TCGTCGACAG CGCCGAACAG AATCCCCACT TCCTCGACGT GGTCAAGAAA CTGCTCACCG ACTACGACAC CTCCTGGAAC GAGCACTACT GGATGACCGC CGCGGTCAAC AACGTCTACA TCGTCCTGTT CCGCGGCCAC CAGCTGCCGG AGTTCGTCGA AGCCGTGAAG GCCGACTCCT CGGTCCTGGA CACCGTGCAC GACTTCGCCA TCACCCACAT CGGCCAGTCC CGGGGCGACC AGTGGTACCT GATCCACAAC GCCGGACGTG AACTGGGCCG GTTCCTGCAA CACGACTCAC TGCGCGACAC CGCCCGGCCG CTGGTGAAGG ACCTGCTCGA CAGCAGCGAC ATGACCGGCG ATTCAGCGCC ACTGTGGATG GGTTCGGCGG AGATGGCCGA CTACTACGAC CAGTCCAACT GCGACTACTA CGACGTCTGC GACCTGGCCA ACCGGGTCTC CGAGGCCGTC CTGTCGGTCA AGCACAGCTG CGGCGACACG GTTACGATCC GGGCCCAGCA ACTGGACGCG GGGGAGTTGT CGCAAACCTG CGACAGCCTG TCCGGCCAGG ACGGGTTCTT CCACGACGTC GCGAACGACC CCGGACCAGT GGCCGACGAC AACAACGACT CCCTGGAGGT GGTCGTGTTC GACTCCAGCG TGGACTACCG GGTCTTCGCC GGAGCCGTGT TCGGCATCGA CACCAACAAC GGCGGCATGT ACCTGGAAGG CGACCCCGCC GACCCCGACA ACCAGCCCCG GTTCATCGCC CACGAAGCCG ATTGGCAGAC CGAGTTCGCC ATCTGGAACC TCAACCACGA GTACACCCAC TACCTCGACG GCCGCTTCGA CATGTACGGC GACTTCGCCG CTGGCGTCAG CACCCCGACC ATATGGTGGA TCGAGGGCTT CGCCGAGTAC GTGTCCTACT CGTACCGCGA CGAGCAGTAC GACGCCGCCA TCGCCGAGGC CGCCAAGGGC ACCTACGACC TGGACACCCT GTTCAGCACC ACCTACGACC ACGACCAGAC CCGCGTCTAC CAGTGGGGCT ACCTGGCGGT GCGGTTCATG ATCCAGAACC ACCGCCAGGA CGTGGACACG GTGCTGGGCC ACTACCGCTC CGGCGACTGG GACGCCGCCT ACGCGCACCT CACCGACACC ATCGGCTCTG AGTACAACGC CGAATGGCGC GACTGGCTGA CCGCCTGCGG CGCGGGCGAC TGCGGCTGA
|
Protein sequence | MSLTRTRRLI GAATVAVVLT AGIALTASQG AADPAPDRPH TPPTAASPAA THTDPGPADM RDRPPNPAPD PGTDNPYAAE DPKRPSDKHG TRQACDPADF TSRTGDELVA FIRETETSCV NTLFGLTGTD ANGAFREEQM ISVANGMRDN AASYPGDNST GTAPLVLYLR AGYYVQWGHP DDVGEYGPEL DQAAKSALET FFASERAFDV NDANGEILAE TVTLVDSAEQ NPHFLDVVKK LLTDYDTSWN EHYWMTAAVN NVYIVLFRGH QLPEFVEAVK ADSSVLDTVH DFAITHIGQS RGDQWYLIHN AGRELGRFLQ HDSLRDTARP LVKDLLDSSD MTGDSAPLWM GSAEMADYYD QSNCDYYDVC DLANRVSEAV LSVKHSCGDT VTIRAQQLDA GELSQTCDSL SGQDGFFHDV ANDPGPVADD NNDSLEVVVF DSSVDYRVFA GAVFGIDTNN GGMYLEGDPA DPDNQPRFIA HEADWQTEFA IWNLNHEYTH YLDGRFDMYG DFAAGVSTPT IWWIEGFAEY VSYSYRDEQY DAAIAEAAKG TYDLDTLFST TYDHDQTRVY QWGYLAVRFM IQNHRQDVDT VLGHYRSGDW DAAYAHLTDT IGSEYNAEWR DWLTACGAGD CG
|
| |