Gene Snas_2903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2903 
Symbol 
ID8884102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3053814 
End bp3055712 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content67% 
IMG OID 
ProductMicrobial collagenase 
Protein accessionYP_003511671 
Protein GI291300393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.439593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGA CCCGAACCCG GCGACTGATC GGCGCGGCCA CCGTCGCCGT CGTCCTGACC 
GCCGGTATCG CCCTGACCGC CTCGCAAGGC GCGGCGGACC CGGCACCCGA CCGACCCCAC
ACCCCACCGA CGGCGGCCAG CCCCGCCGCC ACCCACACCG ATCCCGGCCC GGCCGACATG
CGCGACCGGC CCCCGAACCC CGCGCCCGAC CCCGGCACCG ACAACCCTTA CGCCGCCGAG
GATCCGAAAC GTCCCTCCGA CAAGCACGGC ACCCGGCAGG CCTGCGACCC GGCGGACTTC
ACCTCACGCA CCGGCGACGA ACTGGTGGCG TTCATCCGCG AGACCGAGAC CTCCTGCGTC
AACACCCTGT TCGGCCTCAC CGGCACCGAC GCCAACGGCG CCTTCCGTGA GGAACAGATG
ATCTCGGTGG CCAACGGCAT GCGCGACAAC GCCGCGTCCT ATCCCGGCGA CAACTCCACC
GGCACCGCGC CGCTGGTGCT GTACCTGCGC GCCGGATACT ACGTCCAATG GGGTCACCCC
GACGACGTCG GCGAATATGG CCCCGAACTG GACCAGGCCG CGAAGTCCGC ACTGGAGACG
TTCTTCGCCA GCGAGCGCGC CTTCGACGTC AACGACGCCA ACGGCGAGAT CCTCGCCGAG
ACGGTCACCC TCGTCGACAG CGCCGAACAG AATCCCCACT TCCTCGACGT GGTCAAGAAA
CTGCTCACCG ACTACGACAC CTCCTGGAAC GAGCACTACT GGATGACCGC CGCGGTCAAC
AACGTCTACA TCGTCCTGTT CCGCGGCCAC CAGCTGCCGG AGTTCGTCGA AGCCGTGAAG
GCCGACTCCT CGGTCCTGGA CACCGTGCAC GACTTCGCCA TCACCCACAT CGGCCAGTCC
CGGGGCGACC AGTGGTACCT GATCCACAAC GCCGGACGTG AACTGGGCCG GTTCCTGCAA
CACGACTCAC TGCGCGACAC CGCCCGGCCG CTGGTGAAGG ACCTGCTCGA CAGCAGCGAC
ATGACCGGCG ATTCAGCGCC ACTGTGGATG GGTTCGGCGG AGATGGCCGA CTACTACGAC
CAGTCCAACT GCGACTACTA CGACGTCTGC GACCTGGCCA ACCGGGTCTC CGAGGCCGTC
CTGTCGGTCA AGCACAGCTG CGGCGACACG GTTACGATCC GGGCCCAGCA ACTGGACGCG
GGGGAGTTGT CGCAAACCTG CGACAGCCTG TCCGGCCAGG ACGGGTTCTT CCACGACGTC
GCGAACGACC CCGGACCAGT GGCCGACGAC AACAACGACT CCCTGGAGGT GGTCGTGTTC
GACTCCAGCG TGGACTACCG GGTCTTCGCC GGAGCCGTGT TCGGCATCGA CACCAACAAC
GGCGGCATGT ACCTGGAAGG CGACCCCGCC GACCCCGACA ACCAGCCCCG GTTCATCGCC
CACGAAGCCG ATTGGCAGAC CGAGTTCGCC ATCTGGAACC TCAACCACGA GTACACCCAC
TACCTCGACG GCCGCTTCGA CATGTACGGC GACTTCGCCG CTGGCGTCAG CACCCCGACC
ATATGGTGGA TCGAGGGCTT CGCCGAGTAC GTGTCCTACT CGTACCGCGA CGAGCAGTAC
GACGCCGCCA TCGCCGAGGC CGCCAAGGGC ACCTACGACC TGGACACCCT GTTCAGCACC
ACCTACGACC ACGACCAGAC CCGCGTCTAC CAGTGGGGCT ACCTGGCGGT GCGGTTCATG
ATCCAGAACC ACCGCCAGGA CGTGGACACG GTGCTGGGCC ACTACCGCTC CGGCGACTGG
GACGCCGCCT ACGCGCACCT CACCGACACC ATCGGCTCTG AGTACAACGC CGAATGGCGC
GACTGGCTGA CCGCCTGCGG CGCGGGCGAC TGCGGCTGA
 
Protein sequence
MSLTRTRRLI GAATVAVVLT AGIALTASQG AADPAPDRPH TPPTAASPAA THTDPGPADM 
RDRPPNPAPD PGTDNPYAAE DPKRPSDKHG TRQACDPADF TSRTGDELVA FIRETETSCV
NTLFGLTGTD ANGAFREEQM ISVANGMRDN AASYPGDNST GTAPLVLYLR AGYYVQWGHP
DDVGEYGPEL DQAAKSALET FFASERAFDV NDANGEILAE TVTLVDSAEQ NPHFLDVVKK
LLTDYDTSWN EHYWMTAAVN NVYIVLFRGH QLPEFVEAVK ADSSVLDTVH DFAITHIGQS
RGDQWYLIHN AGRELGRFLQ HDSLRDTARP LVKDLLDSSD MTGDSAPLWM GSAEMADYYD
QSNCDYYDVC DLANRVSEAV LSVKHSCGDT VTIRAQQLDA GELSQTCDSL SGQDGFFHDV
ANDPGPVADD NNDSLEVVVF DSSVDYRVFA GAVFGIDTNN GGMYLEGDPA DPDNQPRFIA
HEADWQTEFA IWNLNHEYTH YLDGRFDMYG DFAAGVSTPT IWWIEGFAEY VSYSYRDEQY
DAAIAEAAKG TYDLDTLFST TYDHDQTRVY QWGYLAVRFM IQNHRQDVDT VLGHYRSGDW
DAAYAHLTDT IGSEYNAEWR DWLTACGAGD CG