Gene Sare_4435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4435 
Symbol 
ID5705913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5010412 
End bp5012832 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content66% 
IMG OID641273851 
Productpeptidase M6 immune inhibitor A 
Protein accessionYP_001539200 
Protein GI159039947 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.431227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000592634 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGTCTAC TCGGGCTCTC GCTGACAGCG ACGGGACTGG CGGTCGGACC GTCCGCCAGC 
GCCGCGCCGA AGCCCCAGCA GCCGACACCT GCTCCATCGG TCGCTGAGCC TGCCCACGTT
GACCACGACC TGCCCAACCC GCTGGAAGAA AAGCGGCGGG CGCTGCGTCA GGAGGGGCTG
AGCAATGTCC TCTCCGGCAA GGCGACGTCG CAGCGAATCA ACGGCAGCAC CGTCGTCAAG
GTCGGCGAGA CGGCTGGTGG CGCTCCGGCC ACGAAACGCA TTGCCCGTGA CCGCGACGGC
AAGAAGGACC AGTACGTCGA GCTCTCCCGG GAGAGCACGG ACCGAATCTT CGTGATTCTG
GCCGAATTCG GTAACGACCG GCACCCGGAC TACCCCGACC AGGACACCGC CCCCACCTGG
CCGGGCCCGG CGCGGTTCGA CGGACCGCTA CACAACGAGA TTCCCGAGCC GAACCGAGCG
CTGGACAACT CCACGTCCTG GCAGCCGGAC TACAGCGCCG ACCACTTCCG GACGCTGTAC
TTCGGTGAGT CGCCCGGCGA CGAGTCGGTG AAGCAATACT TCGAGGCCCA GTCCTCGGGC
CGCTACAGCG TCGAGGGCAC GGTCACCGAC TGGGTCAAGG TTCAGTACAA CGCGGCCCGC
TACGGCCGCT CGTTCGACGA CCCCACGGAC GCCAACGGCG ACGACCCGGC AGTCTGCGGC
AGCAGGGTCT GCACCAACGT CTGGACGTTG GTCCGGGACG CCGCCAACCA GTGGGTCGCC
GATCAGAAGG CCGCCGGCCG CACCGACGCG GAAATCGCCG AAGAGATCAA GGCGATGGAC
CAGTGGGACC GGTACGACCA CGACTCCGAC GGCAACTTCA ACGAACCCGA CGGTTACATC
GACCACTTCC AGATCGTCCA CTCCGGCGGC GACATGGCCA ACAGTGACCC GCACCAGGGT
GAGGACGCGA TCTGGAGCCA CCGTTGGTAC GCGTTCGCGT CCGACCAGGG TCGCACCGGC
CCACCCAACT TCCCGGCCGG CGGCACCCAG ATCGGTGACA CCGGCATCTG GATCGGTGAC
TACACCATCC AGCCGGAGAA CGGTGGGCGC AGCGTCTTCT ACCACGAGTA CGCTCACGAC
CTCGGCCTAC CGGATGACTA CAACATCCTC AGCGGTGGCG ACAACAACAA CGAGCACTGG
ACGCTGATGG CCCAGAGCCG GCTGGGCGCC GAGGGCGACG GCGGCATCGG CGAGCGCGGT
GGCGACCTGG GTGCCTGGAA CAAGCTCCAG CTCGGCTGGC TCGACTACGA GGTGGTCGTC
GCGGGTCAGA AACGCACCAT GACCCTTGGC CCACAGGAGT ACAACTCCAA GAAGCCGCAG
GCTGCCGTGG TCGTGCTCCC GCAGCGGGAG TACTCGTTCG ACAACGGTGC ACCGTTCGCC
GGCACGAAGC AGTTCTTCTC CGGAAACGAG GACGACCTCA ACAACACGAT GACCCGGGCG
CTGGACCTCA CCGGGAAGTC GTCGGCCGCG TTGACGTTGA AGGGCCGCTA CAGCATCGAG
GCCGACTACG ACTACCTGTT CTTCGAGGTG TCGGAGGACG GCGGTGACAC GTGGACGCCG
CTGCCCGGCA CCGCCGACGG CAAGCCCTTC AAGGAGATCT CCGCCGGACG GTTCGCCCTG
GACGGCAGCA GCAATGGCGA ATGGGTCGAC GTCAACATCC CGATGGACGC CCAGGCCGGC
AAGACAATCC AGTTCCGGCT GCGCTACCAG ACCGACGGTG GTGTCTCCGA GGGCGGCTTC
TACGGCGATG AGATCACCGT GACCGCCGAC GGCGAGACGG TCCTCAACGA TGGTGGCGAG
ACCGGCACCG GCGACTGGTC CCTGGCGGGC TGGAGCATCG TCGAGGAGAC CTACACCCGG
CTCTTCGACA ACTACTACAT CGCCGGCCAC CGGTCGTACG TCTCGTACGA CAAGTACCTG
AAGACCGGGC CGTACTACTT CGGATACGCG AACACCCGCC CCGACTGGGT GGACCACTAC
GCCTACCAGG AGGGCCTGCT GATCTCCTAC TGGAACACCC GGTGGGCAGA CAACGACACG
TTCGCGCACC CGGGTGAGGG GCGTAACCTC TACATCGACT CGCGCCCGCG GCCGATCTAC
AACCTGACCG GCGAGCCGTG GCGGGCCCGC GTCCAGGTGT ACGACGCGCC GTTCAGCCTC
AAGAAGGCGG ACTCGTTCAC GCTGCACATC GACAGCCAGC CACACTACAT TCGGGGCCAG
GCCGCGCAGC CGCTGTTCGA CGACACCAAG AAGTACTGGT ACGAGGAGTT GCCCAACCAC
GGCGTCAAAC TCCCCGCCAC CGGCACGAAG ATCAAGGTTC TGAAGCAGAA GGGCACCTCC
ATCAAGGTCC GCTTCTCCTG A
 
Protein sequence
MGLLGLSLTA TGLAVGPSAS AAPKPQQPTP APSVAEPAHV DHDLPNPLEE KRRALRQEGL 
SNVLSGKATS QRINGSTVVK VGETAGGAPA TKRIARDRDG KKDQYVELSR ESTDRIFVIL
AEFGNDRHPD YPDQDTAPTW PGPARFDGPL HNEIPEPNRA LDNSTSWQPD YSADHFRTLY
FGESPGDESV KQYFEAQSSG RYSVEGTVTD WVKVQYNAAR YGRSFDDPTD ANGDDPAVCG
SRVCTNVWTL VRDAANQWVA DQKAAGRTDA EIAEEIKAMD QWDRYDHDSD GNFNEPDGYI
DHFQIVHSGG DMANSDPHQG EDAIWSHRWY AFASDQGRTG PPNFPAGGTQ IGDTGIWIGD
YTIQPENGGR SVFYHEYAHD LGLPDDYNIL SGGDNNNEHW TLMAQSRLGA EGDGGIGERG
GDLGAWNKLQ LGWLDYEVVV AGQKRTMTLG PQEYNSKKPQ AAVVVLPQRE YSFDNGAPFA
GTKQFFSGNE DDLNNTMTRA LDLTGKSSAA LTLKGRYSIE ADYDYLFFEV SEDGGDTWTP
LPGTADGKPF KEISAGRFAL DGSSNGEWVD VNIPMDAQAG KTIQFRLRYQ TDGGVSEGGF
YGDEITVTAD GETVLNDGGE TGTGDWSLAG WSIVEETYTR LFDNYYIAGH RSYVSYDKYL
KTGPYYFGYA NTRPDWVDHY AYQEGLLISY WNTRWADNDT FAHPGEGRNL YIDSRPRPIY
NLTGEPWRAR VQVYDAPFSL KKADSFTLHI DSQPHYIRGQ AAQPLFDDTK KYWYEELPNH
GVKLPATGTK IKVLKQKGTS IKVRFS