Gene Shewana3_1778 details

Gene Information

Locus tag: Shewana3_1778
Symbol:
ID: 4478665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2092479
End bp: 2094530
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 48%
IMG OID: 639726361
Product: carboxy-terminal protease
Protein accession: YP_869417
Protein GI: 117920225
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
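
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2092479
end_bp = 2094530
gene_length_bp = 2052
protein_length_aa = 683

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon, which is not translated into an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2052 0 683
```

Under these assumptions the span matches the listed gene length, and 684 codons minus the stop codon give the listed 683 aa.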


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0159186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAC TCACTTTGGC TACATCCATT GCCACTGTTT TTGTCGGATT CTCGGCTTGG 
GCTGTACCAC CCACGATTCA AATCAGCGAG TTACCCACTC TCAAGCAGGA AGCGCAGCAT
AAAGTGGCGA GTAAGCGAGT GACGGATTTA TACACTCGTT CCCACTATCA CAGATTCGCC
TTAGACGATG CGTTTTCGGC GCAGATCTTC GACCGTTACC TGCAACAACT CGATTACCGT
CGTAATGTGC TGACGCAAGC CGATGTCGAC AGTTTTAAGC CTTATACCAA TCAATTCGAT
GATATGTTGA GTTCGGGCGA TCTTGATCCT GCCTACAAGA TGTTTGATTT GGTGCAAAAG
CGCCGCTACG AAGGCTTTGT GTACGCGCTT TCTCTGCTCG ATAAAGAGAT GGATTTCACC
GTGCCAGGTG ATGCCTACGA GTACGACAGA GAAGATGCGG CTTGGCCGAA AGATCAAGCC
GAGATCAACG AGTTGTGGCG CCAACGTGTT AAATACGATG CGTTGAATCT GAAACTCACA
GGCAAGAAAT GGCCTGAGAT CGTCGATATT CTGCAAAAGC GTTATAACAA CGCCATCAAA
CGTCTGACCC AGACCAATAG CGAAGATGTG TTCCAAGCGG TGATGAATGC ATTTTCTCGC
AGCATCGAGC CACACACTAG CTATTTATCG CCCCGTAATG CTGAGCGTTT CCAAATGGAA
ATGAACTTAA GCCTCGAAGG TATTGGTGCG CAGTTACAGC TCGAAGACGA TTACACTGTC
ATCAAGAGTT TGATTGCAGG TGGTCCTGCG GCCAGCAGTG AAAAACTGTC GCCGGAAGAT
AAGATTGTCG GTGTCGGCCA AGAAGGCGGT GAGATTGTTG ATGTGATCGG CTGGCGATTA
GACGATGTGG TCGATCTGAT TAAAGGCCCT AAGGGCAGTA AAGTTATATT ACAGATTTTA
CCTAAGAAGG GCGGTTCTAA CGCTAAGCCG TTCAATCTGA CCTTAGTGCG CGACAAAATC
CGTCTAGAAG ACCGTGCCGC GACCTCAAAG ATCATCGAGC CAAAAGACGG TGAATACGCC
AACCGTAAAG TGGGTGTGAT TCAAATTCCT GGTTTCTATA TGAATTTATC CCAGGATGTC
GAAAAAGAAT TGGTGAAGTT AAACGAAGCC AAGGTTGAAG GTGTCGTTAT CGACTTACGT
GGTAATGGCG GCGGTGCGTT AACCGAAGCC GTATTACTGA CCGGACTCTT TATCGATATG
GGCCCTGTAG TGCAAGTGCG TGACGCCGAT GGTCGAGTGT CTGCCCACCG TGATAACGAT
GGCAAGACGA CGTATGCTGG TCCGTTAACC ATTATGGTTG ACCGTTACAG TGCATCAGCC
TCTGAGATTT TTGCCGCTGC CTTGCAAGAT TATGACCGTG CGCTGATTGT CGGTGAGTCT
AGCTTTGGTA AAGGCACTGT GCAGCAGCAT AAGAGCCTGG GTCGTATCTA CGATATGTAC
GAGAAGCCAA TTGGCCATGT GCAGTATACG ATTCAAAAGT TCTACCGTAT CAACGGTGGT
AGTACGCAGC TTAAGGGCGT AACCCCGAAC ATTGCTTACC CAAGTGCGTT AGAGCCGGGT
GAATACGGTG AAGCGGAAGA GAAGAATGCT CTACCTTGGG ACAAAGTGCC GATGGCGCAA
TACGGTACGC TAAACGACAT CACTCCTGAG TTAGTGGCGA GTTTAGAGAA AAAACACCTT
GCCCGTATTC AGAACGATGT TGAGTTTAAC TATATCAATC AAGATATTGC CGACTTTAAA
AAGCATCATA AAGAGAAAAC TGTCTCCTTA GTTGAAAGTG AGCGTATTGC CTCACGTGAA
GCCGATGAGA AGAAAGTCCT CGATAGAACC AACGAGCGTC GTGTTGCCCA TGGTTTAGCC
GCGGTTAAAT CGATGGAAGA CATTAAAGAC AAAGACGATG TTGAAGCACC GGATGCCTTC
TTAGACGAAA CGGCCTATAT CACCTTAGAT ATGGCGGATG CAAAAAAGCT GGCTAACGCT
GGCACTAAAT AG
 
Protein sequence
MRKLTLATSI ATVFVGFSAW AVPPTIQISE LPTLKQEAQH KVASKRVTDL YTRSHYHRFA 
LDDAFSAQIF DRYLQQLDYR RNVLTQADVD SFKPYTNQFD DMLSSGDLDP AYKMFDLVQK
RRYEGFVYAL SLLDKEMDFT VPGDAYEYDR EDAAWPKDQA EINELWRQRV KYDALNLKLT
GKKWPEIVDI LQKRYNNAIK RLTQTNSEDV FQAVMNAFSR SIEPHTSYLS PRNAERFQME
MNLSLEGIGA QLQLEDDYTV IKSLIAGGPA ASSEKLSPED KIVGVGQEGG EIVDVIGWRL
DDVVDLIKGP KGSKVILQIL PKKGGSNAKP FNLTLVRDKI RLEDRAATSK IIEPKDGEYA
NRKVGVIQIP GFYMNLSQDV EKELVKLNEA KVEGVVIDLR GNGGGALTEA VLLTGLFIDM
GPVVQVRDAD GRVSAHRDND GKTTYAGPLT IMVDRYSASA SEIFAAALQD YDRALIVGES
SFGKGTVQQH KSLGRIYDMY EKPIGHVQYT IQKFYRINGG STQLKGVTPN IAYPSALEPG
EYGEAEEKNA LPWDKVPMAQ YGTLNDITPE LVASLEKKHL ARIQNDVEFN YINQDIADFK
KHHKEKTVSL VESERIASRE ADEKKVLDRT NERRVAHGLA AVKSMEDIKD KDDVEAPDAF
LDETAYITLD MADAKKLANA GTK
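
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard one, which is assumed here to be adequate because NCBI translation table 11 assigns the same amino acids to codons and differs only in the start codons it permits. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
# Standard codon assignments in TCAG order. Assumption: this is sufficient for
# translation table 11, which shares the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(text):
    """Drop the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon.

    Raises KeyError on characters other than A, C, G, T (for example N).
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCGAAAAC TCACTTTGGC TACATCCATT GCCACTGTTT TTGTCGGATT CTCGGCTTGG"
last_line = "GGCACTAAAT AG"

print(translate(first_line))  # MRKLTLATSIATVFVGFSAW
print(translate(last_line))   # GTK*
```

The first 60 bases translate to the first 20 residues of the listed protein, and the final 12 bases translate to its last three residues followed by a TAG stop codon. The last line stays in frame because the 34 preceding lines hold 2040 bases, a multiple of 3.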