Gene Shewana3_0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_0800 
Symbol 
ID4477011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp922790 
End bp924706 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content48% 
IMG OID639725336 
Productpeptidase U32 
Protein accessionYP_868444 
Protein GI117919252 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC CAGAGATTTA CACCCCTGAG ACACTTTCTC ACGTTAATAA CCGTCTAGAG 
TTATTGGCGC CAGCAAAAAA TGCCGATTAT GGTATTGAAG CCATTCGCCA TGGTGCCGAT
GCGGTTTATA TCGGTGGTCC AGCATTTGGG GCGCGTGCAA CGGCGGGTAA CAGTGTGGAA
GATATCGCTC GCCTATGTGC TTTTGCACAT AAGTATCATG CTCAAGTGTT TGTCGCCCTC
AATACGATTC TGATGGACGA TGAGCTCGAG ACGGCTGAAA AACTGATTTG GGATGTGTAT
AACGCGGGCG CCGATGCACT GATTGTGCAG GACATGGGCG TATTGCAACT TAATCTGCCG
CCGATTGCGC TGCATGCTAG TACGCAAATG GACAACCGTA ATCCCGAAAA AGTTGCCTTC
TTAGAGCAAG TGGGATTCTC GCAGGTGGTA TTGGCGCGCG AGCTGGGCTT AAGCCAAATC
CGTGAGGTTG CGGCTCATAC CAATATGCAG ATTGAGTTCT TTATCCACGG CGCCCTGTGT
GTGGCCTACA GTGGATTATG TAACTTAAGC CATGCCTTTA GTAATCGCAG TGCTAACCGC
GGTGAATGTT CGCAAATGTG TCGTTTGCCG GGCAATCTAA AGACTCGCCA AGGGGATGTG
TTAGCACAAA ACGAACACTT ACTCTCATTA AAAGATAATA ATCAAACCGA TAACCTCGAA
GCCCTGATAG ATGCTGGCGT GCGTTCGTTC AAAATCGAGG GGCGTTTAAA AGATTTAAGT
TATGTAAAAA ACGTGACTGC TCATTATCGC CAAAAACTCG ATGCCATTAT GGCGCGTCGC
CCTGAGTTTG TGGCTTCATC CCATGGCCGT ACTGAGCATA CTTTTACCCC AGATCCCGAA
AAAACCTTTA ATCGTGGCAG CACCGACTAT TTTGTCCATG AGCGTAGCCA AGGGATTAAA
GATTTTCGCT CGCCCAAATA TATTGGCCAA GATGTGGGTA AAGTGGTCGC TCTTGGTAAG
GATTTTATTC AAGTCAGTTC AACCCACGAA TTTAATAACG GTGATGGGTT GGCTTACTTT
CCACCCAATT ATGCGATGGC AAAGCAGTCC GACGACAAGT TGCAAGGTTT ACGGGTAAAC
CGCGCCGAAG GCCATAAGCT GCATGTGTTA CAAGTTCCAC GGGATTTACG TGTTGGTATG
ACTTTATACC GTAACCATAA TCAGGCATTT GAAGCCTTGT TAGCGAAAGA GTCGGCCAAG
CGAATTATCG GTGTAGATAT GCGTTTAACC GATACCGCTA TGGGCGTAGC GCTGACTTTA
ACCGATATCT ACGGCCTCAG TGCTACGGTT GAGTTGGAAG TCGAAAAGAC ACCCGCCACC
GACGCTGAAA AAACCTTGCA GACGATTCGT ACACAATTAT CGAAGCTTGG CAGTACCGAT
TTTACGGCGC GCCAGATCAG TATCGAAACC GCTCAGCCTT GGTTCCTGCC TGCTTCAACC
CTTAACGGCC TGCGCCGTGA TGCGGTAGCG GCATTAGAAC TTGCCCGTGT TGAAGGTTAC
CAGCGCCCAA AACCTTGGAA ATATAACCAA GATGCTGTGT ATCCATTCAA ACACTTAAGT
TACTTAGGTA ACGTGGCAAA CGAAAAGGCG AAGGATTTTT ATCAACGCCA TGGCGTGATT
GAAATTCAAG ATACCTACGA GAAAAACGGC GTGACTGAAG ACGTGCCTTT GATGGTGACT
AAGCACTGCC TGAGATTTAA CTTTAATCTT TGTCCTAAGG AAGTGCCGGG AATAAAGGCC
GATCCTATGG TGCTCGAAAT CGGTAACGAT GTACTTAAGT TAGTATTTGA TTGTCCTAAG
TGCGAGATGC TGGTTGTCGG TGAAAACCGT CAGGTTCGCG GCCAAAAAGC GCTTTAA
 
Protein sequence
MSQPEIYTPE TLSHVNNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE 
DIARLCAFAH KYHAQVFVAL NTILMDDELE TAEKLIWDVY NAGADALIVQ DMGVLQLNLP
PIALHASTQM DNRNPEKVAF LEQVGFSQVV LARELGLSQI REVAAHTNMQ IEFFIHGALC
VAYSGLCNLS HAFSNRSANR GECSQMCRLP GNLKTRQGDV LAQNEHLLSL KDNNQTDNLE
ALIDAGVRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFVASSHGR TEHTFTPDPE
KTFNRGSTDY FVHERSQGIK DFRSPKYIGQ DVGKVVALGK DFIQVSSTHE FNNGDGLAYF
PPNYAMAKQS DDKLQGLRVN RAEGHKLHVL QVPRDLRVGM TLYRNHNQAF EALLAKESAK
RIIGVDMRLT DTAMGVALTL TDIYGLSATV ELEVEKTPAT DAEKTLQTIR TQLSKLGSTD
FTARQISIET AQPWFLPAST LNGLRRDAVA ALELARVEGY QRPKPWKYNQ DAVYPFKHLS
YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMVT KHCLRFNFNL CPKEVPGIKA
DPMVLEIGND VLKLVFDCPK CEMLVVGENR QVRGQKAL