Gene Shewana3_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2028 
Symbol 
ID4476393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2425817 
End bp2426992 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content42% 
IMG OID639726611 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_869665 
Protein GI117920473 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000400239 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAAAGT TGGGCGATAT TTTTGATATA GCTAGGGGAG GATCGCCTCG CCCTATCGAC 
GACTATATAA CCGATGCCGA TGACGGGTTA AATTGGATAT CCATAAAAGA CGCTAGTAAC
AGTAATAAAT ACATTAATTC AACAAAACTA AAAATTAAAC CTGAAGGTTT AACTAAAACT
CGTATGGTTT ATCCAGGCGA TTTTTTGCTG ACAAACTCTA TGAGTTTTGG ACGGCCATAT
ATTATGAATA CTACGGGATG TATTCATGAC GGATGGTTAG TACTATCAGG GAATCCGGAT
AAAGTTAATT CGGATTATTT CTACTATTTA CTAGGAAGCG ATACTTTAAA ACAACGTTTT
TCTGGATTAG CAGCTGGTGC TGTTGTCAAA AACCTCAATA CTGAATTAGT TAAAAGTGTG
GAAGTCCCAC TCCCACCACT AGCCGAGCAA AAACGGATTG CTGCGATACT GGACAAAGCC
GACGCCATCC GCCGCAAACG CCAACAAGCC ATCCAACTCG CCGACGACTT ACTCCGCGCC
GTCTTCCTAG AAATGTTCGG CGACCCAGTC ACCAACCCAA AAGGCTTTCA GAAGTCAAAA
TTGTCGGCTC TTGCCGACGT TATTACTGGA TTTGCGTTTA AAAGCGCTGA GTATGTCGAA
GACAGTGATG ATGCTGTAAG GCTTTGTCGT GGGGTTAATA CACTGACTGG CTATTTTGAG
TGGAAAGATA CTGCTTTTTG GGATTCAAAT AAAATAAATG GGCTACACAA TTACAAACTA
GAAGCTGGCG ACGTGATACT AGCTATGGAC CGTCCATGGA TTTCAAGTGG ATTAAAGGTA
TGTGTCTTCC CTGAAAACGA GCGAGATACA TATTTAGTTC AACGTGTAGC AAGAATCCGC
TCAAAACAGC CACGTTATAC CGATTATTTG TATTCAAGCA TTTTGTCACC GGCATTTGAG
AAGCATTGCT GTCCTACAGA AACAACAGTC CCTCATATTT CGCCAGTTGA ACTAAAGAAC
TTTGAGATTC TGGTACCTGA TGAAAAATCA GTTAGCAAAT ATCACGATAT AGTCTCTAAG
TTAAGGCGCT CGAAAGATCG AATGGAAATG AACTTGACTG AGGCTAATCA AATCTTCAAC
TCGCTAAGCC AAAAAGCCTT CTCCGGCCAG CTTTAG
 
Protein sequence
MVKLGDIFDI ARGGSPRPID DYITDADDGL NWISIKDASN SNKYINSTKL KIKPEGLTKT 
RMVYPGDFLL TNSMSFGRPY IMNTTGCIHD GWLVLSGNPD KVNSDYFYYL LGSDTLKQRF
SGLAAGAVVK NLNTELVKSV EVPLPPLAEQ KRIAAILDKA DAIRRKRQQA IQLADDLLRA
VFLEMFGDPV TNPKGFQKSK LSALADVITG FAFKSAEYVE DSDDAVRLCR GVNTLTGYFE
WKDTAFWDSN KINGLHNYKL EAGDVILAMD RPWISSGLKV CVFPENERDT YLVQRVARIR
SKQPRYTDYL YSSILSPAFE KHCCPTETTV PHISPVELKN FEILVPDEKS VSKYHDIVSK
LRRSKDRMEM NLTEANQIFN SLSQKAFSGQ L