Gene Shewmr4_0640 details

Gene Information

Locus tag: Shewmr4_0640
Symbol:
ID: 4251658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 745025
End bp: 746245
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 638117203
Product: hypothetical protein
Protein accession: YP_732777
Protein GI: 113968984
COG category:
COG ID:
TIGRFAM ID:
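
The coordinates and lengths listed above are mutually consistent, which the short sketch below illustrates. It assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a complete CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(745025, 746245)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length and Protein Length.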


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000134735
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAGTA ATTTACATCA GGCTGGGGTG AAGCTACTCA AGCAGTTAGG TCGCCATGCT 
GACATCATTA TGGACGCCTA TTTAGCGGGT TCCTTAAACG AGGATGCCCA TGATCCCGCC
GTAGTTGAAA AACTCAAGCA GGCGGGGATT TTATGGCGCC CAGAGCCTGA CCAAGAGCTG
CGCCTTAAAC GCTCGGTGCG TGCCTTACTC GAAGAGGGCT TAAGTGATGA ACGCAATCGC
CAAATCGACT CCAACGTCGG CTCGGCGCTC GCCACCATTA AGACCTTGGC CGACCACTAT
AAAGAAGCGC GCCACAGCTC AGATTACAGT GCCGCCGAGG CGTATCTGTC TGATTTAAGT
GAGCATGTGT ATAGCTTTGC CGACAGTTTA CGTTACTCCA TCCGCGTGTT GTGGGGGCGC
ATCAACAACG AGTTCGGTTA TGTCGGTACC ATTAACGCTA AGATCCGTGA AAACGAACTC
GCTCAAAGCC AAGTATCTGA ATTACTTAAT GGTTTGGAGA TGTTCCAGTT TAGCGAATTA
GGTGAAATCG CCGGTGATAT CCGTGAGCTG CGTAAGCTGC TGGTGACGAC TTTGCAGGAA
ACCATGAGCG ACTGCGCCCA GGAACTCAGT GTGGTGCAGG GCAGGTTGCT GGAACTCCTC
GGCCGCTTTA GGCAAATTCG CGGCCGTACC CGCTTGCTTA AGGGCTGGTT ACTGTACACC
GATTTGCATC CGGATTATCG CCCTGCGGAC CATGTGTCCC ACAAGGAAAT CCCGAGTTTT
TTCAATCGCG CCGAAGTGCT GTTGGCTCCA GCATCTGTGG ATGTGCATAA CGCCAGCCAA
GAGTTTGAGT TGATGAACAT TGTCGCCCAT ATCAAGGCGA TTAGCCGTCA GGGCATAGTC
GAAACGGTGC GCGAGCAGGA TGTGGCCGTG CCGCTGACGC AGAATGAAGA CTTTGATATT
CCTGATAATC CACTCAAGCA AGCGGTCGAC ACTTACTTTG TCGATGTGAT TGAGTCGGGC
TTACGCCAGT CGGCGCTCGA TTACTTAGCC GAAAAAGCGC TGCCGTGGGA TGCCGAAAGC
TGGATTTATC AAGTGATTGG CGGCTACGAA GGCTTACCCG ATGAGCATAA GGCTTACTTC
GAGTTAGAAC CCTTAGGTGA ACCGCACCCC ATCTACAGCG GTAACTTTAT TATCCGCGAC
GTGGAATTAT GGCTCGCCTA G
 
Protein sequence
MSSNLHQAGV KLLKQLGRHA DIIMDAYLAG SLNEDAHDPA VVEKLKQAGI LWRPEPDQEL 
RLKRSVRALL EEGLSDERNR QIDSNVGSAL ATIKTLADHY KEARHSSDYS AAEAYLSDLS
EHVYSFADSL RYSIRVLWGR INNEFGYVGT INAKIRENEL AQSQVSELLN GLEMFQFSEL
GEIAGDIREL RKLLVTTLQE TMSDCAQELS VVQGRLLELL GRFRQIRGRT RLLKGWLLYT
DLHPDYRPAD HVSHKEIPSF FNRAEVLLAP ASVDVHNASQ EFELMNIVAH IKAISRQGIV
ETVREQDVAV PLTQNEDFDI PDNPLKQAVD TYFVDVIESG LRQSALDYLA EKALPWDAES
WIYQVIGGYE GLPDEHKAYF ELEPLGEPHP IYSGNFIIRD VELWLA
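
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation. The helper names are hypothetical, and the rounding used for the GC content value in the record is not documented here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Final line of the gene sequence as printed above.
last_line = clean_sequence("GTGGAATTAT GGCTCGCCTA G")
print(len(last_line), last_line[-3:])

# Toy input to show the GC calculation; not taken from the record.
print(gc_fraction(clean_sequence("ATGC GGCC")))
```

Applying `clean_sequence` to the full gene sequence and then `gc_fraction` gives a value that can be compared against the listed GC content.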