Gene Shewmr4_0578 details


Gene Information

Locus tag: Shewmr4_0578
Symbol:
ID: 4251141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 666465
End bp: 668867
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 11
GC content: 55%
IMG OID: 638117137
Product: hypothetical protein
Protein accession: YP_732715
Protein GI: 113968922
COG category:
COG ID:
TIGRFAM ID:
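
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the last codon of the gene is a stop codon. The function names are illustrative.

```python
# Values copied from the Gene Information fields above.
START_BP = 666465
END_BP = 668867
GENE_LENGTH_BP = 2403
PROTEIN_LENGTH_AA = 800


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Number of encoded residues, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold for this record: 668867 - 666465 + 1 = 2403 bp, and 2403 / 3 - 1 = 800 aa.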


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCC ATAGCCTAGC CCTCGCCATC CTACTCTTAG GTTTACCCGC GCTGAGCGTC 
GCCGCGGATC TGCCGAGCAC TAAAGTCGTT AAGCAGAGCC AAGCCGCTAA GGGATTTCTT
AATTTATACT ATGAGCCAAG CGAGGGGGAG TTATACCTCG AGGTGAGCCG TTTAAATCAG
CCTTTTTTAT TGGTGACTAG CTTGCCTGAA GGGGTCGGTT CTAACGATAT CGGTCTCGAC
CGTGGTCAAT TAGGTCAAAC CCGCATGGTG CAGTTTGAGC GCCAAGGTCC CTACATTCAG
CTTAAGCAAC TAAACACCCA GTATCGCGCT AATACCCAAG ACGCCGCCGA AAAGCGCGCC
GTGGATGAGG CCTTTGCCGA TTCGGTATTG TGGCAGGGTA AGTTACTCGA TGGTAAGCCA
GAGATGGTGG CTATCAGCGA GTTGGTACTC AACGATCTGC ACGGCGTCGC GGATGCTCTC
CTGCATCGCG GGCAGGGGAA TTATCGCCTT GATTTAACCC GCTCGGCGAT TTTACCCGCC
GGGGTGAAAT CTTTTGAAAA GAATAGTGAT GTGGATGTGC AGCTTACCTT CAAAGCCGAT
GCGGCGGGTG AGCAAGTGGC TAAGGTCACG CCCGATGGCA CCTTAATGTC GGTGCGGATG
CGCTACTCTT TTGTCGAGCT GCCCGATGAG GGCTATCAAC CTCGCGCTTA TCATCCTATG
AGCGGCTATT TATCCGATGA GTATCGCGAC TATGCCACGC CGTTTTCGGC GCCACTGGTG
CAGCGGTTTA TTTTGCGCCA CCGCCTGCAA AAGGTGAATC CTGGCCCTGC ACCGAGCGAA
GTAGTCAAGC CCATCACCTA TTACCTCGAC CCAGGTGTGC CTGAGCCTAT CCGCTCGGCG
CTACTCGATG GTGCTCGCTG GTGGGAAACG GCCTTCACCC AAGCGGGATT TATCAACGGC
TTCAAGGTTG AACTCTTGCC ACCCGATGCC GATCCGCAGG ATATTCGCTA CAACATGATC
CAGTGGGTAC ACCGCGCCAC GCGCGGATGG TCCTACGGCG CGGCGCTAAC CGATCCGCGT
ACCGGCGAAA TCATCAAAGG CCAAGTGACC TTAGGTAGCT TGCGAGTGCG CCAAGATTAC
TTGATTGCCA AAGGCTTAAC TGCAGGCTGG CGTGATAGAA GCGCCGCCGA GCAAGCGGCC
AACGACTTAG CATTAGCGCG TATTCGCCAA CTTGCCGCCC ATGAGGTCGG CCATACCTTA
GGCTTAGATC ATAACTTTGC CGCCTCGACC AATCAGGACG CGTCAGTGAT GGATTATCCC
CATCCTAAGA TCATGCTAAA AGGTAATGAC ATTGATATAT CAGCACCTTA TGGCGTAGGT
GTCGGGCTAT GGGATAACTT TGCTATCGCT TACGGCTATA GCGATGAAGG CGATGCCACT
GCCCAGCAGG CGCTGCAAAA TCAGTTGCTG GCCGAAGTGG CTCGAAAAGG GCTGCGCTAT
ATTGGCGAAG CCGATTCACG CCAAGCGGAT GCCAGCCAAG CCTATGCGAG TTTATGGGAT
AGCGGTGACG ACCCTATCGT GCAGCTGCTG GATTTGAACC GGATTCGCAC TAAGGCCATC
GAAGGTTTTA GCAGCACGGC CCTGTTGCCG GGCGAGCCAC TGGGTGAGCT AGCCGATGCC
TTTGTGCCTA TCTATTTGCT TAATCGTTAC CAAATCGATG CGGTTATTAA GTTTATTGGC
GGCACTGACT ACAACTATCT GTCCGTCGGC GAGGGTGGCC GCTGGAGCTA CATAGCGCCG
CAGTTACAGC TGTCGGCCCT TGATGCGCTG CTAAGCACCT TAGATGCGGC CAGTTTAACG
GTTCCGCAAA CCTTGCTCGA GACGCTGGTG CCTAAAGCGG GCAATTATCA AGCGACGCGG
GAGTCCTTCG AGTCTGGGCT TGGGGTGGTG AGCGATCCCC TCGGTATGGC TGAAGTGCTA
GCGCGCCATA CTGTGGGGCA GTTGTTGATG CCACAGCGCT TAAATCGCGT CAGCCAAGGG
GCGATGGCGG ATAATGAGCA GCTCTCGATC GAAACCTTAC TCAATAAGCT GTTTGCCGCG
ACCCTATACC AAGAAGACAA GCTCGCTCTG GTTGAAGGTG TGTGGATGCG GGTGAATGCG
GTGGTGATCG ATGAACTCTT GTCTGCATAT CACAATCCGC AAACCTCAGC AGAGGTGAAG
GCGGCTATTT ACGAGCGCGC CCAATTCGTG ATTAAACAGC TTAAAGCCAA AGCGAATCGC
GCGAATGCTA AGGTGGCCTC CCACTACACT TGGTTGCAAC AGGGGCTGAG TGCAGGGCTC
ACTGATGCCA ACAGCAAACT CATTCCAAAA CCCTTGAAAC TGCCGCCGGG TTCACCTATC
TAA
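
The GC content field in the gene information is the percentage of G and C bases in a sequence. The following is a minimal Python sketch of that calculation, assuming the sequence is pasted in the space-separated 10-base blocks used above. The names `clean_sequence` and `gc_percent` are illustrative. Only the first line of the gene sequence is used here, so the printed value is not expected to equal the 55% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAACCCC ATAGCCTAGC CCTCGCCATC CTACTCTTAG GTTTACCCGC GCTGAGCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 2403-base sequence through the same two functions would give the gene-wide figure.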
 
Protein sequence
MKPHSLALAI LLLGLPALSV AADLPSTKVV KQSQAAKGFL NLYYEPSEGE LYLEVSRLNQ 
PFLLVTSLPE GVGSNDIGLD RGQLGQTRMV QFERQGPYIQ LKQLNTQYRA NTQDAAEKRA
VDEAFADSVL WQGKLLDGKP EMVAISELVL NDLHGVADAL LHRGQGNYRL DLTRSAILPA
GVKSFEKNSD VDVQLTFKAD AAGEQVAKVT PDGTLMSVRM RYSFVELPDE GYQPRAYHPM
SGYLSDEYRD YATPFSAPLV QRFILRHRLQ KVNPGPAPSE VVKPITYYLD PGVPEPIRSA
LLDGARWWET AFTQAGFING FKVELLPPDA DPQDIRYNMI QWVHRATRGW SYGAALTDPR
TGEIIKGQVT LGSLRVRQDY LIAKGLTAGW RDRSAAEQAA NDLALARIRQ LAAHEVGHTL
GLDHNFAAST NQDASVMDYP HPKIMLKGND IDISAPYGVG VGLWDNFAIA YGYSDEGDAT
AQQALQNQLL AEVARKGLRY IGEADSRQAD ASQAYASLWD SGDDPIVQLL DLNRIRTKAI
EGFSSTALLP GEPLGELADA FVPIYLLNRY QIDAVIKFIG GTDYNYLSVG EGGRWSYIAP
QLQLSALDAL LSTLDAASLT VPQTLLETLV PKAGNYQATR ESFESGLGVV SDPLGMAEVL
ARHTVGQLLM PQRLNRVSQG AMADNEQLSI ETLLNKLFAA TLYQEDKLAL VEGVWMRVNA
VVIDELLSAY HNPQTSAEVK AAIYERAQFV IKQLKAKANR ANAKVASHYT WLQQGLSAGL
TDANSKLIPK PLKLPPGSPI
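
The protein sequence is the translation of the gene sequence under translation table 11, which in the NCBI numbering is the bacterial, archaeal and plant plastid code. Its codon-to-amino-acid assignments are the same as in the standard code. The following is a minimal Python sketch with a hand-built codon table rather than any library API. The name `translate` is illustrative, and the sketch does not handle alternative start codons, which is not an issue here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block separators and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAACCCC ATAGCCTAGC CCTCGCCATC CTACTCTTAG GTTTACCCGC GCTGAGCGTC"
print(translate(first_line))  # MKPHSLALAILLLGLPALSV
```

The result matches the first two 10-residue blocks of the protein sequence above.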