Gene Shewmr4_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2037 
Symbol 
ID4252610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2422969 
End bp2424168 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content48% 
IMG OID638118653 
Producthypothetical protein 
Protein accessionYP_734167 
Protein GI113970374 
COG category[S] Function unknown 
COG ID[COG4394] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000141693 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000596189 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGCGA TACACTCAAT GGCAACCACC GCCCCCCACT GGGACATCTT TTGCTGCGTC 
GTCGATAATT ACGGCGATAT CGGCGTCACT TGGCGCTTAG CCAAACAGCT GGTCAATGAA
TATCAGCTCC CCATTATACT CTGGGTCGAT GACTTAAACA GCTTCTCGCA TATTTTACCA
AGCCTTGATC CAAACCAAAG CAGCCAAGTC TTTAATGGCG TCACCATCAA TCATTGGACA
ACGCCCCTGC CCGTGGCATT TGTGCCCGGC GCCGTGTTAA TTGAAGCCTT TGCCTGCGAA
CTGCCTGACG AGGTCAAACA ACAACTCATC ACGCTGCACA GCACCACACC GCAAGCTGTG
CCCGTATGGC TGAATTTAGA ATATTTAAGC GCCGAAGACT GGGTCGATGG CTGCCATGGG
TTACCCTCGA TGCAGGCAAG TGGCATCAAA AAGTATTTCT TTTTCCCGGG TTTTACCCCA
AAGACGGGTG GACTGATCTG TGAGCGTGAG CTGTTTGCCG AACGCGATGC ATGGCAACTG
GATAGCACCA ATAAATTGCA ATTATTTGAG CGCCTTGGTC TTAAGGATAT TCAAGCGCAA
GATACTGTCT ACAGTGTCTT TAGTTATGAA ACTGATTCTC TGCCGGCCTT ATGTGAGCTC
TGGCAAGCCA GTGCAACAAG CGATGCCAAA ATCCATGCGC TTATTCCCAA GGGACGCAGC
TTAAACAGCT TACAACACTT ATTACCCTGC AAGGTTGAGG CGCTCAGCCC CGGACAGCAA
ATTAAGCTAG GTCATTTGAC CCTGCATATC TTGCCGATGA CAGACCAACA AGGGTTCGAC
CGCCTGCTGT GGAGCTGCGA CTTTAATATT GTCCGCGGTG AAGACAGCTT CCTGAGGGCG
CAATGGGCCG CTAAACCCTT CATTTGGCAT ATTTATCCGC AAGAAGATGA TTATCATCTA
ATAAAATTAG AAGCTTTTAT CCAACTTTAC TGCGATAATC TGCCCCCTGA TATTGCTGGT
ACTTGGTCTA AATTGAATGT TGCATTTAAC CAAGGCGAGC AATCTGCCGT GAAAACTCAC
TGGCAAAACC TAAATCCTGT CAGTTTGCCA CTTTTGCAAC ATGCTAAAGA TTGGCCAATT
GACGCAATAA ATGCTGCAGA TCTTGCGACT CGGCTAGTCC AATTCGTCAA AAAAAGCTAA
 
Protein sequence
MKAIHSMATT APHWDIFCCV VDNYGDIGVT WRLAKQLVNE YQLPIILWVD DLNSFSHILP 
SLDPNQSSQV FNGVTINHWT TPLPVAFVPG AVLIEAFACE LPDEVKQQLI TLHSTTPQAV
PVWLNLEYLS AEDWVDGCHG LPSMQASGIK KYFFFPGFTP KTGGLICERE LFAERDAWQL
DSTNKLQLFE RLGLKDIQAQ DTVYSVFSYE TDSLPALCEL WQASATSDAK IHALIPKGRS
LNSLQHLLPC KVEALSPGQQ IKLGHLTLHI LPMTDQQGFD RLLWSCDFNI VRGEDSFLRA
QWAAKPFIWH IYPQEDDYHL IKLEAFIQLY CDNLPPDIAG TWSKLNVAFN QGEQSAVKTH
WQNLNPVSLP LLQHAKDWPI DAINAADLAT RLVQFVKKS