Gene Shewmr4_2110 details


Gene Information

Locus tag: Shewmr4_2110
Symbol:
ID: 4252683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2517009
End bp: 2518574
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 49%
IMG OID: 638118734
Product: hypothetical protein
Protein accession: YP_734240
Protein GI: 113970447
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
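
The coordinate and length fields above agree with each other, and a few lines of Python can confirm it. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2517009
end_bp = 2518574

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1566 521
```

Under these assumptions the result matches the recorded Gene Length (1566 bp) and Protein Length (521 aa).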


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.664501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTTT CTCAATCTTT TTCCCTCGCC CAGATCATCG GCCTACTCGC CGTGGCCTTT 
TTAGGACTGT TAATCGGTGC CCTGCTCAAT CAACGTCTCA CTCGTTCTCG CTGGCAACAG
TTTAAAGACG AATTAGAGCA AGAAATGCGG CAGGTAAACG AAGACGCCGA GCTCTCCCTC
GCCCAGCAGC AAATTTTAGT CGACGATAAA GACAGCCAAC TGCGCCAGTG CCAACAGCGC
TTAGAGCAAA AGATAGAACA ACTGGGCAAA GCCGAAGCCA TGGCCGAGCG CCTGCCGAGT
TTAGAGCAAC AACTTGCCGA CAGCCATCGC CGCCAACTTG AGCTACAACT GGCCTTATCC
AAATCCAATG CGATGCAACA AACCATTCAA GCCAAGGCTG ATGCCCAGCA ATCTGCGATG
CAGGAAAAAA TCGCCACTTT AGAAATGGCC GAAGTGCGCC TGCAAACTCA GTTTGAGAAC
CTTGCCAATC GAATTTTCGA AGAGCGCAGC GAAAGCTTTA AACATCAAAA TGCGAACCAG
TTAGAAGGCG TGCTCGGGCC GTTAAAGCAG CAGTTAGAAG GTTTTAGACA GCAAATTCGC
GAATCCTACA ACCATGAACA GTCGGAACGC AGCGCCCTTA AACATCAATT AGAACACCTG
CGCGAGCTCA ACTTAAAAAT GAGCCAGGAT GCCATTAACC TGACCAAAGC CTTAAAGGGC
GATAACAAGC AGCAAGGCAA CTGGGGCGAA GTGATTTTAG ACCGCGTGCT GCAAGAAAGC
GGCCTGCGTG AAGGTCACGA ATACCACACC CAGCAGGATC TGAAGGACGA CAGCGGCAAA
CGCTTTAAGC CGGATGTGAT CGTACATTTG CCTGAAAATA AAGACGTGGT GATCGATGCC
AAAATGTCGC TCATCAGCTA CGAGCGCTAT TTTAATAGCG AAGATCCGCT GGTGCGTGAA
CAGGCCATCA ATGAACACGT TTTATCGATC CGAAATCATA TTAAGGGCTT GAGTCAAAAG
GATTATCAGC GTTTACACGG GCTAAAAAGC TTAGATTATG TGCTGATGTT TATCCCGATT
GAACCCGCCT TCTTGCTGGC CCTAGAGCAT GACCCAAGCC TAGTTAACTT TGCCCTTGAG
CAAAATATTA TGCTGGTCAG TCCAACCAAC CTCTTGGTTG CCCTGCGAAC AATCAATAAT
ATCTGGCGTT ACGAGTATCA AAACCAGCAC GCCCAAACCA TTGCCAAACA GGCGGGTCGC
ATCTACGACA AACTCTGTGG CTACCTCGAC GATATGGAAA AACTCGGCCG TGCTCTGGAT
AACGCCGAAA AAACCTATCA CAGTGCCATG AACAAATTGT CATCGGGCAA AGGCAATTTA
GTGCGTCAAG CGCATTTAAT GCAGCAACTA GGTGTTGATA CCAGCAAACA ACTCGATAAG
ATGTTACTTG AGAAGGCGCT CAATGAAGCC TTAGACGAGG GTGATGCCCA GGACAGCAGT
GATGATGATA CAAATCGCGA TACGCTTTTG ACCCATACTG AGGATGCCAC CGCACTCGAA
CAATAA
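
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as sample input.
example = clean_sequence("ATGCCATTTT CTCAATCTTT")
print(len(example), gc_fraction(example))
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` gives the value for the whole gene.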
 
Protein sequence
MPFSQSFSLA QIIGLLAVAF LGLLIGALLN QRLTRSRWQQ FKDELEQEMR QVNEDAELSL 
AQQQILVDDK DSQLRQCQQR LEQKIEQLGK AEAMAERLPS LEQQLADSHR RQLELQLALS
KSNAMQQTIQ AKADAQQSAM QEKIATLEMA EVRLQTQFEN LANRIFEERS ESFKHQNANQ
LEGVLGPLKQ QLEGFRQQIR ESYNHEQSER SALKHQLEHL RELNLKMSQD AINLTKALKG
DNKQQGNWGE VILDRVLQES GLREGHEYHT QQDLKDDSGK RFKPDVIVHL PENKDVVIDA
KMSLISYERY FNSEDPLVRE QAINEHVLSI RNHIKGLSQK DYQRLHGLKS LDYVLMFIPI
EPAFLLALEH DPSLVNFALE QNIMLVSPTN LLVALRTINN IWRYEYQNQH AQTIAKQAGR
IYDKLCGYLD DMEKLGRALD NAEKTYHSAM NKLSSGKGNL VRQAHLMQQL GVDTSKQLDK
MLLEKALNEA LDEGDAQDSS DDDTNRDTLL THTEDATALE Q
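
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may initiate translation), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGCCATTTT CTCAATCTTT TTCCCTCGCC"))  # prints: MPFSQSFSLA
```

The output matches the first block of the protein sequence. A codon containing any other character (for example N) raises `KeyError` in this sketch.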