Gene Shewmr4_1777 details


Gene Information

Locus tag: Shewmr4_1777
Symbol:
ID: 4252351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2110902
End bp: 2111987
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 52%
IMG OID: 638118388
Product: hypothetical protein
Protein accession: YP_733908
Protein GI: 113970115
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
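
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Coordinates copied from the record.
# Assumption: both positions are 1-based and inclusive.
start_bp = 2110902
end_bp = 2111987

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codon_count = gene_length // 3        # total codons, including the stop codon
protein_length = codon_count - 1      # the stop codon is not translated

print(gene_length, protein_length)    # 1086 361
```

Under that assumption the result matches the listed Gene Length (1086 bp) and Protein Length (361 aa).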


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAG TATTAACGGC TTTCTGGCAA CGCTGGTTGG CAAGGCGCAT GCCGCCCAGT 
TCTGAGTTTG TCCTCAGCCA TCGCAGCATC TTTATTCTGC CCAGTGGCTT TGGTTTAGTC
TGGCTTGGGC TGGTACTGCT GCTGTATCTG TTTGGCACTA ACTATCAAAA CAATCTTGTA
ATTGGCCTGA GCATTTTGCT CCTCAGTCTA TTTAATACCT GCATCCTTTA CAGTTATAAA
AATCTCGCGG GTTTGCGCCT TCGCGCGCTC ACGGCGCCAG AGGTGTATGC GGGCGAAACC
ATCACCTTTC CAGTCCTGCT CACCTCAAAT CACAGCAGCC ACAATATCAG CTTAAATTAC
CCCAATAATT TAGCGTACTT ACTCAAACAG GTTGGCGCCG ATGAGGTGCA AGCTCTGGTC
TCGTTTGCCC ATGACAGCCG TGGTCTGGTA TCGCCTGGCC GACTTAAGAT TGAATCCTTT
TATCCCTTAG GGCTTTGCCG CGCTTGGTCC CATATTGATC TCGATAATGC GCACATCGTC
TACGCCCACC CGATTGAAAG CCCTTTGCAG CTAAAGGCGG CGACCGAATC CGGTGAAGAC
GAACGGTTAG AACGAGCAGG AAAGTACATC GCCGGCATCG ATGAATACAA AGGGCTTAAG
CCCCATGTGC TCGGTGAATC TCTTAAACAA GTGGCATGGA AACAATGGGC TCAAGGGCGT
GGCATGCTAA CCAAGGAGTT CGAGCAGCCT CAGGGCGATC CCGTATGGTT AACCTTAGTC
CCCGATCCCG CGCAGCTTGA ACAGCAATTA GGTCAGCTCA GCTGGCAAGT CAACCATTTG
AGTCAGCAGG AGCAATATTT TGGTCTGTGG CTGCAGCGTG TTGGGGCGGA AGATCTGATC
CTCACACCCG ATATGGGCAA TGCCCATCGT ATCGCCTGCC AAAGGGCGCT CGCAATTTAC
GGGCAAGATA TCAGCACGCT GGATAAAACC GATAAATTGA TGAATAACCA AGGTGTTAAG
CATCAGCATC CGCTTCCTCC CCATGGGCAA GGCGTGCGTT CTGCAACAAT GGAGCCGCGG
CGATGA
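
The GC content field in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal Python sketch, not the database's own method. The function name gc_content is hypothetical, and the record does not say how its percentage was rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()   # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Smoke test on the first 10-base block of the gene sequence only.
print(gc_content("ATGAAACGAG"))  # 40.0
```

To compare against the reported value, pass the full gene sequence (whitespace is tolerated) and round the result.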
 
Protein sequence
MKRVLTAFWQ RWLARRMPPS SEFVLSHRSI FILPSGFGLV WLGLVLLLYL FGTNYQNNLV 
IGLSILLLSL FNTCILYSYK NLAGLRLRAL TAPEVYAGET ITFPVLLTSN HSSHNISLNY
PNNLAYLLKQ VGADEVQALV SFAHDSRGLV SPGRLKIESF YPLGLCRAWS HIDLDNAHIV
YAHPIESPLQ LKAATESGED ERLERAGKYI AGIDEYKGLK PHVLGESLKQ VAWKQWAQGR
GMLTKEFEQP QGDPVWLTLV PDPAQLEQQL GQLSWQVNHL SQQEQYFGLW LQRVGAEDLI
LTPDMGNAHR IACQRALAIY GQDISTLDKT DKLMNNQGVK HQHPLPPHGQ GVRSATMEPR
R
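
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()     # tolerate the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence.
print(translate("ATGAAACGAG TATTAACGGC TTTCTGGCAA"))  # MKRVLTAFWQ
```

The output matches the first block of the listed protein sequence. Translating the final six bases, CGATGA, yields R followed by a stop codon, which agrees with the last residue of the protein.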