Gene Shewmr4_1927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1927 
Symbol 
ID4252501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2297290 
End bp2298450 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content46% 
IMG OID638118538 
Producttetratricopeptide repeat protein 
Protein accessionYP_734058 
Protein GI113970265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000214151 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000455272 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCTTGAGA TCCTCTTCCT GTTGCTCCCT ATTGCTGCCG GTTACGGCTG GTATATGGGG 
CGGCGGAGCA TAAGGCAAAA CCAGAGTAAT CAGCGTAAAC AATTAAGTCG TGATTATTTC
ACCGGCTTAA ATTTCCTGTT GTCGAACGAG TCAGACAAAG CGGTCGACTT GTTTATCAGT
ATGCTCGATG TGGATGATGA AACTATCGAT ACCCATCTTT CCCTCGGTTC GTTATTTCGC
AAACGCGGTG AAGTTGACCG TTCCATTCGT ATCCATCAGA ACTTAATTGC ACGACCAACG
CTCACCAATG AGCAGCGCGA TATGGCTATG ATGGAACTGG GTAAAGATTA CCTTGCCGCA
GGGTTTTACG ACCGCGCTGA GGAAATTTTC TTGAATTTAG TGAGTCAAGA TGATCACAGT
GAAGAGTCTG AAACTCAGCT GATTGCCATT TATCAGGTGA TTAAGGAATG GCAAAAAGCC
ATCGACATCA CCAAACGCTT AAGCCGTAAG CGTCAGCAGG CACTCAAACC GATTATTGCC
CATTTTTATT GCCAGCTTGC GGATGAATCC GGGGATGATG CGGATAAAAT AAAGCTGCTG
CAACAGGCAT TAAAACAAGA TCCTAAGTGC GGCCGAGCCT TGCTCACGCT CGCCAAAAAA
TTTCTCGATG CTAAGGATTA CACCCAGTGT AAATCCATGC TGATGGCACT GAAAAAAGCC
GATATAGAAC TCTTTGCCGA TGCTTTGCCC ACCGCGAAAC AAGTGTATCG CGATACCCAA
GATAAAGAGG GTTATCAAGA ATTATTAGCG GGTGCTATGG CCGAAGGGGC GGGAGCCTCT
GTGGTGGTAG CGCTTGCTCA ACATATGATT AGTCTCGATG AGATAAAGGC CGCTGAAAAC
ATGGTATTGG ATGCCCTGTA TCGCCATCCC ACCATGAAGG GATTTCAGCA CTTAATGCAG
ATGCACCTGC GTCAAGCCGA AGAAGGGCAA GCCAAACAAA GTTTGACTAT GCTTGAGCAA
CTCGTTGAAC AACAAATTAA ATTCCGTCCA AGTTATCGCT GTAAAGAATG CGGTTTTCCA
TCCCACACAC TTTATTGGCA TTGTCCCTCC TGTAAAAAAT GGGGCACCAT TAAACGGATC
CGTGGTTTAG ACGGTGAATA A
 
Protein sequence
MLEILFLLLP IAAGYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS 
MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTNEQRDMAM MELGKDYLAA
GFYDRAEEIF LNLVSQDDHS EESETQLIAI YQVIKEWQKA IDITKRLSRK RQQALKPIIA
HFYCQLADES GDDADKIKLL QQALKQDPKC GRALLTLAKK FLDAKDYTQC KSMLMALKKA
DIELFADALP TAKQVYRDTQ DKEGYQELLA GAMAEGAGAS VVVALAQHMI SLDEIKAAEN
MVLDALYRHP TMKGFQHLMQ MHLRQAEEGQ AKQSLTMLEQ LVEQQIKFRP SYRCKECGFP
SHTLYWHCPS CKKWGTIKRI RGLDGE