Gene Shewmr4_1990 details

Gene Information

Locus tag: Shewmr4_1990
Symbol:
ID: 4252563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2366420
End bp: 2367556
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 51%
IMG OID: 638118603
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_734120
Protein GI: 113970327
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
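
The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 2366420
end_bp = 2367556
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

All three checks hold for this record: 1137 bp is 379 codons, which gives 378 residues once the stop codon is excluded.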


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00291978
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGCAA AACACAACAT GGATAATCGT CAACCTAACC TGTCGGTCTT TCGACGCTTG 
CCTTTGACCT GCGGCTTACT CGCCGCATTG GCGGTGCCAT TAGCACAAGC CGTGCCTGTG
CTTGAGATTA GCCGCGCACA ATCCCACAGC GCCACGGTGC AAACCCTCGA TGCGAATGCG
CCCTTTATCG AGAGAAGGGC GGATCCTTGG GTCATCCGCG ATGATGACGG CAGTTACTAC
TTTATTGCCT CGGTGCCTGA GTTTGACCGC ATCGAACTTC GCCACGCCAA AACCATCGAC
GGTTTACGCC AAGCAACACC TAAAACCCTG TGGCACAAGC ATGAAAATGG CCCCATGAGT
ATCGATATTT GGGCACCTGA GCTGCACAAA ATCGATGGTC GCTGGTATAT CTATTTTGCG
GCCAGCAATA AGGATGTGCG TTTTCATAAC CGCATGTTTG TCTTAGGACT TGAAGGCGAC
TCACCGATGA CAGGCCAATG GCAAGAACTT GGTAAGTTAC AATCGGCGCA GGATGCTTTC
TCCCTCGATG CGACCAGCTT TAGCCTCAAG GGCGAGCGAT ATTTTATTTG GGCGCAGCAG
GACAAAGCCA AGCGTTACAA CACCGGCTTA GTGATTGCCA AAATGCTATC GCCAACTCAA
CTCTCTGATA ACGAAACCAT TATCAGCGAG CCCTTATTGG ATTGGGAACG TTTGGGCTTT
AAAGTCAACG AGGGCGCCGC TGTGTTGGTT AAAAACGGTA AAGTCTTCGT GACCTATTCC
GCCAGTGCCA CGGATGACCG TTATGCCATG GGGCTGTTAT GGGCCGATGA AAATGCCGAT
TTACTCGATC CCAAGAGTTG GCATAAATCG CCAAGCCCAG TCTTTACCAC TGAGCCTAGC
TTGAATCGTT TTGGTCCTGG CCATAATAGT TTTGTGCTGG CAGAGGATGG TAAAACCGAG
CTGATGTTTT ACCATGCACG TAATTACCTT GAGCTGCAGG GGACGCCATT GACCGACGGT
AATAGACACA CTTACTACCG CGCCATTCGC TGGTCGGCGG ACGGTTTCCC AGTATTTGAT
AATCCGCAGA GCGATAGCCA AACCTTGAAT CAGCAAGACA CGCAATCCCG CGAATAA
 
Protein sequence
MDAKHNMDNR QPNLSVFRRL PLTCGLLAAL AVPLAQAVPV LEISRAQSHS ATVQTLDANA 
PFIERRADPW VIRDDDGSYY FIASVPEFDR IELRHAKTID GLRQATPKTL WHKHENGPMS
IDIWAPELHK IDGRWYIYFA ASNKDVRFHN RMFVLGLEGD SPMTGQWQEL GKLQSAQDAF
SLDATSFSLK GERYFIWAQQ DKAKRYNTGL VIAKMLSPTQ LSDNETIISE PLLDWERLGF
KVNEGAAVLV KNGKVFVTYS ASATDDRYAM GLLWADENAD LLDPKSWHKS PSPVFTTEPS
LNRFGPGHNS FVLAEDGKTE LMFYHARNYL ELQGTPLTDG NRHTYYRAIR WSADGFPVFD
NPQSDSQTLN QQDTQSRE
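
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotide sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 for internal codons. Alternative start codons, which table 11 permits, are not handled; this gene begins with ATG, so that does not matter here. Only the first 60 bases of the gene sequence are embedded to keep the example short.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: for internal codons, translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGGACGCAA AACACAACAT GGATAATCGT CAACCTAACC TGTCGGTCTT TCGACGCTTG"
print(translate(first_line))  # MDAKHNMDNRQPNLSVFRRL
```

The printed peptide matches the first twenty residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.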