Gene Shewmr4_3027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3027 
Symbol 
ID4253598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3619931 
End bp3621406 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content51% 
IMG OID638119669 
ProductN-acetylglucosamine-binding protein A 
Protein accessionYP_735155 
Protein GI113971362 
COG category[S] Function unknown 
COG ID[COG3397] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.483728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACATC TAACTGCTTT TACATCAATC ACAAATGCTA ACCACAAACA CCCACAACTG 
GCTCGTTTAA GCCTAATTGC CTTGGCCCTA AGCGCTACGA GTGCGCTAGT TAGCCAAACG
GCGTCGGCCC ACGGTTATGT GGTTTCACCA GAATCACGCT CATACGCCTG TAAAACTGGC
AGCAATTTAA ACTGCGGCGC CGTTCAATGG GAACCGCAAA GTGTTGAAGG TCCATCAGGT
TTTCCTGAGT CAGGCCCTGC CGACGGCAAA ATTGCCAGCG CAGCCAACGC AGCGTTTTCT
CCCTTGGATG AACAGAGCCC AAGTCGTTGG TCTAAGCATG ACATTAAGTC GGGTTGGAAT
GACTTTAGCT GGCAGTTCAC CGCTAACCAT GTGACCCGCA ATTGGCGTTA CTATTTAACT
CGTCAAGGCT GGGATCAAAA CCAAGCCTTG AGCCGTGCAA GCTTTGACTT AGCTCCCTTC
TGTGTGGTCG ACGGAGGTAT GGTTCAGCCG CCTAAGTTAG TGACACATAA CTGTTATGTG
CCTGAAGACA GAAGCGGTTA TCACGTGATT TTGGCCGTGT GGGAAGTCGG TGATACCACC
AACAGTTTCT ATAATGCTAT CGATGTGAAC TTTAGCTCTG GTGTTGTGGT GCCGGGCGAG
TGGACCGATA TTGGCGATAT CAATCCGTCA CTTGATCTTA AGGCGGGTGA TAAGGTGATG
ACGCGGGTGT TTGATGCTAA TGGCGAGCAA ACTGCCAAGC AGACTCAGAT AACCATTGCC
GACACTACTC AAGGTGCCAA GCAAAATTGG CCATTCCTGT TAGCCAGTGC CATTAATGCC
CAGCAGCCAC AACTTAAGGC GGGGCAGAAG AATCCCTCTG GGGTGATCTC GCCCGTTTAC
GGTAAAAATG AGATTTATGC CGCGCCTAAT TCGGGCCTAG AGCGAGTGGA AGTGAGCTTT
GATATTGCGC CTGCGCCGGG CAATCAGCTC GATGTCACGT CACTGGCCGA TGATTACACT
ATTGTCGATG GTGCCGCAAA GGTCAGCTTC GATGTCAGCA CTAATGCGGA TATGCAGGTC
TCGGCTTACC TATTTAGCCA CGATGGCACG GCAGCTGGAT ATGTCACACA AGTGGTTAAT
AATACTAGCG CGAGTCTAGT GCTTGATGTC GTCGCGCCTA AGGCTGGCCA TTATCACTTA
CAAGTGAAGG GCGAGCCGAA GCAAGGTGAG GTTATCCAGC AAAACTTCGA TCTGTTCTTA
AAAGATCAAG CCACAGCGCC GGATGCCGAT TATGTCTTCC CCGAGGGCAT TAAAAACTAT
GTGGCGGGTA CTAAAGTGCT GCAACCTAAA ACTGGCAAGG TCTATCAATG TAAACCTTGG
CCTTACAGTG GTTATTGCAT GCAATGGTCG CCAACTGCAA CCGGGTTTGA ACCGGGTGTC
GGCGGCTCTT GGAATATGGC TTGGACTGAG CTGTAA
 
Protein sequence
MAHLTAFTSI TNANHKHPQL ARLSLIALAL SATSALVSQT ASAHGYVVSP ESRSYACKTG 
SNLNCGAVQW EPQSVEGPSG FPESGPADGK IASAANAAFS PLDEQSPSRW SKHDIKSGWN
DFSWQFTANH VTRNWRYYLT RQGWDQNQAL SRASFDLAPF CVVDGGMVQP PKLVTHNCYV
PEDRSGYHVI LAVWEVGDTT NSFYNAIDVN FSSGVVVPGE WTDIGDINPS LDLKAGDKVM
TRVFDANGEQ TAKQTQITIA DTTQGAKQNW PFLLASAINA QQPQLKAGQK NPSGVISPVY
GKNEIYAAPN SGLERVEVSF DIAPAPGNQL DVTSLADDYT IVDGAAKVSF DVSTNADMQV
SAYLFSHDGT AAGYVTQVVN NTSASLVLDV VAPKAGHYHL QVKGEPKQGE VIQQNFDLFL
KDQATAPDAD YVFPEGIKNY VAGTKVLQPK TGKVYQCKPW PYSGYCMQWS PTATGFEPGV
GGSWNMAWTE L