Gene Shewmr4_0100 details


Gene Information

Locus tag: Shewmr4_0100
Symbol:
ID: 4250979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 110959
End bp: 112500
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 54%
IMG OID: 638116642
Product: histidine ammonia-lyase
Protein accession: YP_732238
Protein GI: 113968445
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
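
The coordinate and length fields above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 110959
end_bp = 112500

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1542 513
```

Both results match the listed Gene Length (1542 bp) and Protein Length (513 aa).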


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCAG TCAATCATTT AGTATTAACG CCCGGCAGTT TAAGTCTGGC GCAATTGCGT 
GAAATCAGCC GCCATAAGCT GACACTCGAA CTGGCGCCAG AGGCGATAAA CGATATCAAC
ACCAGCGCGC AAATCGTGCA AAAGGTGTTG GATGAAGGTC GCACCGTTTA CGGCATCAAC
ACGGGTTTTG GTCTGCTGGC CAACACTAAG ATTGCCCCGG AAGATCTGCA ATTACTGCAA
CGCTCTATCG TGTTATCCCA CGCTGCGGGC ACGGGCCAAT ACATGCAGGA CGCGACCGTG
CGCCTGATGA TGGTGTTAAA GATCAACTCC TTAAGCCGTG GCTTCTCGGG TATCCGTTTA
GAAGTGATTA ATTTCCTTAT CAGCCTAGTG AACGCCGAGG TTTATCCTTG TGTGCCTGAA
AAAGGTTCTG TGGGCGCCTC TGGCGACTTA GCGCCGTTAG CCCATATGTG TTTGCCGCTG
TTGGGTGAAG GCGAGATGAG CTATCAAGGT CAGATTATTT CGGCCGCCGA AGGCTTAGAA
ATCGCCGGCC TCAAGCCTAT CGATTTAGCC GCGAAGGAAG GCTTAGCCCT GCTCAACGGT
ACTCAGGCTT CTACTGCTCT GGCGTTGGAA GGTCTGTTCC ACGCTGAAGA CTTGTTTGCT
GCAAGCTCAG TGATTGGCGC CATGAGCGTC GAGGCAGCCA TGGGTAGTCG CAGTCCGTTT
GACCCACGCA TCCATGCGGC TCGTGGTCAG AAAGGACAAA TCGATGCGGC CATGGTGTTC
CGTCATCTGT TGGGCGAAGA GTCTGAAATC AGCTTAAGCC ACATCAACTG CGAGAAGGTG
CAAGATCCTT ACTCACTGCG CTGCCAACCA CAGGTATTAG GTGCGTGCTT GACCCAAATC
CGCCAAGCGG CCGAGGTGTT AGGCACAGAA GCCAACGGTG TGACCGATAA CCCGCTGGTA
TTTCAAGATA CTGGCGATAT TATCTCCGGT GGTAACTTCC ACGCCGAGCC CGTTGCTATG
GCAGCCGATA ATTTGGCGAT TGCGATTGCC GAATTAGGCG CGATTGCAGA GCGTCGTATC
GCGCTGCTTA TCGACTCTAG CCTATCTAAA CTGCCACCTT TCCTGGTTAA AAATGGCGGG
GTGAACTCGG GCTTTATGAT CGCCCAAGTG ACGGCGGCGG CATTGGCCTC TGAAAACAAA
ACCTACGCCC ATCCAGCATC GGTCGACAGT TTACCGACCT CGGCCAACCA AGAAGACCAT
GTGTCTATGG CGACCTTTGC GGCGCGCCGT TTACGGGATA TGAGCGAAAA CACCCGTGGC
GTGTTAGCTG TTGAGTTATT GGCGGCCGCC CAAGGCTTGG ATTTCCGCGC GCCATTAATG
CCAAGCAAAG CAGTGGCGCA GGCGAAGGCC GAGCTACGCG AAGTGGTTGC CTACTATGAT
AAAGACAGAT ACTTTGCGCC GGATATCGAT GCGGCAACGG ATCTGCTTTA TACCGCCAGC
TTCAATGCTT ACTTGCCCCA AGGCGTATTG CCGAGTCTGT AA
 
Protein sequence
MKSVNHLVLT PGSLSLAQLR EISRHKLTLE LAPEAINDIN TSAQIVQKVL DEGRTVYGIN 
TGFGLLANTK IAPEDLQLLQ RSIVLSHAAG TGQYMQDATV RLMMVLKINS LSRGFSGIRL
EVINFLISLV NAEVYPCVPE KGSVGASGDL APLAHMCLPL LGEGEMSYQG QIISAAEGLE
IAGLKPIDLA AKEGLALLNG TQASTALALE GLFHAEDLFA ASSVIGAMSV EAAMGSRSPF
DPRIHAARGQ KGQIDAAMVF RHLLGEESEI SLSHINCEKV QDPYSLRCQP QVLGACLTQI
RQAAEVLGTE ANGVTDNPLV FQDTGDIISG GNFHAEPVAM AADNLAIAIA ELGAIAERRI
ALLIDSSLSK LPPFLVKNGG VNSGFMIAQV TAAALASENK TYAHPASVDS LPTSANQEDH
VSMATFAARR LRDMSENTRG VLAVELLAAA QGLDFRAPLM PSKAVAQAKA ELREVVAYYD
KDRYFAPDID AATDLLYTAS FNAYLPQGVL PSL
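
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one hedged way to do that in plain Python. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for elongation codons (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The `translate` helper and the variable names are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon order follows the NCBI layout: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGAAATCAG TCAATCATTT AGTATTAACG CCCGGCAGTT TAAGTCTGGC GCAATTGCGT"
last_line = "TTCAATGCTT ACTTGCCCCA AGGCGTATTG CCGAGTCTGT AA"

print(translate(first_line))  # prints: MKSVNHLVLTPGSLSLAQLR
print(translate(last_line))   # prints: FNAYLPQGVLPSL
```

The two outputs correspond to the first 20 and the last 13 residues of the protein sequence listed above.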