Gene Shewmr4_1798 details


Gene Information

Locus tag: Shewmr4_1798
Symbol:
ID: 4252372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2133719
End bp: 2134909
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 53%
IMG OID: 638118409
Product: histidinol-phosphate aminotransferase
Protein accession: YP_733929
Protein GI: 113970136
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
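
The start, end, gene length and protein length values listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates (the record does not state the convention) and that the coding sequence ends with a single stop codon that is not counted as a residue. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2133719
END_BP = 2134909


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.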


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAG TGCCAACTAG CCAAGCACCA ACGAGCAATG TGCCTGTGAC AAATATTCCG 
AGCGCCAATT GCGCCAGCAA TACTCTCGAC AACAATACTC TCGATAACAC AAGCATCGAG
CCAACAACCC TTGCCGCCCG TCTTGCGCGG CCCGAGCTGC TCGAGTTAAC GCCTTACCAA
AGTGCTCGCA GGCTGGGTGG TCGTGGGGAT ATTTGGATCA ACGCTAACGA ATCGCCCTTC
AATAATGTGG CCGTTGCCGA ACTCGATTTA TCTAAGTTAA ATCGTTACCC CGAGTGCCAA
CCGCCCGCGT TAATCAATGC CTATAGCCAA TATAGCGGTG TTGCGGAGAG CAAAATTGTC
GCCAGCCGCG GCGCCGATGA GGCCATCGAG CTACTTATTC GTGCTTTTTG TATCCCAGGT
ATCGACTCAA TTGCCACCTT TGGGCCCACT TACGGCATGT ACGCCATTAG CGCGCAAACC
TTTAATGTGG GCGTTAAGGC ATTAAGCTTA ACGGCGGAGT ACTGTCTCCC AAGTGACTTT
GCGACGGCCG CGCGCGGCGC TAAGTTAGTG TTTATCTGTA ATCCCAATAA CCCAACTGGC
ACTGTGATTG AGAAGGCGCG CATAGAGCAA GCCATCCAAG CCCTGCCCGA CGCCATTGTT
GTTGTCGATG AGGCTTATAT CGAGTTTTGC CCCGAATATA GCGTCGCCGA TTTACTCGAG
TCTTACCCAA ACCTTGTGGT GCTACGCACT CTTTCAAAGG CCTTTGCCTT AGCGGGCGCG
CGCTGCGGCT TTTTGCTCGC CAATGAAGAG ATTATCGAAA TCATCATGCG GGTGATTGCG
CCCTATCCTG TGCCATTGCC CGTGAGTGAA GTGGCCGTGC AAGCACTATC AGCTGCTGGG
ATTGCGCGGA TGAAAACCCA AGTCAAAGCG CTCAATGCTC AGGGCGAGCG ACTCGCGGCG
GCGCTGAATT TGTACTGCGA ACAATGGGGC GGCGCCGTGC TAACACCCAA TGGCAACTAT
GTACTCGCCG AATTCGACGA TGTGGCAAAA GTGGCACAGC TGCTTATCGA CAATGGCATT
GTCGCGCGGG CCTATAAGGA CCCTAGATTG GCTAAAGCCA TTCGTTTTAG CTTTAGCTCT
GAGGCTGACA CCGACCGCTT AGTGTCGCTA TTTGAATCGC AAAAGCTGTG A
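
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown above. `gc_percent` is a hypothetical helper name, and the example input is only the first ten bases of the gene, not the full sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First ten bases of the gene sequence above: 5 of 10 are G or C.
print(gc_percent("ATGAGCCAAG"))  # 50.0
```

Passing the full 1191 bp sequence to the same function would give a value to compare against the rounded figure in the record.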
 
Protein sequence
MSQVPTSQAP TSNVPVTNIP SANCASNTLD NNTLDNTSIE PTTLAARLAR PELLELTPYQ 
SARRLGGRGD IWINANESPF NNVAVAELDL SKLNRYPECQ PPALINAYSQ YSGVAESKIV
ASRGADEAIE LLIRAFCIPG IDSIATFGPT YGMYAISAQT FNVGVKALSL TAEYCLPSDF
ATAARGAKLV FICNPNNPTG TVIEKARIEQ AIQALPDAIV VVDEAYIEFC PEYSVADLLE
SYPNLVVLRT LSKAFALAGA RCGFLLANEE IIEIIMRVIA PYPVPLPVSE VAVQALSAAG
IARMKTQVKA LNAQGERLAA ALNLYCEQWG GAVLTPNGNY VLAEFDDVAK VAQLLIDNGI
VARAYKDPRL AKAIRFSFSS EADTDRLVSL FESQKL
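
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this on two short fragments. It assumes the codon-to-amino-acid assignments of the standard genetic code, which table 11 shares (table 11 differs in which codons may act as start codons, and that is not modeled here). `translate` is a hypothetical helper written for this example; it stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence above.
print(translate("ATGAGCCAAG TGCCAACTAG C"))  # MSQVPTS
# Last 21 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("TTTGAATCGC AAAAGCTGTG A"))  # FESQKL
```

Both fragments reproduce the matching ends of the listed protein sequence, which begins with MSQVPTS and ends with FESQKL.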