Gene Shew_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_2154 
Symbol 
ID4923326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp2498652 
End bp2499812 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID640163739 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001094279 
Protein GI127513082 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.634063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTT ATGTAAGACA GGGGGAGATC CCCCATAAAC GCCACATCAC CTTCAAGAAA 
GAGAATGGTG AACTCTACCG GGAAGAACTC TTCTCCACCC ATGGCTTCTC CAATATCTAC
TCCAACAAGT ATCACCACAA CATGCCGACC AAGGCATTAG AGGTGAGCCC GTTCGAGGTC
AATCACGGCC AAACCTGGCA AGACACCTTG ATTCAGAACT ATAAGTTGGA CGCTAAGCTG
GCCGACCGCG AGGGTAACTT CTACTCGGCG CGCAACAAGA TCTTCTTTAA TAACGACGTG
GCCATGTACA GCGCCAAGGT CACCGAGGCG ACCGAAGAGT TTTACCGCAA CGCCTACGCC
GACGAGGTGA TCTTCGTCCA CGAGGGCCAG GGCAAGCTCT ACAGCGAATA TGGTGTGCTC
GATGTGAAAA AGTGGGACTA CCTGGTTATC CCGCGCGGCA CCACCTACCA GCTTAAGTTT
GACGATTACA GCCAGGTTCG CCTGTTTGTT ATCGAAGCCT TCTCCATGGT GGAGGTGCCT
AAGCATTTCA GAAACGAGTA CGGACAGCTT TTGGAGTCCG CTCCCTATTG TGAGCGCGAC
ATCCGAGTTC CTAGCCTGCA AGAAGCCGTG GTCGAGAAAG GCGCCTTCCC CTTGGTCTGT
AAATTTGGTG ACAAGTATCA GCTGACACAG CTTGAGTGGC ATCCCTTCGA TCTCGTGGGC
TGGGACGGCT GTGTCTACCC TTGGGCTTTC AACATTCAGG ATTACGCCCC TAAGGTAGGT
CAGATCCACC TGCCGCCGTC GGATCATCTG GTGTTTACCG CCCACAACTT CGTTATCTGT
AACTTTGTGC CGCGCCCGTA CGACTTCCAT CCCCAGTCTA TCCCGGCGCC TTACTACCAC
AACAATATCG ACAGCGACGA AGTGCTCTAC TATGTCGACG GCGACTTCAT GAGCCGCACC
GGCATCGAGG CCGGCTACAT GACCCTACAT CAGAAAGGGG TACCACACGG ACCACAACCT
GGCCGCACCG AGGCCTCGGT CGGCAAGAAA GAGACCTATG AATACGCAGT GATGGTCGAT
ACCTTCGCCC CACTGCAGCT AACCCAACAT GTACAAGGTT GCATGAGTAA AGACTACAAC
CGCTCTTGGT TAGAAGACTA A
 
Protein sequence
MPFYVRQGEI PHKRHITFKK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVSPFEV 
NHGQTWQDTL IQNYKLDAKL ADREGNFYSA RNKIFFNNDV AMYSAKVTEA TEEFYRNAYA
DEVIFVHEGQ GKLYSEYGVL DVKKWDYLVI PRGTTYQLKF DDYSQVRLFV IEAFSMVEVP
KHFRNEYGQL LESAPYCERD IRVPSLQEAV VEKGAFPLVC KFGDKYQLTQ LEWHPFDLVG
WDGCVYPWAF NIQDYAPKVG QIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PQSIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVPHGPQP GRTEASVGKK ETYEYAVMVD
TFAPLQLTQH VQGCMSKDYN RSWLED