Gene SO_3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3791 
Symbol 
ID1171435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp3942266 
End bp3943456 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content50% 
IMG OID637345561 
Productrenal dipeptidase family protein 
Protein accessionNP_719328 
Protein GI24375285 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCTA TCAATCACCG TCGCCGAACT CTGCTCAAAG GACTCAGTGC AGTTACTGGA 
TTGAGCGCCG CCAGTGCATT AAGCCCTTTT GCCAGTTTCT CTAGTCTTGC CGCGCCGCTC
CCGCCACGTG CCCGGCGCTT ATATATCGAT GGATTATCCT TTTTGCCCGA TGATTTAGCC
GATGTTCGCG CCTCAGGTCT TGATGCCTTT TTGTGCGATA TATCAGCCAT TGAGACCATT
GAACAAGCCG ATGGCACTGT AAACTACAAG CGCACCTACA AAGCCTGCAT GGAAAGCATC
CAACAAGCCG CCAAACGGGT GAGTGAGCAT CCGGATATCC TTTTACAAGG CTTAACAGGA
CGCGATATAC AGCTTGCAAG GGAGAACAAT CGCACTGCGG TTTTCTTTCA AATTCAAGGT
GCTGATTGCG TTGAAGAAGA TAGCGATGCT AACCAATGGG CGCGCGTTGA TGCGTTTCAT
CGCCAAGGCC TATGCGCACT GCAGCTCACC CACCATTATG GCAATACCTT TGCGGGCGGC
GCGCTAGATA ACGATGCCAA TGGCGGGCTC AATAAACCGC TGACTAATCA TGGTCGAGCG
CTTATCGCAA AACTCAACCA AGCCAATATA TTAATTGATG TTAGCCACTC AAGCGCCCAG
ACCGCTTTAG ATGTGGCGAA ACTGAGCCGC GCACCCATAG TTCAAAGCCA TGGCGCGGCA
CGCGGGATCG TCAAGCATGC CCGTTGTAGC CCAGATGAAG TGATCCGTGC CATTGCTAAT
TCAGGCGGTG TTTTTGGTGT CTTTATGATG AGCTTTTGGC TCACCAATAA GGCCATCCCA
ACGGTCAATG ACTATATTCG CCAGTTAGAA TATGTTGCAC GCATTGGCGG GGTCGATTGC
GTTGCCATCG CCAACGATTT TCCGCTGCGA GGCCAAGAGA ACTTATTAGC TCTCAATAAC
GACAACACTC AAGGTGTTAA GGAATATCAG GATTGGTGGT ACAGCCTAAG GGCCAAAAAA
GTATTAGGCT TTGATGCCGA GCCAAGGCAT GTGGTGATCC CAGAGTTAAA CCATATAGAA
CGCATGAGTC GAATTGACGA CGCCTTAGCC AAGGCTCGAT TTAAATCGAC CGACCGTGAT
CGCTTTATGG GTGGAAACTG GCACCGAGTG TTAAATCAAG TATTAATCTA A
 
Protein sequence
MKAINHRRRT LLKGLSAVTG LSAASALSPF ASFSSLAAPL PPRARRLYID GLSFLPDDLA 
DVRASGLDAF LCDISAIETI EQADGTVNYK RTYKACMESI QQAAKRVSEH PDILLQGLTG
RDIQLARENN RTAVFFQIQG ADCVEEDSDA NQWARVDAFH RQGLCALQLT HHYGNTFAGG
ALDNDANGGL NKPLTNHGRA LIAKLNQANI LIDVSHSSAQ TALDVAKLSR APIVQSHGAA
RGIVKHARCS PDEVIRAIAN SGGVFGVFMM SFWLTNKAIP TVNDYIRQLE YVARIGGVDC
VAIANDFPLR GQENLLALNN DNTQGVKEYQ DWWYSLRAKK VLGFDAEPRH VVIPELNHIE
RMSRIDDALA KARFKSTDRD RFMGGNWHRV LNQVLI