Gene Saro_3261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3261 
Symbol 
ID3917519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3482170 
End bp3483354 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content66% 
IMG OID640446045 
Productpeptidase M19, renal dipeptidase 
Protein accessionYP_498530 
Protein GI87201273 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.35092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGCA AGATTGGCTG GGCAGTGATC TTCATTGTCC TGTTTGCCGC CGCCTTCGTG 
CTCGGCCCGC TGCCGGCGAT GGTCGAGAAG CGGATGAACG TGATCGACGG CCAGCCGCTG
CTGACCGTCA GCGAAAGGGC CAAGGCGCTT CACCGCACGT TGACGATCGT CGACCTTCAT
GCCGACACGC TGATGTGGCG CCGCAACCTT ACCGACCGCG CTGCCCAAGG CCATGTCGAC
CTGCCGCGCC TGATCGACGG GCACGTTGCG CTCCAGGTCC TGTCGTCGGT CACAAAATCG
CCCAAGGGAC TGAATTACGA CGCGAATCCT TCGAACAGCG ATACCATCAC CGCGCTTGCC
GTCACCCAGA TGCAGCCGGT ACGCACATGG AATTCGCTAC TCGAGCGGTC GCTCTGGCAT
GCCGAAAAGC TCGACCGCGC GGTGGCCGGC TCCAGCGGCG AACTGGTCAA GGTCACTGGC
CAGGCTTCGC TCGACGATCT GCTGCGCGAA CGCGGCGAGG GCGCGCTGCC GGTCGGTGCG
ATGCTTTCGA TCGAGGGACT CCACGATCTC GAGGGCAAGC GCGAGAACCT CGACCGGCTC
TACGACGCGG GCTTCCGCAT GGCGAGCCTG ACACACTTCT TCGACAACCA GCTCGCCGGG
TCGATGCACG GCGAACGGAA GGGCGGCCTA ACTCCGTTCG GGCGGCAGAT CGTGCGCGCG
ATGGAAGACA AGGGCATGAT CGTCGACATC GCCCACCTGT CGCATCCCGG CGTTGCCGAG
CTGCTTGCCA TGGCCCGCCG CCCGGTCGTC TCCAGCCACG GCGGCGTCCA GGCCACCTGC
AAGGTCAACC GCAACCTCAC CGACGCAGAG ATTCGCGGCG TCGCCCGCAC GGGCGGGGTG
ATCGGCATCG GCTACTGGGA TGCCGCCATC TGCGACACAT CGCCCCGCGC CGCCGCGCGC
GCCATGCGCC ATGTGCGCGA CCTTGTCGGC ATCCAGCATG TCGCGCTGGG CAGCGACTTC
GACGGCGCCA CCACCACCCG CTTCGATACC TCGCAGCTCG AACAGGTGAC CCAGGCCCTG
CTTGACGAAG GCTTCAGCGA CGACGAAATA CGCGCCGTGA TGGGGCTCAA CGCACTTCGG
GTGATCCGCG CCGGGATCGT TCCGCTGGGA GGCGGCGCAC GGTGA
 
Protein sequence
MRRKIGWAVI FIVLFAAAFV LGPLPAMVEK RMNVIDGQPL LTVSERAKAL HRTLTIVDLH 
ADTLMWRRNL TDRAAQGHVD LPRLIDGHVA LQVLSSVTKS PKGLNYDANP SNSDTITALA
VTQMQPVRTW NSLLERSLWH AEKLDRAVAG SSGELVKVTG QASLDDLLRE RGEGALPVGA
MLSIEGLHDL EGKRENLDRL YDAGFRMASL THFFDNQLAG SMHGERKGGL TPFGRQIVRA
MEDKGMIVDI AHLSHPGVAE LLAMARRPVV SSHGGVQATC KVNRNLTDAE IRGVARTGGV
IGIGYWDAAI CDTSPRAAAR AMRHVRDLVG IQHVALGSDF DGATTTRFDT SQLEQVTQAL
LDEGFSDDEI RAVMGLNALR VIRAGIVPLG GGAR