Gene Saro_2785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2785 
Symbol 
ID3916945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3005420 
End bp3006472 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content68% 
IMG OID640445564 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_498055 
Protein GI87200798 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTGA TCCTTGGCAT AGAATCCAGC TGCGATGAAA CCGCCGCCGC GGTGATCGAC 
AGCAATGGCG CGTCGCTCGA AGCGCGCATC GTGGCGCAGC GCATCGCCTC GCAGGACGAA
GCGCATCGAC CCTATGGCGG CGTCGTTCCG GAAATCGCGG CGCGCGCGCA CGCCGAAGTG
CTCAGTCCGA TGATCGCGGC GGTGCTGGCC GACGCGGGGA TCGGGCTGGA CGACCTCGAT
GCCATCGCCG CGACCGCGGG GCCGGGGTTG ATCGGCGGCG TCATGGTCGG TCTCGTTACC
GGAAAGGCGC TGGCCATGGC GGCGGACAAG CCGCTGATCG CGGTCAACCA CCTGGAAGGC
CACGCGCTTT CGCCGCGACT GGCCGAACCC TCGTTGCAAT ACCCCTACCT GTTGCTGCTG
GTTTCGGGCG GACATTGCCA GATCCTGGAA GTTGCGGGGG TCGGGCAGTT CCGCCGCCTT
GCCACCACCA TCGACGATGC CTTGGGCGAA GCGTTCGACA AGACCGCGAA GATCCTCGGC
CTCGGCTATC CAGGTGGGCC GGCGGTGGAA CGGATGGCGC GAGAGGGCAA CCCCAAGGCC
GTGCCCCTGC CGCGCCCGCT GGTCGGCAGC GGGGAGCCGC ACTTCTCGTT CGCGGGCCTG
AAGAGCGCGG TCATGCGGGC GAAGGATGCC GGTGTTCACG GGGACGCCGA CATCGCCGCT
TCATTCCAGC AGGCGGCGAT AGATTGCGTG ATCGATCGCA CCCGCATCGC GCTTGAGACT
GCTTCTCCGG GCATGACGGC GCTGGTGGTG GCTGGAGGCG TCGCCGCCAA TGCCGCCTTG
CGTGGCGCGC TGGAAGGACT GGCGGAGAGC CACGGGCTTT CGCTGGTCGC GCCGCCGCCG
AAACTGTGCA CAGACAACGC TGCGATGATT GGTTGGGCTG GCGCGGAACG CCTTGCTCTG
GGATATGTCG ATCCACTTGA CGTGGCGGCG CGTCCGCGCT GGCCGCTCGA CGAGAACGCC
GCGCCGGTGC GCGGAGCAGG GGTAAAGGCA TGA
 
Protein sequence
MALILGIESS CDETAAAVID SNGASLEARI VAQRIASQDE AHRPYGGVVP EIAARAHAEV 
LSPMIAAVLA DAGIGLDDLD AIAATAGPGL IGGVMVGLVT GKALAMAADK PLIAVNHLEG
HALSPRLAEP SLQYPYLLLL VSGGHCQILE VAGVGQFRRL ATTIDDALGE AFDKTAKILG
LGYPGGPAVE RMAREGNPKA VPLPRPLVGS GEPHFSFAGL KSAVMRAKDA GVHGDADIAA
SFQQAAIDCV IDRTRIALET ASPGMTALVV AGGVAANAAL RGALEGLAES HGLSLVAPPP
KLCTDNAAMI GWAGAERLAL GYVDPLDVAA RPRWPLDENA APVRGAGVKA