Gene Saro_2515 details


Gene Information

Locus tag: Saro_2515
Symbol:
ID: 3916836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2719619
End bp: 2720899
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 640445272
Product: amidohydrolase
Protein accession: YP_497785
Protein GI: 87200528
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
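
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2719619, 2720899)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both results agree with the Gene Length and Protein Length fields listed for Saro_2515.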


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCTTGT TCGCCCGTGT TCTCTTGATT GCCTCGAGCC TTTGCGCCGC GCCGCTGGCC 
GCGCAGAAGC AGGAGGCGCT GACGCTCCTG CGACCCGACG CGGTATTCGA CGGCGAGACC
GCCGTCCTGC GCAAGGGTTG GGCCGTGCTG GTGCGGGGCA ACCGGATCGA GGCGGTGGGA
CCGGATGTCG GCGCACCTGC CGAGGCCTCG GTGCTGGAAC TGCCGGGAAC GACCCTGATG
CCCGGCATGA TCGAGGGGCA TTCGCACCTC TTTCTCCACC CCTACAACGA GACGCCGTGG
GACGACCAGG TCCTGCACGA ACCGCTCGCG TTGAGGACTG CGCGGGCGAC GGTTCATGCG
CGCGCGACGC TGATGGCAGG CTTTACCACC GTGCGTGATG TCGGCACCGA GGGCGCCGGC
TATGCCGACG TGGGGCTGAA GCAGGCCATC GAGCAAGGGA TCGTGCCGGG GCCGCGCATG
CTGGTGGCGA CGCGGGCCAT CGTGGCGCCC GGCGCCTACG GGCCGCGCGG GTTCGAGCCG
GGCGTGGCAG TACCGCTGGG GGCCGAGGAA GCCGGCGGGC CGGACCTTGT CGACGCGGTG
CGTCGGCAGA TCGGGGCGGG TGCGGATCTG GTGAAGGTCT ATGCCGACTA CCGCTGGGGA
CCGGGCGAGC CGAGCCGCCC GACCTTTACC GAAGGCGAGC TGAAAGCGGC GGTAGAGGCT
GCGCACAGCG CCGGGCGGCA GGTCGTCGCC CATGCCAGCA CGGCGGAAGG AATGCGCCGT
GCCGTGGCAG CGGGCGTCGA CACCATCGAG CATGGGGACG AAGGTACGCC GGAGGTCTTC
GCCGCGATGA AGGCCAGGGG CGTGGGCTTC TGCCCGACGC TGGCGGCCGG GGATGCGGTG
GCGCGCTATC GCGGGTGGAA CGGTACAGCG CCCATGCCGA AGAGCGTGCA GGAAGGGTTC
GATGCACTTG CAAAGGCGCG GAAGGCCGGG GTGGCGATTT GCATGGGCGG CGATGTTGGC
GTCTATGCGC ACGGCGACAA TGCGCGCGAA GCGGAAATGA TGGTCAAGGG CGGAATGACG
CCTGGCGAAG TGGCCATCGC CGCAACATCG GGCAATGCGC GCATGTTCGG CATCGGCGGC
CGTCTGGGCG CGGTCAGGAC GGGTATGCTG GCTGACCTCG TGGCGGTCGA AGGCAATCCG
CTCGCCGATA TTTCAGCGAT CCGGAAGGTG GCGCTGGTGA TGAAGGACGG CGTGCTGTGG
AAAGGGCCTG TGGGGCGCTA G
 
Protein sequence
MRLFARVLLI ASSLCAAPLA AQKQEALTLL RPDAVFDGET AVLRKGWAVL VRGNRIEAVG 
PDVGAPAEAS VLELPGTTLM PGMIEGHSHL FLHPYNETPW DDQVLHEPLA LRTARATVHA
RATLMAGFTT VRDVGTEGAG YADVGLKQAI EQGIVPGPRM LVATRAIVAP GAYGPRGFEP
GVAVPLGAEE AGGPDLVDAV RRQIGAGADL VKVYADYRWG PGEPSRPTFT EGELKAAVEA
AHSAGRQVVA HASTAEGMRR AVAAGVDTIE HGDEGTPEVF AAMKARGVGF CPTLAAGDAV
ARYRGWNGTA PMPKSVQEGF DALAKARKAG VAICMGGDVG VYAHGDNARE AEMMVKGGMT
PGEVAIAATS GNARMFGIGG RLGAVRTGML ADLVAVEGNP LADISAIRKV ALVMKDGVLW
KGPVGR
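
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the nucleotide sequence so it can be compared with the protein sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, the codon table is built from the standard amino acid ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), and translation stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGCGCTTGT TCGCCCGTGT TCTCTTGATT")
print(translate(first_blocks))  # MRLFARVLLI
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of the record.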