Gene Saro_3563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3563 
Symbol 
ID5077712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp181743 
End bp183917 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content64% 
IMG OID640481287 
Productcatalase domain-containing protein 
Protein accessionYP_001165949 
Protein GI146275789 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000309586 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACCC CCACGAAAGG CAAGGCCCCC GCTCGTGCCG CGAAGGAGCG CTCGCCCGCG 
CTCGACAATG CGCTTCGCGA CCATCAGCCC GGAGCGGGCC AGACCGATGA AGCGACTGCT
CTCGGCAATG CCGGGGAAAT CCACCAGGAG GCTGCTGCCG AAAACGATGC GGCTGCCTTC
CTGACCGACA ACTTCGGCCA TCGCCTGTCC GACAACCAGA ATAGCCTCAG GGCCGGAATG
CGCGGTCCGA CGCTGATCGA GGACTTCATC CTCCGCGAGA AGATCTTCCA CTTCGACCAC
GAGCGCATTC CGGAGCGGAT CGTCCACGCG CGCGGATCGG GTGCGCACGG CGTGTTCGAG
GTGACGCGCG CGATCCCCGA CCTGACCAGG GCCGGGTTGT TCCAGAAAAA GGGCCAGACC
TGCCCGGTCT TCGTGCGCTT TTCCACCGTG GCCGGCGGGG CCGGCTCGAT CGACACCCCG
CGCGACGTGC GCGGCTTCGC GGTCAAATTC TATACCGACG AGGGCAACTG GGACCTGGTC
GGCAACAACA TCCCGGTGTT CTTCATCCAG GACGCGATGA AGTTTCCCGA TCTGGTCCAT
TCGGTGAAGA TGGAAGCCGA TCGCGGCTAT CCGCAGGCGG CCAGTGCCCA TGACACCTTC
TGGGACTTCA TCGGGCTGAT GCCGGAATCG ATGCACATGA TCATGTGGGC GATGAGCGAC
CGCGCCATTC CCCGCACGCT GCGCATGATG GAAGGCTTTG GCGTGCACAC CTTCCGCTTC
GTCAATGCGG CGGGCGAGGG GCGGTTCGTC AAGTTCCACT GGAAGCCGGT GCTGGGCATG
GAATCGCTGA TCTGGGACGA GGCGGTGAAG GTCGCCGGGG CCGATCCCGA TTTCCATCGC
CGCGACCTGT TCGAATCCAT CGCTGCCGGG CACTTCCCGG CGTGGGACCT TGGGGTCCAG
GTCTTCGACG AGGAATTCGC CGCGAGCCAG CCGTACGATG TGCTCGATGC GACCAAGCTG
ATTCCGGAAG AGGACGTCCC CGTCGAGATC GTCGGGCGCA TGACGCTGAA CCGCAATGTC
GACAACTTCT TCGCCGAGAC CGAGCAGGTG GCGTTCCTGC CATCGAACGT GATCCCGGGT
ATCGACTTCT CGAACGACCC GTTGCTGCAA GGGCGCCTGT TCTCCTACCT CGATACGCAG
AAATCGAGGC TTGGCACGAC AAACTTCCAC CAGATTCCGG TCAATGCGCC CAAGTGCCCG
TTCCATAACA TGCAGCGCGA CGGCCTGATG CAGACGCTGG TGCCCACAGG CCGCGCCAAC
TACGAGCCCA ATTCGCTCGA CGAAGCGGGC GAGGACAGCG GGCCGCGCGC CTGCCCGGAA
ACCGGCTTCA CGTCGTTCCG CGAGAATGGC GAGCGCCACG ATCCGACCGA AAAGGTGCGC
GTGCGGGCGG ATCTCTTCGC CGACCACTAC AGCCAGGCGG CGCTGTTCTT CCACTCGCAG
ACCGAGAGCG AACAGGCGCA CATCGCCTCT GCGCTGGTGT TCGAACTGTC CAAGGTCGCG
CTGGAACATG TCCGGGCGCG GGTCGTGTCG CGGTTGCGCA ACATTGACGA GACGCTGGCG
CAGCGCGTTG CCGATGGCCT TGCGATGGAC CTGCCGGAAA AGGCGCCTGC CGCACGCCAG
CCGGTGAAGA TGAAGCCATC GGACGCCCTG TCGATCCAGA AGCAGGCGAA GAAGACCTTT
GCCGGACGCA AGGTCGGCAT TCTCTTTGCC GAAGGATCGG ACAAGGCGAC GATCGACAAG
CTGAAGGCGG GTGTGGAGGA GGCGGGTGGC ACCGTCTTCC TCGTCGCGCC CAAGGTCGGC
GGCATCCCGG TCAAGGGCGG CACGCTGAAG GCCGATGGCA AGCTGGATGG ATCGCCCTCC
GTCCTGTTCG ACGCGGTGGC ATCGGTGCTG ATGCCGGAAG CGGCGGCGAA GCTCGCCATG
CAGGGTGCGG CCGTGCAGTG GTTCATGGAT GCCTATGGCC ACTGCAAGAC AATCGCCCAC
TGCAACGGCA CCCGGATCAT CCTCGAGAAG GCCGGGGTGG AGCCTGACGA GGGCGTGGTG
CCCAATGAAA AGCTGCTCGA AGTCGGCCCT GTGCGCCACT TCGCGCGTGA GCCGAAGGTT
CGCGATCTGG CCTGA
 
Protein sequence
MATPTKGKAP ARAAKERSPA LDNALRDHQP GAGQTDEATA LGNAGEIHQE AAAENDAAAF 
LTDNFGHRLS DNQNSLRAGM RGPTLIEDFI LREKIFHFDH ERIPERIVHA RGSGAHGVFE
VTRAIPDLTR AGLFQKKGQT CPVFVRFSTV AGGAGSIDTP RDVRGFAVKF YTDEGNWDLV
GNNIPVFFIQ DAMKFPDLVH SVKMEADRGY PQAASAHDTF WDFIGLMPES MHMIMWAMSD
RAIPRTLRMM EGFGVHTFRF VNAAGEGRFV KFHWKPVLGM ESLIWDEAVK VAGADPDFHR
RDLFESIAAG HFPAWDLGVQ VFDEEFAASQ PYDVLDATKL IPEEDVPVEI VGRMTLNRNV
DNFFAETEQV AFLPSNVIPG IDFSNDPLLQ GRLFSYLDTQ KSRLGTTNFH QIPVNAPKCP
FHNMQRDGLM QTLVPTGRAN YEPNSLDEAG EDSGPRACPE TGFTSFRENG ERHDPTEKVR
VRADLFADHY SQAALFFHSQ TESEQAHIAS ALVFELSKVA LEHVRARVVS RLRNIDETLA
QRVADGLAMD LPEKAPAARQ PVKMKPSDAL SIQKQAKKTF AGRKVGILFA EGSDKATIDK
LKAGVEEAGG TVFLVAPKVG GIPVKGGTLK ADGKLDGSPS VLFDAVASVL MPEAAAKLAM
QGAAVQWFMD AYGHCKTIAH CNGTRIILEK AGVEPDEGVV PNEKLLEVGP VRHFAREPKV
RDLA