Gene Saro_3642 details

Gene Information

Locus tag: Saro_3642
Symbol:
ID: 5077790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 270562
End bp: 271830
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 67%
IMG OID: 640481365
Product: hypothetical protein
Protein accession: YP_001166027
Protein GI: 146275867
COG category:
COG ID:
TIGRFAM ID:
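
The listed coordinates and lengths are mutually consistent, which a short calculation can confirm. The following is a minimal sketch that assumes the start and end positions are 1-based and inclusive (an assumption, though one that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 270562
end_bp = 271830

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid to the protein.
assert gene_length % 3 == 0
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1269 bp
print(protein_length, "aa")   # 422 aa
```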


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.222709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCG CCACACCCAG GCTGATCCAC ATCGACGACA TGGCCGATCC CGTCGTCACG 
CCCGAACTTG CCGCGTGGCG CGAGGGGCCG GACGACTTTC CCTGCCCGAT GACTGCCGAC
GAAGTGCTGG CGCGCGCGAT GGCGGAAACG GGGCTCGACG ACTTTGGCGA GGATACCGGC
TTTCGCACCC GGCTCGGCGT GATCCTCGAC GCGCTCTACG AGGACGAGGG ACTGACGCGG
GGCGGCCGCG TGTTCGTGCT GCAACAGGCG GTGCGCGCGA TGGCCAACCG CCTGCGCGTG
GAAGACCTGA TCCGGCGCCA CCCTGAAATC CTCGACGTGC CAGTGGAAAA GCCGATCTTC
ATCGCCGGCC TGCCGCGATC AGGCACGACG CACCTCGTCA ACTGGCTGTC GCGCGACGAC
CGGCTGGACA GCCTGACGCT GTGGGAATCG GAGGAACCGG TCGCGGGCCC GCCCCTGCCG
CCGGGCGAGA CCGATCCGCG CATGGCCCGT TCTGCCGCGT ACTGGGGAGC GTTCGGCGCG
CTCGTTCCGC ACATGACGGC GATGCACGAG ATGGCGGCGA ACGACATCCA CGAGGACAAC
GAACTGCTGT TCATGGATAT GAACTGCTAC AACTGGGAGT TCTCCTGCCG CCTGCCGCGC
TGGACCGCGC ATTACCTCGC CCATGACCGG ACGGCGTCCT ACGCCTACGA GCGCAAGGTG
CTCCAGGCCA TAGCCTGGCA GCGGGGCAGG AAGAACGGCG TCCGCTGGCT GCTGAAATCG
CCGCAGCACA TGGAAAACCT CGCCGCGATC AAGGCGGTGT TCCCCGACGC GACGATGGTC
ATCACGCACC GCGATCCGGT GGACGTGCTG CGTTCGCTGA CCACGATGCT GGGCTATTCG
GACCGGACCC GGCGCGACCC TGTCGACCCG CCGGGGCTGG CGCGGCTGTG GACCGGGCGG
ATCGAGAAGC TGCTTCGCGA ATGCGTGGCG CAGCGCGACG CCTTCGGGCC GGAGCAGTCG
ATCGACGTCG CGTTCCACGA ATACATGGCC GACCAGGAAG GCATGGCCCG GCGCATCTAC
CGCCTCGCCG GGCTGGACCT GCCGCCCGAA ACAGAGGCGC GCCTGCTGGG CTACCTTTCG
GAGAACCCGC GCCATGCCCA GGGCAAGGTC GTCTACGATC TCGAAGGCGT GTTCGGGGTC
GACATTGCCG CGCTGCGCGA ACGCTTTGCC TTCTACTACG AACGCTTCCC CGTGAAGCAG
GAGAACTGA
 
Protein sequence
MNAATPRLIH IDDMADPVVT PELAAWREGP DDFPCPMTAD EVLARAMAET GLDDFGEDTG 
FRTRLGVILD ALYEDEGLTR GGRVFVLQQA VRAMANRLRV EDLIRRHPEI LDVPVEKPIF
IAGLPRSGTT HLVNWLSRDD RLDSLTLWES EEPVAGPPLP PGETDPRMAR SAAYWGAFGA
LVPHMTAMHE MAANDIHEDN ELLFMDMNCY NWEFSCRLPR WTAHYLAHDR TASYAYERKV
LQAIAWQRGR KNGVRWLLKS PQHMENLAAI KAVFPDATMV ITHRDPVDVL RSLTTMLGYS
DRTRRDPVDP PGLARLWTGR IEKLLRECVA QRDAFGPEQS IDVAFHEYMA DQEGMARRIY
RLAGLDLPPE TEARLLGYLS ENPRHAQGKV VYDLEGVFGV DIAALRERFA FYYERFPVKQ
EN
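
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a small script. This is a sketch under stated assumptions, not the method used to produce the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table holds the standard amino acid assignments, which translation table 11 shares for internal codons. The demo inputs are the first 30 bases and the last 9 bases of the gene sequence above.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as laid out in the record.
prefix = "ATGAACGCCG CCACACCCAG GCTGATCCAC"
# Last nine bases of the gene sequence (two codons plus the stop codon).
suffix = "GAGAACTGA"

print(translate(prefix))  # MNAATPRLIH
print(translate(suffix))  # EN
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the protein sequence and the GC content field listed in the record.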