Gene Saro_0838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0838 
Symbol 
ID3915893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp886512 
End bp888626 
Gene Length2115 bp 
Protein Length704 aa 
Translation table11 
GC content69% 
IMG OID640443570 
Producthypothetical protein 
Protein accessionYP_496117 
Protein GI87198860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.827538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAGGC CTGACATGAA GCCAATCGCG GCCGCGTTCC TGCTTGCCCC CATGATCGCG 
GCGCCGGCCA TGGCAGCATC TCCCAAGGCA TTGCCTGCGT CTAACCGTCA GGTGCAGGTC
GTCCCCGCAC CGGCCTGGGT GGTCCCCCCG CCCACGCCGA CCGAGCAGGC GGACCTTCCG
ACCGCCGCGA TGCGCTTCGT CTACGTCGAC AACCAGGCCT TCGCCGGGCC GGCCGGGCTG
GAATCCTACT CGGCCTACCG CATCCGCCTG CTCAAGCCCG AAGCACTGGC ACTCGGCACG
ATCACCCTTT CGTGGTTGCC GGACGCCGGA AGCGCCCGCC TCCACGCATT GCGCCTGATC
CGTGACGGCA AGGTCACGGA CCTAACCGCC AACGCGAAGT TCGAGGTGAT CCAGCGCGAA
AGCAACCTCG AGGCATCGAT GCTCGATGGC AGGCTGACCG CGGTCTACCA GGTGCCCGGC
CTCCAGGTCG GGGACGAGAT CGAGATCGCC CAGACCGTCA CCATCAAGGA CCCGACCCTT
CCGGAGCATC GTTCGGGCCT GGCCATCCTG CCGCAGGGCG GCGTCCCCGG CGCCTTCCGC
ACGCGCATCG CCTGGCCGGA AGGCGCCGCG ATACGCTGGC AGGCGACAAA GGACGTCGAG
GTTGCGGAAC CATCGCTAGT CGGCGCGAAC CGCGTCCTCT CGGTCGAACT GCGCGATCCC
GCCGCGCCCG AGGAGCCGGT CGTCGGCGCA CCCCCGCGCT ACTCGATCCA CCGCCTGATC
GAGTTCACCG ACTTCGCCAG GTGGCCCGAA CTCTCGGCGC GCCTCTGGCC GCTCTACGAC
AAGACCTCGC GCCCTGCCCC GGGTTCGCCG ATCCTGGCCG AAGCGGCAAA GATCGCGGCT
GCCACATCCG ACCCCACGCG CAGGGCCGAA ATGGCGCTGC GCCTCGTGCA GGACCGCATC
CGCTACGTCT ATGTCGGCCT CGACACCGGC AACATGACGC CGGCCAGCGT CGACGAAACA
TGGACCCGCC GCTTTGCAGA CTGCAAGGGC AAGACCGTCC TGCTCATCGC CATCCTGCGC
GAACTGGGCA TCGCGGCAGA GCCGATGCTG GTCAATTCGA ACGGCGGCGA TGGGCTCGGC
CTGCGCCTGC CCAACCCCGG ACTGTTCGAC CACGTGATCG TGCGCGCGAC GATCGCCGGG
AAGCCGTGGC TCCTCGACGG TACGCGGCTT GGCGACCGCG CGCTCGACCT GCTGCCGGTC
GGGGCATGGC GCGAAGGCCT GCCCCTGCGC GAAGGCGGCG GCGAACTGGA GAAGCTGCCG
ACGCCGTCTC CGGTCCATCC GCAGGCGGTC AATCTCCTCG ACATCGACGC GACGGCCGGG
ATCGACCAGC CTGCACTCGT CACCGCAAGG CGCATCCTGC GCGGGAACGA TGCGGCAAGC
CTCGCCGCAT GGTTCGCCAC GGTCCCCGCG GATCAGGCGC AGCGCGCGAT CAAGGAATAC
TGGCGCGGCG AGGAACCGTG GATCGAAGGC GACAAGGCCT CATGGAAGCT CGACGAGGAT
AGCGGCATCC TCACGCTGAC CCTTACCGGC GAAGGGGAAC TGGGCGACCC GGACGAAGCG
AAGACGGAGA ATGGCAGCGT CGACGTGCCG GCAAGCGGAC TTACCGCGCC AAGCAGGTTG
CGACGCCCCC GGTCGCAGGA CCAGACCCTG CCCTGGGTAA CCGCTTACCC CTCGTTCAAC
TGCTGGGCCA CCACGCTGCG CCTGCCGCCG CCGCCAGCGA ACCAGCGCTG GGATCTTTCG
GGCGAGCCGT TCGACAAGCT GATGGGCGGC GTCGGTTATT GGCGGCGCCT CTCGCTGGCC
GACAACGTGG TACGAACCGT GATGAGCCGC CGTTTCCAGG TCCCGGAAAT CAGCGCCGCG
CAGGCCACCG AACTCAATGG ACAACTGGCC AGCTACGATG GCAGCGCGGC AACGCTGTCG
CTGCGCCAGA CGTTCAAGGG CGCGGCAAAG TGGCCGCAAC AGCCCCAGCC CTTCTCCGAC
GCGACCGACT GGACGCAGGG CGGCACCCCA TGCGCCCCGG CCCAGAACGC CAAACCTTCC
GCCGCCGGAC AGTAG
 
Protein sequence
MQRPDMKPIA AAFLLAPMIA APAMAASPKA LPASNRQVQV VPAPAWVVPP PTPTEQADLP 
TAAMRFVYVD NQAFAGPAGL ESYSAYRIRL LKPEALALGT ITLSWLPDAG SARLHALRLI
RDGKVTDLTA NAKFEVIQRE SNLEASMLDG RLTAVYQVPG LQVGDEIEIA QTVTIKDPTL
PEHRSGLAIL PQGGVPGAFR TRIAWPEGAA IRWQATKDVE VAEPSLVGAN RVLSVELRDP
AAPEEPVVGA PPRYSIHRLI EFTDFARWPE LSARLWPLYD KTSRPAPGSP ILAEAAKIAA
ATSDPTRRAE MALRLVQDRI RYVYVGLDTG NMTPASVDET WTRRFADCKG KTVLLIAILR
ELGIAAEPML VNSNGGDGLG LRLPNPGLFD HVIVRATIAG KPWLLDGTRL GDRALDLLPV
GAWREGLPLR EGGGELEKLP TPSPVHPQAV NLLDIDATAG IDQPALVTAR RILRGNDAAS
LAAWFATVPA DQAQRAIKEY WRGEEPWIEG DKASWKLDED SGILTLTLTG EGELGDPDEA
KTENGSVDVP ASGLTAPSRL RRPRSQDQTL PWVTAYPSFN CWATTLRLPP PPANQRWDLS
GEPFDKLMGG VGYWRRLSLA DNVVRTVMSR RFQVPEISAA QATELNGQLA SYDGSAATLS
LRQTFKGAAK WPQQPQPFSD ATDWTQGGTP CAPAQNAKPS AAGQ