Gene Saro_2423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2423 
Symbol 
ID3916742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2604974 
End bp2606890 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content69% 
IMG OID640445178 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_497693 
Protein GI87200436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.204978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTGC CTGCCCTTTC CCGTCGCACC GTGCTGTCGG GCTCGCTCCT TGCAGGCGCC 
GCCGCCTGCA CGCCCAAGGC GGGCGGCCTT GCCTCCTCGC CTGTCGTGCC CGCACTGCCC
GCGCCATGGG GCGCCGTGCC GCATCCGCGC CAGGTGAAAT GGCACGACCG GCGCATGTAC
GCCTTCATCC ATTTCTCGAT GAACACCTTC ACCGACAAGG AATGGGGTTT CGGTGACGAA
GACCCCGCCA TGTTCAACCC CACCGATTTC GACGCCGACC AGATCGTCGG CGCGGCGGTG
GCGGGCGGAC TTACCGGCCT CATCATCACC GCCAAGCACC ACGACGGCTT CTGCCTCTGG
CCCACCACGC TGACCGAACA CTGCGTGCGC AATTCGCCCT GGCGCGGGGG CAAGGGCGAT
GTCGTCGGCG AACTGGAGGC CGCGTGCCGC CGCGCCGGGA TCAACTTCGG CGTGTATCTC
TCGCCGTGGG ATCGCAACCG CGCCGATTAC GGCAAGCCGT CCTACGTCGA ATACTATCGC
GCCCAGTTGA CCGAATTGTG CACCCGCTAT GGCAAGCTGT TCGAGGTGTG GTTCGACGGC
GCGAACGGCG GCGACGGCTT TTATGGCGGC GCGCGCGAGA CGCGGCAGAT CGATGCGCCG
AAGTATTACA ACTGGCCCGG CATCATCGAA CTCGTCCACT CGCTCCAGCC GGATGCCTGC
ACCTTCGACC CGCTCGGCGC GGACATCCGG TGGGTCGGCA ACGAGGAAGG CCATGCCGGC
GATCCCTGCT GGCCAACCAT GCCGAACGCG CCCTACGAAA TGGACAAGGG CTATACCGGC
GTGCGCGGCG CGGAACTGTG GTGGCCGGCG GAAACCGATG TCTCGATCCG TCCCGGCTGG
TTCTACCACG CCGACGAGGA CACCCAGGTC AAGACGCCGC AGAAGCTGAT GGAGATGTTC
GACCGCTCTG TCGGCCATGG CAGCAACTTC CTGCTGAACC TGCCGCCCGA CCGCCGTGGC
CGGATTCCCG ATCGCGACGT CGCCAGCCTC AAGGCCTTCG GCGATGCGAT CCGCGCCACG
TTTGCGCAGG ATCTGGCGCG CGGGGCGCTT GCCAGTGCCA GCGCGGACAT CGGCTCCACC
GCCGCCAGCG CCATCGACGG CAATCCCGAC ACGTTCTGGT GCGCGCCCGC CGAAGCGCGC
GACGCCGCGC TCGCGCTGGA ACTCCAGCCC GGGACCCGGT TCGACACGAT CGTCTTGCGC
GAATGGCTGC CGCTGGGACT GCGCACCACG ACTTTCGCCA TCGACATCGC CGACGACGGC
GGCGAATGGC GCGAAATCGC GCGCAAGGAC ATGGTCGGCC CCGAACGCCA TGTCCGCCTG
CCCGCGCCCG TCTCTCCGCG CCGTGTGCGC TTCCGTGCCA TCGCGGCAGA GGCAGGGCCG
ACGCTGCGGG AATTCGCGCT CTACCTGTCG TCAGCGCCCA TCGAACTGCC GCCCGCGGTG
CCTTCGGACC CCAGCATCGT CTCGCGCCGC CGCTGGAAGA TCGTGGCCGC CAGCGCACCC
GGGGCCGATG CCGTGCTCGA TGAAAACCCG AAGAGCGCGT GGACAGCGCC CGCCACGGCT
TCGCTGACCA TCGACCTCGG CGGGGAAGAG AAGCTCGCGG GCTTCACCCT GACGCCCACG
CGCCACATCG ATCCGCAAGC CGCCCCGCCG GCGCGCTGGC ACGTCGAGAC GAGCCTCGAC
GGCAAGTGCT GGAGCAAGGC GGAAGAAGGC GAGTTCCAGA ACATCAACTA TGCCCGCGCG
ACGCAGCGCA TCGCCTTTTC CGCGCCGCGC AACGCCCGCT ACCTGCGCCT CGCCTTCCCG
CGCCCGGCCG TGCCCGCACC GGCCATCGCC GTGGCGACAA TCGGTGCCTT TCGCTAG
 
Protein sequence
MTLPALSRRT VLSGSLLAGA AACTPKAGGL ASSPVVPALP APWGAVPHPR QVKWHDRRMY 
AFIHFSMNTF TDKEWGFGDE DPAMFNPTDF DADQIVGAAV AGGLTGLIIT AKHHDGFCLW
PTTLTEHCVR NSPWRGGKGD VVGELEAACR RAGINFGVYL SPWDRNRADY GKPSYVEYYR
AQLTELCTRY GKLFEVWFDG ANGGDGFYGG ARETRQIDAP KYYNWPGIIE LVHSLQPDAC
TFDPLGADIR WVGNEEGHAG DPCWPTMPNA PYEMDKGYTG VRGAELWWPA ETDVSIRPGW
FYHADEDTQV KTPQKLMEMF DRSVGHGSNF LLNLPPDRRG RIPDRDVASL KAFGDAIRAT
FAQDLARGAL ASASADIGST AASAIDGNPD TFWCAPAEAR DAALALELQP GTRFDTIVLR
EWLPLGLRTT TFAIDIADDG GEWREIARKD MVGPERHVRL PAPVSPRRVR FRAIAAEAGP
TLREFALYLS SAPIELPPAV PSDPSIVSRR RWKIVAASAP GADAVLDENP KSAWTAPATA
SLTIDLGGEE KLAGFTLTPT RHIDPQAAPP ARWHVETSLD GKCWSKAEEG EFQNINYARA
TQRIAFSAPR NARYLRLAFP RPAVPAPAIA VATIGAFR