Gene Saro_1879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1879 
Symbol 
ID3917100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1981934 
End bp1983067 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content66% 
IMG OID640444623 
Productlevansucrase 
Protein accessionYP_497153 
Protein GI87199896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGTGG TCAATCCATC AGAACAATCG ATTGCCTCGT TTGGCGCGAC GCCGTGGCGT 
CCCGCAGGCT ACGGCCAGAG CGCCCGGATT CCGCTCATCG AAGCCGCCGA TGTCGTGCGC
CTCTTCGACG ACCTGGACCT GTGGGACTGC TGGCCCCTCG CGCACGAGGA CGGGCGTACG
GTTGAGCATC TGGGACGCAA CTGGTGGTTC TTTCTTTCGG CGCCGGTCTT CCCCGATCCG
GTCGAACGGC ATGGCCATGC CCGCATCCGC CTCGTCTCGC TGGGGGAGGA TGGATGGAAG
GATCACGGCA ACGCCTTTCC CGATGGTCTC ACGCCCGGCA GCCGCGAATG GGCGGGTTCG
GCCGTGCTGA TGGACGACGG GCGCACCGTG CAGCATTTCT TCACCGCCGC AGGACGGCGC
GGCGAGGCTG CACCGACCTT CGAGCAACGC ATATTCGTCA GCGAAGGCAC CCTGACCGAG
GCCGGCCCTG GCGGATGGCA AGCCCCGCGC GAGATATTCG AGGCCGATGG CCTACGCTAC
GTGCTCGACC GGCAGGACAG TGGGGCGCCG GGCCAGATCA AGGGTTTTCG CGATCCCGCG
TGGCTTCGAG ATCCGGCCAC CGGCAGGGCG CACATCCTGT TCACCGGCAG CGCCGCATGG
TCGGATCATC CTTTCAACGG CAATGTGGGG ATCGCCACGC TCGAGGGTGA CACCTGGGTT
CTCGGCAATC CACTGGTCGA GGCGATCGAC GTGAACAACG AGCTTGAACG GCCGCACATC
CTGGTGCGCG ACGGGCTGTA CTATCTCTTC TGGTCGACCC AGACCCACAC TTTCGCGCCC
GCTGCGGTGG CAGGGCCCAA CGGCCTCTAC GGCATGGTGG CTGAAAGCCT TGCGGGCCCC
TGGCGCATGC TCAACGAAGG CGGGCTGGTC GCGGCGAACC CGGATGCGGA AGCAAAGCAG
TCCTACAGTT GGTGGGTCAC CGGCGAGGGC GAAGTGTGGA GCTTCGTCGA CTACTGGGGC
ATGGCAGGGC GCACCGTCGA GGAGCAACCC GAATTGCTGC GCAGCAATTT CGGGGGAACC
CCCGCACCTC GGTTCATGCT TAACTTCGAT GGCGAGCGGG TCACCATCGC CTGA
 
Protein sequence
MSVVNPSEQS IASFGATPWR PAGYGQSARI PLIEAADVVR LFDDLDLWDC WPLAHEDGRT 
VEHLGRNWWF FLSAPVFPDP VERHGHARIR LVSLGEDGWK DHGNAFPDGL TPGSREWAGS
AVLMDDGRTV QHFFTAAGRR GEAAPTFEQR IFVSEGTLTE AGPGGWQAPR EIFEADGLRY
VLDRQDSGAP GQIKGFRDPA WLRDPATGRA HILFTGSAAW SDHPFNGNVG IATLEGDTWV
LGNPLVEAID VNNELERPHI LVRDGLYYLF WSTQTHTFAP AAVAGPNGLY GMVAESLAGP
WRMLNEGGLV AANPDAEAKQ SYSWWVTGEG EVWSFVDYWG MAGRTVEEQP ELLRSNFGGT
PAPRFMLNFD GERVTIA