Gene Saro_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1049 
Symbol 
ID3916344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1088007 
End bp1089080 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content67% 
IMG OID640443783 
Productrhamnose-proton symporter 
Protein accessionYP_496328 
Protein GI87199071 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCAA ACCCCGCACT GGGCGTGATC TTCCACTGGC TGGGCGGCTT CTCGTCGGCC 
AGCTTCTACG TGCCGTACAA GCGCATCAAG CTCTGGTCGT GGGAGGTTTT CTGGCTCGCG
GGCGGCCTGT TCTCCTGGGC CATCGCGCCG TGGCTCTTCG CCTCGCTGCG CACCAACAAC
CTTCTCGGCG TGCTTTCGTC CGCGCCTTCC TCGACGCTGT TCTGGTGCTG GTTCTGGGGC
GCGATGTGGG GCTTCGGCGG CCTCACCTTC GGGCTGACCA TGCGCTATCT CGGGCTTTCG
CTCGGCATGG CGGTGGCACT TGGCCTTACC ACGGTAATCG GCACGATGGG CCCGCCGATA
TTCGACGGTA CGCTTGGCCA GATTGCCGCG ACGACCAGTG GCAAGCTGAC CCTGCTCGGC
ATTCTCGTCA CGCTAGTGGG CATCGTCGTG GTGGCGCGCG CCGGCAGCGC CAAGGAAGCC
GAACTGGGCG GCGCGGCCTC CGAAGGCGTC GCCGAATTCA ACTTGCGCAA GGGCCTGCTG
ATCGCGGTAT TCTCGGGCGT GATGTCTGGC TGCTTCGCCT GGGGCCTGGC CGCAGGCCAG
CCGATCCGGG ACCTGACCCT CGCCGCCGGC ACCGATCCGC TCAGCCAGGG CCTGCCGGTC
CTCTGCATCG TCCTTGCGGG CGGACTGACC ACCAATGCCC TGTGGTGCGC GTACCTGATC
GCCCGGAACC GCAGCTTCGG CCAGTTCTTC GGCGCGGCCC CTGCCGATGC CGCTCCGGGC
GAGCGGCCCA ACCTTCTTGC CAACTGGCTT CTGGCCGCGC TCGGCGGAAC GCTGTGGTAC
GGGCAGTTCT TCTTCTACAC GATGGGCGAA AGCCAGATGG GCAAATACGG CTTCTCGTCG
TGGACGCTGC ACATGGCGTC GATCATCCTG TTTTCGACGC TCTGGGGCTT CGCGCTGAAG
GAATGGAAGG GCGCATCCGG CCGCACGCGC ACGCTGGTGT GGACCGGCAT CGGCCTTCTG
GTCGGCGCGA CGGTGGTGAT CGGCGCGGGG AACATGCTGA ATGCGTCAGC CTGA
 
Protein sequence
MGANPALGVI FHWLGGFSSA SFYVPYKRIK LWSWEVFWLA GGLFSWAIAP WLFASLRTNN 
LLGVLSSAPS STLFWCWFWG AMWGFGGLTF GLTMRYLGLS LGMAVALGLT TVIGTMGPPI
FDGTLGQIAA TTSGKLTLLG ILVTLVGIVV VARAGSAKEA ELGGAASEGV AEFNLRKGLL
IAVFSGVMSG CFAWGLAAGQ PIRDLTLAAG TDPLSQGLPV LCIVLAGGLT TNALWCAYLI
ARNRSFGQFF GAAPADAAPG ERPNLLANWL LAALGGTLWY GQFFFYTMGE SQMGKYGFSS
WTLHMASIIL FSTLWGFALK EWKGASGRTR TLVWTGIGLL VGATVVIGAG NMLNASA