Gene Saro_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0872 
Symbol 
ID3917957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp927106 
End bp928386 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID640443605 
Productmajor facilitator transporter 
Protein accessionYP_496151 
Protein GI87198894 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.224242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAGGAA ACGCAGCCGC CAACGGCTCG CAGAAGGCCG CATTCGCCGC GGTCACAGTG 
CTGTTCTTCG CCTGGGGCTT CATCACTTCG CTGATCGACC CACTCGTGGC TGCGGTGAAG
GGCATCTTCA CGCTGACCAC GCTGGAAGCC CAGCTCTCCG CCTTCGCGTT CTTCATCGCC
TACGGCTTCA TGAGCTTCCC CGCGGCCGCG ATCATCGGCA AGGTGCGCGC GGTTCCGGCG
ATCCTGCTGG CGCTGGCGAC AATGGCAGCG GCCTGCCTGG TCATGCTGAC CGCCGCAAAC
GCCGCGTCCT ATCCGCTGGT CCTGGCCGGC CTGTTCATGC TCGCCGCCGG CATCACGGTG
CTGCAGGTCG CCGCGAACCC GCTTGCAGCC GCGCTGGGCA AGCCCGAGGG CAGCCACTTC
CGGCTGACCC TGTCGCAGAC CTTCAACTCG TTCGGCACGT TCCTCGGCCC GGCACTCGGC
GCCGCCCTGT TCCTCAAGGG CGTCGAGGTG ACCGAAGGCA CCGCGCTGAC CCCGGAAGTC
CGCGACGCGG CACTGGCCGG AATCGACCGC GCCTACTTCT GGATCTGCGG CCTGCTCATC
GTGCTGTTCG CGTTCTTCTT CCTCAACCGC AAGCGCATCG CCGCCGCCGC ACCGCCGGCC
CAGCCCACCG GCGGCATCGG CGCGCTGCTG AAAGAGGCGT TCGCATCGCG CTGGGCGCTT
CTCGGCGCCG CCGCGATCTT TGTCTACGTC GGCGCGGAAG TGGCGATCGG CACGCAGATG
GCCTTCTTCC TCAACGCCCC GCACATCTGG AACGTGAGCC TGGAACAGGC CGGCAAGGCC
GTGTCGTTCT ACTGGCTGGG CGCGATGATC GGTCGCGCCA TCGGCACCGT CCTGCTCGCC
CGCTTCCCCG CAGCGCGCCT GCTGATGCTG TTCTCGGTCA TCGCCGCCGT TCTGTGCGCG
TGGATCCTGG TGGTCGGCGG TGTTTCGGCG GGCTACGTGG CGCTTTCCAT CGGCCTGTTC
AACTCGATCA TGTTCCCGGT GATCTTCACC CTGACGCTCG AGCGGTCGAG CGCGGGCGAG
GAAGCGACCT CGGGGCTGCT ATGCACCGGC ATCATCGGCG GCGCCCTGAT CCCGGCGCTG
GCGGGCGCGG TGACCGACGC GACCGGCATC GTCACCTCGT TCGTCGTCCC GCTTGCCTGC
TACCTCCTGC TGGTCCTGTT CGCGGCAAGC GCCGGCAAGG CGCCCATCCT GCGCCGGGCA
AGCAGCGAAA GCGTGCACTG A
 
Protein sequence
MSGNAAANGS QKAAFAAVTV LFFAWGFITS LIDPLVAAVK GIFTLTTLEA QLSAFAFFIA 
YGFMSFPAAA IIGKVRAVPA ILLALATMAA ACLVMLTAAN AASYPLVLAG LFMLAAGITV
LQVAANPLAA ALGKPEGSHF RLTLSQTFNS FGTFLGPALG AALFLKGVEV TEGTALTPEV
RDAALAGIDR AYFWICGLLI VLFAFFFLNR KRIAAAAPPA QPTGGIGALL KEAFASRWAL
LGAAAIFVYV GAEVAIGTQM AFFLNAPHIW NVSLEQAGKA VSFYWLGAMI GRAIGTVLLA
RFPAARLLML FSVIAAVLCA WILVVGGVSA GYVALSIGLF NSIMFPVIFT LTLERSSAGE
EATSGLLCTG IIGGALIPAL AGAVTDATGI VTSFVVPLAC YLLLVLFAAS AGKAPILRRA
SSESVH