Gene Saro_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3149 
Symbol 
ID3918191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3360447 
End bp3361565 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content67% 
IMG OID640445933 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_498418 
Protein GI87201161 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAAGA CGATACTCGT GGTGTTCGGG ACGCGGCCCG AGGCGATCAA GCTGTTTCCG 
GTGGTGCACG CGCTGCGTGC CGATCCGCGC TTTCGCGTGG TGACCTGCGT TTCGGCGCAG
CACCGGGGGA TGCTCGACCA GGTGCTGGAG ATCGCGGGGA TCGTGCCCGA CCACGATCTC
GACCTGATGC GGCCGGACCA GACGCTCGAC GCGCTGACGG CGGCGCTTCT GACGGAACTG
GGCAAGGTGA TGGATGCCGT GCGGCCCGAC TGGGTCGTGG TCCAGGGCGA TACGGCGACG
GCGATGGCCG GGGCGCTGGC GGCTTATTAT CGCAAGCTTC CGGTCGCGCA TGTCGAGGCG
GGCCTGCGCA GCCACAACAT CTATCACCCG TGGCCCGAGG AGGTGAACCG CAAGATTATC
GGCACGATCG CGCGGCTGCA CTTCGCGCCG ACCGAGGTAT CGGCTGCCGC GCTCAGGGCG
GAGAACGTGA CCGAGGGCGT TCACGTGACC GGCAACACGG TGATCGACGC CTTGCAGTGG
GTTTCGGGCC GGATTGCGGC GGAGCCGGCG CTGGCGGCGG GGCTGGCCGA GATCGAGGCG
CGCTTTGCCG GCAAGCGGAT CATCGGCGTA ACCAGCCACC GCCGCGAGAA TTTCGGCGGG
GGGCTTGAGA ACATCGCCGA GGCGATCCGC CGCATCGCGC AGCGGGACGA CGTGGCGCTG
GTCTTTCCGG TCCATCCCAA CCCCAACGTG CGCAAGGTGA TGGACGATGC GCTGGCGGGG
CTGCCCAACG TCGCGATGAT CGAGCCGCTC GACTATCCGC ACTTCGCCCG GCTGTTGTCG
ATCGCGGAAA TCATGCTGAC CGATTCGGGA GGGGTGCAGG AAGAGGCCCC CGCGCTCGGC
AAGCCGGTGC TGGTCATGCG GGAGACGACC GAGCGCCCCG AGGGCGTGAC CGCCGGGACC
GCGCGGCTGG TGGGGACCGA CGTGGACACC ATCGTTACCG AAATCTTCAC CCTGCTCGAC
GATAAGGCTG CCTATTCGGC CATGGCGCGC GCTCACAATC CCTTCGGGGA TGGGCAATCT
TCGCGCCGAA TCGTGGAGTT GCTGGCGAAT GATGGGTGA
 
Protein sequence
MVKTILVVFG TRPEAIKLFP VVHALRADPR FRVVTCVSAQ HRGMLDQVLE IAGIVPDHDL 
DLMRPDQTLD ALTAALLTEL GKVMDAVRPD WVVVQGDTAT AMAGALAAYY RKLPVAHVEA
GLRSHNIYHP WPEEVNRKII GTIARLHFAP TEVSAAALRA ENVTEGVHVT GNTVIDALQW
VSGRIAAEPA LAAGLAEIEA RFAGKRIIGV TSHRRENFGG GLENIAEAIR RIAQRDDVAL
VFPVHPNPNV RKVMDDALAG LPNVAMIEPL DYPHFARLLS IAEIMLTDSG GVQEEAPALG
KPVLVMRETT ERPEGVTAGT ARLVGTDVDT IVTEIFTLLD DKAAYSAMAR AHNPFGDGQS
SRRIVELLAN DG