Gene Saro_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0453 
Symbol 
ID3918321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp495805 
End bp496860 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content65% 
IMG OID640443182 
Productxylose isomerase-like TIM barrel 
Protein accessionYP_495735 
Protein GI87198478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000058237 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACCA TCAAGGGCCC GGGCATCTTC CTCGCCCAGT TCGCCGGCGA CAGCGCGCCG 
TTCAACAGCC TCGAGACCAT CGCCGACTGG GCCGCCGGCC TCGGCTACAA GGGCGTCCAG
ATCCCGAGCT GGGACGGTCG CCTGTTCGAC CTCGAAAAGG CCGCGACCAG CAAGGATTAC
TGCGACGAGG TCAAGGGCAT GCTGGCGGCC AAGGGCATCG AGATCACCGA GCTGTCGACT
CACCTGCAGG GCCAGCTCGT GGCCGTGCAC CCTGCCTTCG ACGCGCAGTT CGACGGATTT
GCTCCTGCCT CGGTCCACGG CAATCCCGCC GCGCGGCAGC AATGGGCGGT ACAACAGCTC
AAGTTCGCGG CCCAGGCCAG CCGCAACCTC GGCCTCGCCG CCCATGCCTC GTTCTCGGGC
GCTTTCGCAT GGCCCTACTT CTATCCCTGG CCGGCGCGTC CGGCCGGATT GGTCGAGGAT
GCCTTCGATG AGTTGGCGCG GCGCTGGAAG CCGATCCTCG ACGTCTTCGA CGAGAACGGC
GTCGACGTTG CCTACGAGAT CCATCCGGGC GAGGACCTGC ACGACGGGGT GACCTTCGAG
ATGTTCCTCG AACGCGTCGG CAATCACCCG CGCGCGAACA TCCTCTACGA TCCGAGCCAC
TTCGTGCTGC AGCAGCTCGA TTACCTGGCG TTCATCGACA TCTACCACGA GCGCATCAAG
TGCTTCCACG TGAAGGACGC CGAGTTCCGC CCCAACGGTC GCTCGGGCGT CTATGGTGGC
TACCAGTCCT GGGTGGACCG GCCGGGCCGC TTCCGCAGCC TGGGTGACGG TCAGGTCGAT
TTCGCCGCGA TTTTCAGCAA GATGGCCGCC AACGACTATG CCGGCTGGGC CGTTCTCGAA
TGGGAATGCG CGCTCAAGCA TCCCGAGGTC GGCGCGGCCG AAGGCGCGCC CTTCATAGAT
CGCCACATCA TCAGGGTCAC CGAACACGCC TTCGACGATT TCGCCGCGGG CGGAGCTGAC
CGGGCGCTCA ACGCAAGGCT CATGGGCATC GGCTGA
 
Protein sequence
MKTIKGPGIF LAQFAGDSAP FNSLETIADW AAGLGYKGVQ IPSWDGRLFD LEKAATSKDY 
CDEVKGMLAA KGIEITELST HLQGQLVAVH PAFDAQFDGF APASVHGNPA ARQQWAVQQL
KFAAQASRNL GLAAHASFSG AFAWPYFYPW PARPAGLVED AFDELARRWK PILDVFDENG
VDVAYEIHPG EDLHDGVTFE MFLERVGNHP RANILYDPSH FVLQQLDYLA FIDIYHERIK
CFHVKDAEFR PNGRSGVYGG YQSWVDRPGR FRSLGDGQVD FAAIFSKMAA NDYAGWAVLE
WECALKHPEV GAAEGAPFID RHIIRVTEHA FDDFAAGGAD RALNARLMGI G