Gene Saro_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0349 
Symbol 
ID3918233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp375088 
End bp376569 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content66% 
IMG OID640443078 
Producthypothetical protein 
Protein accessionYP_495631 
Protein GI87198374 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAAA TTCGCACCGC CACTCGCCGT TTTGGCCTGT CCAAATCCAA GATCACCGCT 
TTCGAGCAAT GCCCCAAGAA GCTGTGGCTC GCGACCCATC GCCCCGAGCT TGCGGAGCAG
GACGATGGGG CTGAGGCGCG GTTCGCGACC GGTAATGCCG TGGGCGAGAT CGCCTGCGCG
CTTCATCCAG ACGGTGTGAT GGTCGATCCT CCGTCGTTGT CTGAAGCGCT GGCCAAGACC
TCCTCGCTCA TTTCAGAAGG GCATCCTGGC CCGATCTTCG AGGCGACCTT CGAGCATGAT
GGCGTGCTTG TGCGGGTCGA TGTGCTGGAG CGGCTCGATA GCGGGGGCTG GTCAGCGGCC
GAGGTCAAAA GCTCGGGCAG GGTCAAGGAT TACCACCGCG GCGATCTCGC CACGCAGGTC
TGGGTCATGC GTGAGGCGGG GATCAACCTG CAGCGCGCGG CCATCCGCCA TATCGACACC
AGTTTCGTGC TTTCACGTGA GGGCGATTAC GCGGGGCTCT TCACCGATGC CGATCTGCTC
GCCGACTTGG AGGACGCCAT CGCCACGCGC CCTTCGCTGG TGGCCGAGGC GCGCGCCACT
TTGTCCGGAG ACGAACCGCA GCGCGAGATG GGCGATCATT GCGGCGCGCC GTTCAGCTGC
GAGTTTACCG CTTATTGCGG CCGCGACCTC CCGCAAGGGC CTGAATGGCC CGTGACCCTG
CTCCCCTATG GCGGCGGCAG GCGCTGGCTG GAGCGCGGGG TCGAGGACCT GCTCGACTTG
GCCGAGACCG ACCTCAACGA CCGCCACGCA CGCATCCTTG CCGCCACGCG CGATGGCATC
CCGTTCCACG ATGCCGCAGG GGCGCGTAAA GTGATGGCCG GATGGGGCTG GCCACGCGCC
TGGCTCGACT TCGAGACCGT CGCCCCCGCG ATTCCGCGCT GGGTCGGTAC CCGCCCGTTC
CAGCAGATCC CGTTCCAGTT CTCGCTTCAC CTCGAACGGC GCGGCGGGCG CATGACCCAT
CATGAGTTCC TGAGCTGCGA CGGCAGCGAC CCGCGCCGGG CCTGCGCCGA AGCGCTGGTG
TCAAACATCC CCGAGGGTGC CACGATCATC GCCTACAACG CCGCCTTCGA GCGCAGCGTG
CTACGCGAGC TTGCAGCATC CTTCCCCGAT CTCTCCTCCC GTCTTGAAGC GATGGCCGAG
GCGACGGTCG ATCTTCTGCC GGTCGCTCGG AACCACTGGT ATCACCGTGA TCAGCGCGGC
AGCTGGTCGA TCAAGGCTGT GCTCCCGACC ATCGCTGCCG AGCTTGATTA CGGCGTCCTC
GAAGTGAAGG ATGGCGGTGA TGCGCAGGCG GCATGGTTCG AAGCTGCCGA TCCTGCCTGC
GATCCGCTGC GGCGCGAGGC GCTGGAAGAA GCGCTGAAGG CTTATTGCGC GCGCGACACG
TGGGCGATGG TCGCGGTGGC GCGGGCGCTG GCGGGAAGTT GA
 
Protein sequence
MTQIRTATRR FGLSKSKITA FEQCPKKLWL ATHRPELAEQ DDGAEARFAT GNAVGEIACA 
LHPDGVMVDP PSLSEALAKT SSLISEGHPG PIFEATFEHD GVLVRVDVLE RLDSGGWSAA
EVKSSGRVKD YHRGDLATQV WVMREAGINL QRAAIRHIDT SFVLSREGDY AGLFTDADLL
ADLEDAIATR PSLVAEARAT LSGDEPQREM GDHCGAPFSC EFTAYCGRDL PQGPEWPVTL
LPYGGGRRWL ERGVEDLLDL AETDLNDRHA RILAATRDGI PFHDAAGARK VMAGWGWPRA
WLDFETVAPA IPRWVGTRPF QQIPFQFSLH LERRGGRMTH HEFLSCDGSD PRRACAEALV
SNIPEGATII AYNAAFERSV LRELAASFPD LSSRLEAMAE ATVDLLPVAR NHWYHRDQRG
SWSIKAVLPT IAAELDYGVL EVKDGGDAQA AWFEAADPAC DPLRREALEE ALKAYCARDT
WAMVAVARAL AGS