Gene Saro_0724 details


Gene Information

Locus tag: Saro_0724
Symbol:
ID: 3918548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 763254
End bp: 765296
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 64%
IMG OID: 640443456
Product: glycosyl transferase family protein
Protein accession: YP_496005
Protein GI: 87198748
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
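
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(763254, 765296)
print(length, protein_length_aa(length))  # 2043 680
```

Both values agree with the Gene Length and Protein Length fields of this record.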


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.647901
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCAGG CGCACGCGAA TGCCGCCAGT CTGGAAGCCA GCGATCTGTT CGACGCCGAT 
TGGTATCTTG CCGAATACCC CGACGTGCAA AGCCTGCAGA TGCCGCCAGC GGTCCACTAC
CTGTGGCTGG GAGCGCGGCT TGGCCGCAAC CCGTCTCCGC GCTTCAGTAC GCGCAGCTAC
CTTGATGCTA ATCCGGATGT CGCGCAGGCC GGCATCAATC CGCTGCTTCA CTTCCTCCTG
GCAGGACGCG ACGAAGGGCG GTCAGGCACG ATACTCGCCG CGCCGTCCAG AGGTTCGACA
GAGACCGACG TCAACGCCAG CACCTTTGGC GAACATCGGC CCTATGTTTT CCTCCCGCCC
GCCGATCTCG ATCGCATCGC AGCGCAATGG TCGAAGGACA GCGCGCGTCC CGACGCCCAG
GGCCGGGGCA TCGCCATCTT CAGTGCGATC ACTGGCAGTT ACGACAGCAT CAACCACCAC
GAGCACCTGA TTCCCGGGGC GGACTACCTG CTGTTCAGCG ACGCGCCCAA GCCTCGGTAT
GTGTACCAAC CGCGACAGGC CCCCTGGTTC GATTGCGACA CCGTCCGGGC GGCACGCTTC
ATCAAGACGC ACCCACACAT GCTGCTGGGC GGCTATCGCA TCGCCGTGTG GATCGATGGC
AACATCCTGA TCAGGGGAGA TCTCCTTCCG CTCGTTCAGC GTTTCGAGGA GTCCGGGCTC
GCGTTCGGAG CGGTGCCGCA CCCGTTGCGC CAATCGGTCT ACGCCGAAGC GGTCGAGTGC
ATGAAACGCG GCAAGGATGA CGAAGCGACG ATCCGCCGCC AGATGCAGCG TTACCGCCGC
GAAGAATTCG ACTGCGAGGA CCTGATCGAG AGCAATCTCC TGATGTTCCG GCTCGGCCAC
CCATCACTCG CACCGCTGCT CGATACCTGG TGGGCGCAGA TCGAGAGTGG ATCGCGCCGG
GATCAGTTGT CTCTGAACTA TGCCCTGCAC AAGACAGGCG TCGAGTGGCT CGCGCTGACC
CAGCGCCCGC ACAGCGTCCG CGACCACCCG GCGCTCGCCT TGATGCATCA CCGGTCCCAG
TTCAATCCGG CCGAGCCTGG CGTCTGCCTC CCTCGCCCGC GGGAACGGAC GTTCGCCGAA
GTGCGCGCGG AGCGGATCGC TGCACAATCG GCGCGCCGGG CCGATGTCAT CGTCTGCGTG
CACAACGCTC CCGAGATGGT GGCACGCTGC CTTGATTCCG TGCGCACCGG CCAGAACCCC
GACCGTCATC GCATCATAAT AATCGACGAC GGTTCAGGCC GGGACACCGC GGAACTCGTT
TCGCGCTTTG CGGCGGATAC CCCGAACACC GTGCTGATCC GCAACGCTGT GGCGGGAGGC
TACACCCGTG CCGCCAACCA GGGCCTGAAA GCCATGGATG CCGACATGGC AATCCTTTTG
AACAGCGATA CCGTGGTGGC ACCCGGCTGG ATCGAAAAGC TGCTCGACGC CGCGTTCTCC
AATCCGGGCG TCGGTATTGT CGGTCCGATG TCCAGTGCCG CAAGTCATCA ATCCATTCCC
GAACACCAGA GCCGGCACGA CCAGACGGCG ATCAACGATT TGCCGCCCGG TTGGTCCGCC
GCCGACATGG ACGCCTGGTG CGAACGCATG GCGCCCGCCG ACTTCCTGCC CCGGGTTCCG
CTCGTGCACG GGTTCTGCTT CGCCGTAACC CGAGAAGCCG TTGAGCGGAT TGGATACCTC
GACGAAGACA GCTTTCCGGA CGGGTACGGC GAGGAAAACG ACTACTGCTT GCGCGCCACC
GACGCCGGTG TCGGGCTGGC AATCGCCACG CACACGTACG TCTTCCACGA GAAGTCGCAG
AGCTACCAAA GCGATCGCCG GATATCCCTC ATGAAGAGAG GCAACGGAAA GATCCGCGAG
TTGCACGGCG ACGAGCGTGT CGCTCGCGCG GTTCGCTCCA TGCAGCAGAA CCCGCACCTG
GGCCGCCTGC GCACTGCCGC GGCCCGCCTG TTCGCCGTCA CCACACCGGA GACACCTGCA
TGA
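
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed figure is for that excerpt and not for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as an excerpt.
excerpt = "ATGGATCAGG CGCACGCGAA TGCCGCCAGT CTGGAAGCCA GCGATCTGTT CGACGCCGAT"
print(round(gc_percent(join_blocks(excerpt)), 1))
```

Passing the full sequence to `join_blocks` and `gc_percent` gives the figure for the entire gene.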
 
Protein sequence
MDQAHANAAS LEASDLFDAD WYLAEYPDVQ SLQMPPAVHY LWLGARLGRN PSPRFSTRSY 
LDANPDVAQA GINPLLHFLL AGRDEGRSGT ILAAPSRGST ETDVNASTFG EHRPYVFLPP
ADLDRIAAQW SKDSARPDAQ GRGIAIFSAI TGSYDSINHH EHLIPGADYL LFSDAPKPRY
VYQPRQAPWF DCDTVRAARF IKTHPHMLLG GYRIAVWIDG NILIRGDLLP LVQRFEESGL
AFGAVPHPLR QSVYAEAVEC MKRGKDDEAT IRRQMQRYRR EEFDCEDLIE SNLLMFRLGH
PSLAPLLDTW WAQIESGSRR DQLSLNYALH KTGVEWLALT QRPHSVRDHP ALALMHHRSQ
FNPAEPGVCL PRPRERTFAE VRAERIAAQS ARRADVIVCV HNAPEMVARC LDSVRTGQNP
DRHRIIIIDD GSGRDTAELV SRFAADTPNT VLIRNAVAGG YTRAANQGLK AMDADMAILL
NSDTVVAPGW IEKLLDAAFS NPGVGIVGPM SSAASHQSIP EHQSRHDQTA INDLPPGWSA
ADMDAWCERM APADFLPRVP LVHGFCFAVT REAVERIGYL DEDSFPDGYG EENDYCLRAT
DAGVGLAIAT HTYVFHEKSQ SYQSDRRISL MKRGNGKIRE LHGDERVARA VRSMQQNPHL
GRLRTAAARL FAVTTPETPA
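
The protein sequence can be reproduced by translating the gene sequence codon by codon. The following is a minimal sketch, assuming the gene sequence is given 5' to 3' in the coding orientation. It builds the codon table from the compact amino-acid string of the standard genetic code, whose codon assignments translation table 11 shares (the two tables differ in which start codons are permitted). The `translate` function is a hypothetical helper and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGATCAGG CGCACGCGAA TGCCGCCAGT"))  # MDQAHANAAS
```

The output matches the first ten residues of the protein sequence listed above.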