Gene Saro_0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0601 
Symbol 
ID3915613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp645976 
End bp647553 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content69% 
IMG OID640443331 
Productsulfotransferase 
Protein accessionYP_495882 
Protein GI87198625 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAG CGCCTTCCGG ACTGGTCGAC GCCGCCCGCG GTGCCCTTGT CCGGGGCGAT 
CTGGAAGCGA CGAATCGCGC CGCCGCCGCG ATGGTGGCGG CAAGTCCCGG GGACGCCGAA
GGCCACTTTC TGCTGGGGGT GGCCGAAGCC GGTGTCGGGC GTATCAAGGC GGGCATCGGG
CACATCGAGC GGGCGGTGAC GCTCGATCCG AGAGGCGAAT ACCTCGCGCA CCTCGCCAAG
TTCTATTGCC TTGTGCGCAG GGACCGCGAC GCAGCGAACG CGCTGCGCAG GGCCGAGGCC
GCACCGCCCG CCGACGCGCT GGGCCGCGAT ACGATGGGCT GCGTCTTTGC CCGGCTCGGC
GACCATGCTG CCGCGCTTCC CCATTTCGCC GAGGCCGTGC GGCTGCGCCC CGACATTGCC
GAATATCGCT ACAACGAGGC GGTTACCCTG AACTTCCTCG GCCGCGTCGA TGAGGCCGAG
GCGGCCATCG AGGCGCTGCT GGCGCAAGCG CCCGGCCATG CGCGCGCGCA TCACCTGCTG
GCGGGCCTGC GCCGACAGAG CGCTGAGCGC AACCACGTGG CGCGGCTTGG CCAGGCAAGG
GCTCGGGCCC GGGGCGGGCG TGACCGGCTC CTGCTGGGCT ATGCGCTGGC CAAGGAACTG
GAGGACATCG GCGAATACGA CCACGCTCTC GACACGCTGT GCGAGGCCAA TGCCGAACAC
CGGCGCAACC TGCCCTATGC GTTCGAGCGC GACGCCGCGA TCTTCGATGC GATCGAGGAA
AGCTGGCCGA TGTTGCGCGA TGCGGCACCG ACGGCGCCGA GCAAGGCGTC CCCGATCTTC
GTCATCGGCA TGCCCCGCAC CGGCACGACG CTGGTCGATC GCATCCTCGG TTCGCACCCC
GAAGTCGAAA GCGCGGGCGA GTTGCAGGCA ATGCCGCTGG CGGTGAAGAG GGCGGCGGCG
ACCAACAGCG CGACGGTCAT GGACCCGGAA ACGATCCGCG CGGCCACCGG AGCCGACATG
GCGGCGGTCG GTCGGGATTA TCTGGAGCGT GCCCGCCACC ATCTGCGCGG CGGGGCTGCG
CGCTTTACCG ACAAGTTTCC CGGCAACTTC CACTATGCGG GCTTCATTGC CCGTGCCCTG
CCCGAAGCGC GGATCGTCTG CCTGCGGCGT CATCCGATGG ACACGGTGCT GAGCAATTTC
CGCAACCTCT TCGCGGTCGG CTCGCGGTAC TACGACTACA GCTACGACCT GCTCGACATA
GCGGCCTATT ATGCCCGGTT CGACCGCCTG ATGGCGTTCT GGCGCGAGGC GCTGCCCGGG
CGCGTGCTGG AGCTGCGCTA CGAGGATCTG GTCGCGGACC AGGAAGGGCA GACCCGGCGG
CTTCTGGAGC ACTGCGGGCT CGGCTGGTCG GAAACATGCC TGGAATTCCA CAGCAACGCC
GCGCCGGTTT CGACGCCGAG CGCGGCGCAG GTCCGCCGTC CGATCTACGC CGATTCCGTC
GCGCGCTGGA AGCGGCATGG AGAGGTGCTG GGGCCTGTCG CGCGCTTCTT CGAGCAGACC
GGCATCGCGA TCGATTGA
 
Protein sequence
MTTAPSGLVD AARGALVRGD LEATNRAAAA MVAASPGDAE GHFLLGVAEA GVGRIKAGIG 
HIERAVTLDP RGEYLAHLAK FYCLVRRDRD AANALRRAEA APPADALGRD TMGCVFARLG
DHAAALPHFA EAVRLRPDIA EYRYNEAVTL NFLGRVDEAE AAIEALLAQA PGHARAHHLL
AGLRRQSAER NHVARLGQAR ARARGGRDRL LLGYALAKEL EDIGEYDHAL DTLCEANAEH
RRNLPYAFER DAAIFDAIEE SWPMLRDAAP TAPSKASPIF VIGMPRTGTT LVDRILGSHP
EVESAGELQA MPLAVKRAAA TNSATVMDPE TIRAATGADM AAVGRDYLER ARHHLRGGAA
RFTDKFPGNF HYAGFIARAL PEARIVCLRR HPMDTVLSNF RNLFAVGSRY YDYSYDLLDI
AAYYARFDRL MAFWREALPG RVLELRYEDL VADQEGQTRR LLEHCGLGWS ETCLEFHSNA
APVSTPSAAQ VRRPIYADSV ARWKRHGEVL GPVARFFEQT GIAID