Gene Saro_0359 details


Gene Information

Locus tag: Saro_0359
Symbol:
ID: 3918243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 395159
End bp: 396898
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 66%
IMG OID: 640443088
Product: hypothetical protein
Protein accession: YP_495641
Protein GI: 87198384
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3843] Type IV secretory pathway, VirD2 components (relaxase)
TIGRFAM ID:
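
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(395159, 396898)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.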


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGACG ACTTCGAGAT CCGTCCCGGC CGATCGCGGG ACCAGGGCAT GCCGGCCAGC 
CGCAAGGCGA CCTCGCTCGT CGGCCAGGTG CTCAAAGCGT CGCGCCGCTC CGGGGCAACG
CCACTGGCTC GCCGGAGCGG CAAGCCTGCC GGGACGGGAA GATATGGCCG GGGCCGCGCA
GCCGCACTGC GCGCGCGGCG TTCGCCCTAC CAGCGCCGCG TGGTGATCAA GGCGCGCGTC
GTGCGACACA AAGGCGCGCG GTTCCGGGCC GCGCCGCTTG CGATGCACGT CTCCTACCTC
GAGCGCGATG GCGTGACGCG CGACCAGGAG CGCGGTCAGC TCTTCGATGC CGGCGTGGAC
AATGCCGATG GCGAGGCATT CGCGCAACGC TGCGCGGATG ACCGGCACCA TTTCCGGTTC
ATCGTGTCAC CCGAAGATGC GACCGAGCTT GCCGACCTGC GCACGTTCAC TCGCGAGCTG
ATGGATGACA TGGCCCGCGA TCTCGGCACC CGGCTCGTCT GGGTCGCGGT GGATCACTGG
AACACCGACA ACCCTCATGT TCATGTCCTC GTCCGGGGGC GGGCAGCGGA TGGCGCGGAC
CTGGTCATCG ACCGCGACTA TATTCGTGAG GGCATGCGCT CACGGGCCGA AGAGCGCGTC
ACCATCGAGC TCGGGCCGCG GAGCGAACGC GACATCCAGC GCGCCATGGC TCGTGAGGTC
TCAGCGGAAC GGTGGACAGG CCTCGACCGG CAACTGCGCA CGCTGCAGGA TCACGACCAG
GTGATCGATC TGCGCCCAGC AGCGGATCAG GATCGCCGGC GCCATGCGCT GCTTGTAGGC
AGGGTCAACT CGCTGGCACG CATGGGGCTG GCGAGCGAAA CCCAGCCAGG GCGCTGGACG
ATGCGCGCCG ATGCGGAGAA GACGCTGCGC GACCTCGAAA TCCGCGGTGA TGTCATCAAG
ACGATCCACC GCTCGATGGC GGAGAACGGA TGGCGCTCAG ACCTGTCCCG GCTTGCGATT
CACGACCAGC AGCCATCGGA TCCGATCATC GGCCGACTGG CCAGCAGGGG GCTTCACGAC
GAGCTATCAG GCAAGGCCTA TGCCGTCGTC GATGGCATGG ATGGTAGGAC ACATCACCTG
CGTTTCAACG ATCTCGAAGC CACCAGCGAT GCCCGGCCCG GCGCGATCGT GGAACTGCGG
CATTGGACCG ATCGCAAGGG ACAAGGCCAT GCCGCGCTGA CGGTCCGTTC GGATCTGGGA
TTGGCGGAAC AGGTCACGGC CAAAGGAGCG ACGTGGCTCG ACCAGCAACT CGTGGCGAAG
GAGCCGACGG CACATGGGCC AGGCTTCGGG CGGGAGGTCG AGGAAGCCCT GCAACAACGC
TCCGAGCATC TGGCTGATGA TGGACTGGCA ACCCGGCAGG GGCGGCGGTT TCTGTTTGCG
CGCGGTCTTC TCGAGACCTT GCGTCAGGGA GAGATGGCCG AGGCGGCGAA CAGGCTCTCC
CGGCAGACCG GGCTTGAGTT GCAAGCGAGT GGACCCGGGG AGCATGTCGC CGGCATTTAC
AGGCAGCGTG TCGACCTGGC GTCGGGTCGC TTCGCCATGA TCGACAATGG TCTGGGGTTC
CAGCTAGTGC CCTGGCAGCC AGTATTGGAG CGCAAACTTG GCCAGGCCGT GGCTGGTGCG
GTGGACCAGC GCGGAGGGGT CGACTGGAGT TTTGCGCGGT CGAGGTCGAT TTCCCTTTGA
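
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. The helper names are hypothetical, and the sample string is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only; paste all lines to check the full gene.
first_line = "ATGGATGACG ACTTCGAGAT CCGTCCCGGC CGATCGCGGG ACCAGGGCAT GCCGGCCAGC"
print(len(clean_sequence(first_line)))       # 60
print(f"{gc_content(first_line):.0%}")
```

Running `gc_content` on the complete sequence should give a value that rounds to the reported GC content if the record is internally consistent.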
 
Protein sequence
MDDDFEIRPG RSRDQGMPAS RKATSLVGQV LKASRRSGAT PLARRSGKPA GTGRYGRGRA 
AALRARRSPY QRRVVIKARV VRHKGARFRA APLAMHVSYL ERDGVTRDQE RGQLFDAGVD
NADGEAFAQR CADDRHHFRF IVSPEDATEL ADLRTFTREL MDDMARDLGT RLVWVAVDHW
NTDNPHVHVL VRGRAADGAD LVIDRDYIRE GMRSRAEERV TIELGPRSER DIQRAMAREV
SAERWTGLDR QLRTLQDHDQ VIDLRPAADQ DRRRHALLVG RVNSLARMGL ASETQPGRWT
MRADAEKTLR DLEIRGDVIK TIHRSMAENG WRSDLSRLAI HDQQPSDPII GRLASRGLHD
ELSGKAYAVV DGMDGRTHHL RFNDLEATSD ARPGAIVELR HWTDRKGQGH AALTVRSDLG
LAEQVTAKGA TWLDQQLVAK EPTAHGPGFG REVEEALQQR SEHLADDGLA TRQGRRFLFA
RGLLETLRQG EMAEAANRLS RQTGLELQAS GPGEHVAGIY RQRVDLASGR FAMIDNGLGF
QLVPWQPVLE RKLGQAVAGA VDQRGGVDWS FARSRSISL
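
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free translation, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard table. The compact table layout (bases ordered T, C, A, G) and the function name are illustrative choices, not part of the record.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T/C/A/G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1; stop codons are written as '*'."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence.
print(translate("ATGGATGACG ACTTCGAGAT CCGTCCCGGC"))  # MDDDFEIRPG
```

The printed residues match the first block of the protein sequence. Translating the last line of the gene sequence gives the final residues followed by `*` for the TGA stop codon.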