Gene Saro_3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3159 
Symbol 
ID3918201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3371606 
End bp3372700 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID640445943 
Producthypothetical protein 
Protein accessionYP_498428 
Protein GI87201171 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGGC AGATATTCCT GCTGGCTGGC GCGGCGGCGT TGGCGATCGG TGCGCCCGCC 
CTCGCGCAGG GCAAGGGCGG TAACGGCGGC AATGGCAACG GCCAGGGCGG CGAGCACGGC
GCGCAACATG GCGGCGGCGC AAAGGCGCAG GGCCAGGGCA ATCGCGGCGG CGGCGAGGCC
GCAAAGGGTC CGGAACGCAA GATGGCCAGC GTCCAGCCCG GACGGAGCGA CAAGGCCTCG
CCGGCCAAGG CAGAGCGCGG GCCCGACCGC GCGGTTGCCG CCGCCGGCAA GGCAAACCGC
AACGAGCAGG CCGATAGCCG TGCGATGGAG GACACCGCAC CCGGCCGGAG CGGCCAGGCA
AAGGGCATCT ACAACGGCAA GGGACCTGGC AACAGCGCCG ACCTTGCGCG CGGCAATGCG
CAGCGGGCCG CGCCCGGCAA TCTGCGCGAG GCCGCCCGGG TCGTCGCCAC GCGCCGCTGG
GACGGCGGGC GCTATCGCTA TGACGACAGC CGCTATCTCG TCCCGGTAAG CGATTCCTGC
CCGCCGGGGC TGGCGAGGAA AAACAACGGC TGCCTCCCGC CCGGGCAGGC GCGCAAGCTG
GCGCCGACCG GCGGATGGTC TGGCTGGTAT CCCACCCGCT ACTTCGGCGA CGGCTACGAC
TGGCGCTATG ACAACGGCTA TCTCTACCGG CTTGGCAACG GCGGGCTTGT CTCGGCCTTC
GTGCCGCTGC TGGGCGGCGC CCTGTTCGGC GGCAATATCT GGCCTTCGCA GTACACCAGC
TACGAGGTGC CAGCCTATTA TGACCGCTTC TACGGCTACG ATGACGACTA CGATTATCGT
TATGCGCAGA ACGCCATTTT CGCAGTCGAC CCCGAGACGC AGCAGATCGA GGCAATCGCG
GGTCTCCTGA CCGGCGATCC GTGGTCGGTC GGGCAGGCGA TGCCGCTCGG CTACGACATC
TACAACGTGC CGCCAGCCTA CCGCGACCGC TACGTCGACG GGCCGGATGC CATGTACCGA
TACAGCGACG GCTACGTCTA CGAGGTCGAC CCCACGACCC AGCTCGTGCG GGCCGTGATC
GAACTGCTAG TCTGA
 
Protein sequence
MMRQIFLLAG AAALAIGAPA LAQGKGGNGG NGNGQGGEHG AQHGGGAKAQ GQGNRGGGEA 
AKGPERKMAS VQPGRSDKAS PAKAERGPDR AVAAAGKANR NEQADSRAME DTAPGRSGQA
KGIYNGKGPG NSADLARGNA QRAAPGNLRE AARVVATRRW DGGRYRYDDS RYLVPVSDSC
PPGLARKNNG CLPPGQARKL APTGGWSGWY PTRYFGDGYD WRYDNGYLYR LGNGGLVSAF
VPLLGGALFG GNIWPSQYTS YEVPAYYDRF YGYDDDYDYR YAQNAIFAVD PETQQIEAIA
GLLTGDPWSV GQAMPLGYDI YNVPPAYRDR YVDGPDAMYR YSDGYVYEVD PTTQLVRAVI
ELLV