Gene Saro_3129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3129 
Symbol 
ID3918171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3338468 
End bp3340807 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content66% 
IMG OID640445913 
Producthypothetical protein 
Protein accessionYP_498398 
Protein GI87201141 
COG category[S] Function unknown 
COG ID[COG5448] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02217] conserved hypothetical protein TIGR02217 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0416541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCATT GGCTCGCCAA GCGCCGAACG GTGCAGCAGA CCGACATGAT CCAGCGGTTC 
GACCCGCGCT TCTGGACCGT GAACTTCCCG CGACCGGCGA TGGCCTCGGT CGTGACGACA
GCCGCTGATG CCATGCGCGT CGACGTGGTG TTCCAGAAGG CAGACGACCT TGTCGGCCTG
ATCTGGGAAT CTGAGGACAG GTGGGACCAT CCGCTGCTCG CCTACGAAAC GCGGCAGGAC
TATTCACGGC TGACGTTGAC CTTTCGCTGG CGCTCGTCAG GCATAGTCGC GCTCAACGCT
GTCAATGGTC CGACGCTGAC AATCGAGGGC CGCGATGCGA GCGGCACGGC ACGGGCCTGG
TACGTCCGGC TGTGGAACTA TGCAGACGGA TCGCCGACCG ATGCGCTGAT CAGACTGCCG
TTTTCGGACC TTGCCGGTGG TTTCCTGCTG CCGTCGGAGG CGGACCCGGT CCACCCCGCT
GACATCGACC GCATGTTCAT CTCGCTCGTC CCGCCGGGTT ACGCGCCGGG CAGCGACGCG
GCCTATGCAA ATGCGGTCGA GGGCTGGGCG GAACTGTCGG ACATCCACTG CGAAGGGCAT
CGCCCGATGC TCGAGATCGG TGACGTCATG GTGCCACCGC ATGGCCTGGC GATCTGCACC
GGCTATGACG ATGCCTACAA CCTGACGCCG GCGCGTGTCC TGCGGCAGGT GCGCGGTCTG
GGGTACCGCG GGAGCATCAA CCACTACATC GGGATGAGCC ATTTCTTCCC GCTGGCACCT
GATGGCGTGG GCGGCTTCGT CGTGGACGGG GCGTTGCCTG CAATGAACGC CGCGGCGAAG
GCGTGGCAAG AGGCGTTCTT CACCGACGCC TGCGCGATGG GATATACCGT CATCGCCTCC
CAGTCCTACG AGTTGCTGGC ACAGCATTGC CCCGATGCCT GGCAGCAACG TGCCCAGGAT
GGAACGCCCG CGAGGACGGG CTGGTCGCCG CCATCGGCGT TGCTGTCGCC GGCCAATGCC
GAGGCCATGG CATGGGTACG CAAGGTCGGT GTCGAACTCG TTGCGCTGCT CAAGGCGGCG
GGGTTACCTG TACGCCATCA GGTGGGCGAG CCGTGGTGGT GGGTGACGGC CGACCGCAGG
ATCTGCATCT ACGACAACGC GGCAAAAGCG GCGCTGGACG GCGACCCGGT GGACATTCCC
GATCTTGGCG CGCCGCTGAC GGCGGCGCAG AAGAGCCTGC TCGATGCGGC GGGCACCATC
CTTGCGCAAT CTACCGCGGA TCTCGCCGAG GCGGTCCGAA CCGCCGCCGG GGCCGCAGGA
GCGGAGACGC TGCTGCTGGC ATTCCTGCCG ACGGTGCTTG ACCCCGCCAC CCCCGACGCG
CGGCGCGCCA ACTTGCCGGT CGGCTGGGCC AGCCCCGCAT TCGATGTCCT GCAATTGGAG
GACTATGACT GGGTCACGAC GGGCCGGCAG TCGCTGCGCG ACGAAGGCAG GCGCATTGCC
GAGGAACGCC TGGGCTACCC GCGCGACCGC CAGCACTACC TTTCGGGATT TGTCCTGACT
GGAACGAACG CCGCAGTCGA ATGGCCGAGG ATCGATGCGG CAGCAAGCGA AGCCGTCGCG
CTTGGCGTTG CGGAAACCTT CATCTGGGCA CTGCCGCAGG TTTCCCGAGA CGGCTTCGTC
CGGCTTCCCG AGCCAACCGG AGACAACCCC ATGCAATCCT TCGATGACGT CCTGTTCCCC
CTGTCGCTCG GCCGGGATGC CTCCGTCACG CCGGAATTTT CGACGAACGT GACGATCACG
GCTTCGGGTT TCGAGCGACG CAACAGCCTA TGGTCGGACG CGCGACTGCA ATTCGACGTG
GGACCTGGCG TCCGTTCCGA AGCGGAGCTT GGTGAACTGA TCGCCTTTTT CCGCGCCCGG
CGCGGACAGG CCCGCGGGTT CCGCCTGCGC GATCCGTCCG ACTTCAGTTC CAACGGCATG
ACCGGCACAC CCACCCCTAC CGACCAGATC CTCGGGACCG GCGACGGGGC AACAGCGCGC
TTCGCACTGG TCAAGCGTTA TGGCGACAGC GAGGATGCCC AGCGGCGCCG AATCACCCGC
CCGCGCGCCG AAACGCTGCG CGTGAGCATC GACAATGTGG AAACCGGCGA CTTCACGCTG
GCGCCGCTTG GCTACATCAC GCTGGCCAGC GCCCCACCCT CCGGTGCAGT CGTGCGCGCG
GGTTTCCTGT TCGACGTGCC GGTGCGCTTT GCCGAAGACC GCATCGATAT TTCGGGCGCG
GAGTTCGCGG CCGGAGAAGC GCCGAGCGTT CCGCTGGTCG AACTGCGAGA AGACGCGTGA
 
Protein sequence
MSHWLAKRRT VQQTDMIQRF DPRFWTVNFP RPAMASVVTT AADAMRVDVV FQKADDLVGL 
IWESEDRWDH PLLAYETRQD YSRLTLTFRW RSSGIVALNA VNGPTLTIEG RDASGTARAW
YVRLWNYADG SPTDALIRLP FSDLAGGFLL PSEADPVHPA DIDRMFISLV PPGYAPGSDA
AYANAVEGWA ELSDIHCEGH RPMLEIGDVM VPPHGLAICT GYDDAYNLTP ARVLRQVRGL
GYRGSINHYI GMSHFFPLAP DGVGGFVVDG ALPAMNAAAK AWQEAFFTDA CAMGYTVIAS
QSYELLAQHC PDAWQQRAQD GTPARTGWSP PSALLSPANA EAMAWVRKVG VELVALLKAA
GLPVRHQVGE PWWWVTADRR ICIYDNAAKA ALDGDPVDIP DLGAPLTAAQ KSLLDAAGTI
LAQSTADLAE AVRTAAGAAG AETLLLAFLP TVLDPATPDA RRANLPVGWA SPAFDVLQLE
DYDWVTTGRQ SLRDEGRRIA EERLGYPRDR QHYLSGFVLT GTNAAVEWPR IDAAASEAVA
LGVAETFIWA LPQVSRDGFV RLPEPTGDNP MQSFDDVLFP LSLGRDASVT PEFSTNVTIT
ASGFERRNSL WSDARLQFDV GPGVRSEAEL GELIAFFRAR RGQARGFRLR DPSDFSSNGM
TGTPTPTDQI LGTGDGATAR FALVKRYGDS EDAQRRRITR PRAETLRVSI DNVETGDFTL
APLGYITLAS APPSGAVVRA GFLFDVPVRF AEDRIDISGA EFAAGEAPSV PLVELREDA