Gene Saro_3921 details


Gene Information

Locus tag: Saro_3921
Symbol:
ID: 5077405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 88603
End bp: 92880
Gene Length: 4278 bp
Protein Length: 1425 aa
Translation table: 11
GC content: 66%
IMG OID: 640481028
Product: hypothetical protein
Protein accession: YP_001165690
Protein GI: 146275529
COG category:
COG ID:
TIGRFAM ID:
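
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the terminal stop codon counts toward the gene length but not the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 88603
end_bp = 92880

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 4278 1425
```

Under these assumptions the results agree with the reported gene length of 4278 bp and protein length of 1425 aa.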


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCCGCC GGTCGGCAGC AGACCTCACA ATTCTCCCGG AGACCTCCAT GACCGCTACG 
ATCTCAGGCA GCAGCGCCAT TCTCGGCGCT GCGCGAACGC TTGCTGCCAA GCTCCTGCTC
GCCATTCCCA TCGATCGAGC GGCTCTCACG GAGGCCATGA CCGAGAGCTC GGGCGGCAAC
GATGCTGCGG GAGCCTGGAC CCAGCGCGAG AGTTTCGAGG CGCTTGAGGT GGCGCTGGCC
ATGGCAATTC CGGAACTGGT CGGCCGAATG GACGCCGGTG CTGCCATCGG CACGCTCGAG
GCGCTTGCTC GGGAATTACC GACCCACACG GTGCGCAGCG AAGACCAGAT CGCTTTCCAG
CAGTTTTCTA CCCCGCCGGC GCTGGCGTGC CTTGCCGTCC ATCTCGCGCG CCTCGGGTCC
GAAGATATCT TCCTCGAACC GAGCGCCGGC ACCGGGATCA TCGCTGCGCT GGCAAGAGGC
GCGGTGAAGC AGTTCCTTCT CAACGAACTC GAACCTACCC GCGCTGAGCT GCTCGAAGGC
CTATTTCCGG GCTCTTCAGT CTATCGCCAC GACGGCGCCA AGATTGCCGC GCTGCTCGCC
GGGACCGCTC GGCCGAGCGT GATCGTGATG AACCCTCCGT TCTCGGTGTC GCAGTCGCGC
GGCGAAGATC AGAACACAGC GGCACGGCAC CTGCGCGCAG CACTCGACCA TCTGTTGCCC
GGCGGACGGG TGGTGGCGAT CATGCCGGAC TGGTTTGCTG AATCCGCACG AGGTGGCGAA
GTCTATCGCC GTACTCTGGA AGGCGCACGG GTCGTCATGT CGCTGCGGCT CGACAAGGGC
GGCTACGCCA AGCACGGCAC CGGCATTGCG GTGCGGTTGC TGGTAATCGA CAAGGTTCCG
GGCGAGACCT CGGTTTCGAC GATCAATCGA GCGTCCGTAA GCGAGCTGTT CGCGGCGCTA
GGACCTATTC CTCCACGCGC TGCGCTGCGC GTGGCCAGTC CGGCTGCCGT GGTTCGGCCC
AAACTCAGCT TGTTCCGATC GGTGAAAAGC GGCCCGGCGC GTCCGGTGAT CGTGCGGGCA
CCGCAGACCA ACGAGGTCAG GCCTGTCGCC TACCAAGTGC TCGAGGAGGC CGCGGCAATG
GGCGAACAAC GCGGGGTCTA TGCCGACTAT CGCCCCTCGC GCGTCGTCAT CCCCGAAGCA
GGTGAGCATC CGTCCCATCT GGTCGAATCC GCCGCCATGG CGTCAATCGC CGCGCCCAAG
CCCGGCTACA TTCCGTGCCT GCCCGAGCGG ACGGTGACCG CGCGGTTGCT CTCGGCGGCC
CAGCTCGAGA CCGTGATCTA CGCGGGCGAA GCCTGGAGCC GCGATCTCCA CGGCCGCTTC
AGCCAAACTG CTGGCGAGGT GGCGCTCAAG GAAGACCCCG AGGGTCAGCT CTACCGCACC
GGCTTCTTCC TAGGCGACGG GACCGGAGCG GGGAAGGGGA GGCAGGCGGC GGCCTGCATT
CTCGACCAAT GGCTGAAGGG CAATCGTCGG CACATCTGGA TCTCGAAGAA CGCACCGTTG
CTCGAGGATG CGCAGCGCGA CTGGACCGCG ATCGGCGGGC TGCCATCGGA CATCATTGAT
CTCGCGCGCT GGAAGATCGG CGAGGAAATC ACCGCGCCCG AGGGCATCCT GTTCGTCCCC
TATGGAACCC TTCGCTCTGC CCGTGTCGAG GATACCCGGC TCGACCAGAT TGTGCGCTGG
GCCAGTCCCG CATTCGAAGG CGTGATCGTG TTCGACGAAG CCCATGAAAT GGGCGGTGTC
GCTGGCGGGG AGGGGGCTCT CGGTCAGAAA CAGGGTTCGC TCCAGGGGAT CGCTGGAGTG
CTGCTGCAGA ACACCCTGCC GCGCGCCCGC GTGCTTTACG CCTCGGCCAC CGGTGCTTCG
GACGTCAACA ACCTCGCCTA TGCGGTGCGG CTTGGCCTGT GGGGACCGGG CACGGCCTTT
GCGAACCGCG AGCAGTTCAT CTCCGAAATC CGCGACGGCG GCATTGCGGC GATGGAACTG
GTGGCGCGCG ACCTCAAGGC GTCGGGGCTC TATCTCGCCC GCGCGCTCAG CTTCGCCGGG
ATCGAGTACG ACATCCTGCG CCACGACCTC AGCATCGACC AGATCGCGAT CTATGACACC
TATTGCGAGG CCTGGACGAT CATCCACCAG AACCTTGAAG CCGCGCTTGA ACTGACGGGC
GTGGTCGACG GCCTCGAGAA CCGCACGCTC AACAGCGGCG CCAAGGCGGC GGCGCGCAGT
CGGTTCGAAG GCACCAAGCA GCGCTTCTTC GCGCAGGTGC TGCTCTCACT GAAGCTGCCT
TCGATCTTCC CGGCGATCGA CGAACACCTC GCCCAGGATG AAAGCGTGGT GGTGCAGCTG
GTCAGCACGG CGGAATCGAT CCTCAATCGG CGGCTTAATG AGCTCGATCC CGAGGAGCGC
GAGGCGCTGG AATTGGACTT GTCGCCCCGC GAGGCCATCG TGGACTACCT CACCCGCGCT
TTCCCGACCC GGCAGATGGA GGAATACGTC GACGAACTCG GCGACGTTCG CTCTCGGCCA
ATGTGGGACC AAGCCGGCAA CCCGGTCCAC AACCCGCAGG CCGAGGCCGC TCGCGAGCAG
CTGATCGAGC ATATCTGCGC GATGCCGCCG ATCCCGACCG CGCTCGACGC TTTGCTAGAG
CACTACGGGG TCAGCGCGGT GGCCGAAGTT ACTGGGCGCT CCAAACGCCT GGTCCGCGAT
GGCAGCGGCC AGCAGCGCCT CGAAAGCCGC TCGCCGCGCA CCAACCTTGC CGAAACCACC
GCGTTCATGA CGGGGGCAAA GCGTATCCTG GTGTTCTCCG ATGCCGGCGG AACGGGCCGC
AGCTACCATG CGAGCCTCGA TGCCAAGAAC CAGCAGCGCC GCGTGCATTT TCTGCTTGAG
CCGGGCTGGC GAGCCGACCG CGCGATCCAG GGGCTTGGAC GAACGCACCG CACCCACCAG
GCCTGCCCGC CGCTGTTTCG GCCGGTCACC ACCGACTGCA AAGGCGAAGC CCGGTTCACC
AGCACGATCG CGCGGCGGCT CGATGCGCTC GGTGCGCTGA CGCGCGGCCA GCGCCAGACC
GGCGGGCAGG GGATGTTCGA TGCCGCAGAC AACCTCGAGA GCGCCTACGC CAAGCACGCG
CTGCACGACT GGTACGGGCT GCTGGCCACC GGCAAACTCA AGAGCACGTC GCTGAAAGAA
TTCCAGGCCA TGAGCGGGCT TGAACTCACC GACCAAGACG GAGTGCTGCG CGAGGACCTG
CCGCCGATCC AGCGCTGGCT CAATCGCATC CTGGCAATGA AGATCGCCGT CCAGAATGCC
ATCTTCGACG AGTTCTTGAC TCTGGTGGAG ACGCGGGTGT CGGCGGCCAA GGAGGCCGGG
ACTTTCGATA TCGGCGTCGA GACCGTCGCG GCCGAGACCT GCGAGGTGCT CTCCGACACG
GTGATCCGTA CCGATCCGGT GACGGGGGCG ACCTCGCACC TCCTCGAACT GTCGCTGACC
CAGCGGCGTA AGCTGCTCGC GCTCGAGCGC GTGCTGAAAA TGGCCTCATA CGAGAACAAG
CCGCTGTTTC TGCGCAACGC CAAGTCGGGC AAGGTCGCGC TCGCGATCCC TGCGCCGTCG
CACATGGACG AGGAGGGCGA GCTTATCCGC CGGTTCGAAC TCGTCCGCCC GTTGCGCAGC
GAATACATCC TTGCGGGCAG GCTGGACGAG ACCGCCTGGG AGCCTGTCGC CCAGACCAAG
TTCAGCGCGC TGTGGGAAGC AGAATACGCC GCTGACGAGA ACCAGCTGGT GACCGAGACG
GTGTTTCTTG CGACCGGGCT ATTGCTGCCG ATCTGGGGCG CGCTTCCTAA AGAGGACCTG
ACGGTCAACC GCATCGTCGA CAAGTCCGGC GCCTCGTGGC TCGGTCGGCA CGTCCATGAC
CTCTACGTCG ATGCGACGCT CGAGAAGCTT GGCGTCTCGC GCAAAGCGCA GACTGACCCG
GCCAAGATCG CCCAAGCCAT TCTCGGTGGT GGCACGTGGA AGGCGCCGCA TCCGCTGAAC
TTCACCGTTC GCACGTCCCG GGTGAACGGG GCGCGCCGGA TCGAGATCGT CGGCGCCGAG
GCGGCCCGCA TCCCCGAGCT CAAGGCTATG GGCTGCTTCA CCGAGATCAT CGCCTACAAA
ACCCGGGTGT TCGTGCCAAC CGACAAGGCC GAGGCCATTC TGCAGGCGAT GACGGGCCCA
GGGACTACGC CTGATTGA
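
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is supplied as text in the same space-separated 10-base blocks. The function name `gc_content` is hypothetical, and only the first sequence line is used here as a small example.

```python
def gc_content(seq_text: str) -> float:
    """Return the G+C fraction of a sequence, ignoring whitespace."""
    # Drop the spaces and newlines of the block layout.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First block-formatted line of the gene sequence, used as a small example.
first_line = "ATGTGCCGCC GGTCGGCAGC AGACCTCACA ATTCTCCCGG AGACCTCCAT GACCGCTACG"
print(f"{gc_content(first_line):.2f}")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the reported GC content.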
 
Protein sequence
MCRRSAADLT ILPETSMTAT ISGSSAILGA ARTLAAKLLL AIPIDRAALT EAMTESSGGN 
DAAGAWTQRE SFEALEVALA MAIPELVGRM DAGAAIGTLE ALARELPTHT VRSEDQIAFQ
QFSTPPALAC LAVHLARLGS EDIFLEPSAG TGIIAALARG AVKQFLLNEL EPTRAELLEG
LFPGSSVYRH DGAKIAALLA GTARPSVIVM NPPFSVSQSR GEDQNTAARH LRAALDHLLP
GGRVVAIMPD WFAESARGGE VYRRTLEGAR VVMSLRLDKG GYAKHGTGIA VRLLVIDKVP
GETSVSTINR ASVSELFAAL GPIPPRAALR VASPAAVVRP KLSLFRSVKS GPARPVIVRA
PQTNEVRPVA YQVLEEAAAM GEQRGVYADY RPSRVVIPEA GEHPSHLVES AAMASIAAPK
PGYIPCLPER TVTARLLSAA QLETVIYAGE AWSRDLHGRF SQTAGEVALK EDPEGQLYRT
GFFLGDGTGA GKGRQAAACI LDQWLKGNRR HIWISKNAPL LEDAQRDWTA IGGLPSDIID
LARWKIGEEI TAPEGILFVP YGTLRSARVE DTRLDQIVRW ASPAFEGVIV FDEAHEMGGV
AGGEGALGQK QGSLQGIAGV LLQNTLPRAR VLYASATGAS DVNNLAYAVR LGLWGPGTAF
ANREQFISEI RDGGIAAMEL VARDLKASGL YLARALSFAG IEYDILRHDL SIDQIAIYDT
YCEAWTIIHQ NLEAALELTG VVDGLENRTL NSGAKAAARS RFEGTKQRFF AQVLLSLKLP
SIFPAIDEHL AQDESVVVQL VSTAESILNR RLNELDPEER EALELDLSPR EAIVDYLTRA
FPTRQMEEYV DELGDVRSRP MWDQAGNPVH NPQAEAAREQ LIEHICAMPP IPTALDALLE
HYGVSAVAEV TGRSKRLVRD GSGQQRLESR SPRTNLAETT AFMTGAKRIL VFSDAGGTGR
SYHASLDAKN QQRRVHFLLE PGWRADRAIQ GLGRTHRTHQ ACPPLFRPVT TDCKGEARFT
STIARRLDAL GALTRGQRQT GGQGMFDAAD NLESAYAKHA LHDWYGLLAT GKLKSTSLKE
FQAMSGLELT DQDGVLREDL PPIQRWLNRI LAMKIAVQNA IFDEFLTLVE TRVSAAKEAG
TFDIGVETVA AETCEVLSDT VIRTDPVTGA TSHLLELSLT QRRKLLALER VLKMASYENK
PLFLRNAKSG KVALAIPAPS HMDEEGELIR RFELVRPLRS EYILAGRLDE TAWEPVAQTK
FSALWEAEYA ADENQLVTET VFLATGLLLP IWGALPKEDL TVNRIVDKSG ASWLGRHVHD
LYVDATLEKL GVSRKAQTDP AKIAQAILGG GTWKAPHPLN FTVRTSRVNG ARRIEIVGAE
AARIPELKAM GCFTEIIAYK TRVFVPTDKA EAILQAMTGP GTTPD
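
The protein sequence can be checked against the gene sequence by translating codons. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the two tables differ only in permitted start codons; this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    # Drop the spaces and newlines of the block layout.
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGTGCCGCC GGTCGGCAGC AGACCTCACA"))  # MCRRSAADLT
print(translate("GGGACTACGC CTGATTGA"))               # GTTPD
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above, ending at the TGA stop codon.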