Gene Saro_3948 details

Gene Information

Locus tag: Saro_3948
Symbol:
ID: 5077432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 123039
End bp: 125348
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 55%
IMG OID: 640481054
Product: hypothetical protein
Protein accession: YP_001165716
Protein GI: 146275555
COG category:
COG ID:
TIGRFAM ID: [TIGR03187] DGQHR domain
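
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 123039
END_BP = 125348
GENE_LENGTH_BP = 2310
PROTEIN_LENGTH_AA = 769


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the listed gene length (2310 bp) matches the coordinate span, and 2310 / 3 = 770 codons gives 769 amino acids plus one stop codon.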


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0773653
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCTGATG ACGTAATGGG CCCGACCGTC ACGGGGGATG ATCTGGATTC TGAGCTGCGT 
CAGCGCAAAA GCAAGGACAT CTTTCACACA GTTACGGGTT CAACGCGCAA GGTCATTGCC
GAGAAAGTTG CCCTTGAAGA GCAAGACGGG TGGCGCGTCG CAAAGAAAAA CAAGAAGTCG
ACGAGGCTTG CGAAGGCCAA GCCTGCGCAC GAACAGCTCG AAGATGAGGT TTGGTGCCTC
CTCGCTCAAA TGGGATTCCA GGAGCTGAAC AAGGGCAGGC TCTTCACGAT CGCAGTGGAG
GATGGCCTGA ACCCTCGGCA GATCGATGTA TTTGCCAAGG ACGATGAGAC GGTCATCATT
GTTGAATGTC GGCAGAAAGC GACAGTCGGG CGGAAGCCAA TGGCGGATCT TATCGAAAAG
ATCCGTGCGC TGCGCGAGGC CGCGCAAAAA AGTATCAAAC TCCATTACGG CAGTCAGAAA
AAGCTGAAGG TCAAATTCGC CATTGCCACC AGAAACATCA TCTGGGGCGA GGCTGACCTG
GAGAAATGTA AAGAGTACCA GATCGCTGTG ATATCGGACC AGCTGCTCGA TTATTACAAG
CAACTGACAC AGCACCTCAA AATGGCCGCG CGCTTCCAGT TCCTCGCGCA CATGTTTGAG
GGACAGCGGG TCGATGGGCT CGCGCAAACC GTAGTTGCGA CCCGAGGCAA AATGGGTGGA
AGGCCGTTCT ACACGTTCCT GATCCGGCCC GAAGAGCTGA TGAAGATCGC CTATGTTGGC
CACAAGGGTA GCCGCGACAT CGAAAACCTC GAAACCTACC AGCGAATGCT CCAGTCCGAC
CGACTGAAGG GAATTGCGAA GTACATCAAT GAAGGCGGCA AATTCCCGAC TAACATCGTC
GTGAACCTCA AGCTCCCCGG TAAGAAGGAA CCACAGTTCG ACAAGAAGGA GACCGTCGGT
GAGGAGATAC TCGGTTTCTT GCACCTGCCC CCCATCTATG CCTCAGCATG GGTGATTGAT
GGGCAGCACC GCCTCTATGG CTATGCCTAT GCGCGTGAGA ACGGAGGCTT CAAGAGCGAC
GAGACCGTTC TACCCGTGCT GGCATACGTC AATCTTCCCG CCGATGAAGA GATGGACCTG
TTCATCGACA TCAACAGCAA GCAGGTGAAG GTGAAAACCG GGTTGCTGGT CGAACTCTAT
TCGGATCTGC ACTGGAAGTC CGACGATGTC GAAGAGGCCT TCCAGGCCCT GCTGTCGCGG
ATCGCCTACC GGCTGAACAA GGACAAGGCT TCTCCACTCT TTGACCGCAT GGTTGTCTCC
GGCACCAGGA AAACGAATGT GCGGTGTCTT ACGCAGACGT CGATACGGGA CGGCCTCAAG
GTTGCCCGGC TGATTGGCAG TCCCCTTAAG GGCATGATCG TGCCCGGTCC CCTCTCTACG
GGTGATCCGC TCAACTACGA CGCCAACCTC AAGAAGAGCC TTTCGGTTCT GACTGAGTGT
CTCGCGCTTT TCGCAAACAT TTTGCCCAAC CAATGGGCGG CCGGCGACAG CCCTACTGGC
TATGTATGCA CCAACAACGG TCTTCGCGCC CTGTTCTTGG TGATCCAGGA TGTCGCAGAG
CATGTCCGTC AAAATTCGGG CATAGACCTC GCGCTACTCA ATGCAGATGA AACGTTCAAA
GAACTGGAGC CGTATCTTAC AGCCCTCGCA GATCAACTCG CATCGGTAGC GCCGAACGAT
ATTCAGGCAT TTCGCAAGAT CGGATCCTCA CTGACAGCTG TGAAGCAGCA GTCGTTTGGC
ATGGAAGCCT ACATTCAGGC GAAACTCTCT GATTTCCGCC CGCTGGGGCT CCAGGAATAC
CTGGCCTCGC GCGATGCAGC TGGCGCCGAT GCGGCGGCGG CGAAGGTGAC CCAAATCCAC
AAGAAGCTGT TCAACTACGT CATCGAGACT CTGAAAGATC ACTTTGGCCG GGATCACAAA
GCGTGGTGGA CCCAGGGCGT ACCGCTTACC ATTCGCCTTT CGTGTACCCA GGAATGGGAA
AAGAAGAATC GTGAAGGCGA CGAGGAGTCT CATCTCTACC TCATCAACTA TCAGGATATC
GCCGTCGCCA ACTGGGATCT GTTCCGGGAC ACCCTGTCCC TGGGTTATAA GGATCCGGAC
AACAAGAAAG AGAGCACCAA GTGGATTAAA GTGCTCAACG ATATCCGCCA ATATACGGCT
CACCCTGAAA AAGGCCTGCT CAGCAAGGAA CAGGTCTCAT TCGTGAATGA GGTTTACGAG
AAGGTCGAGC ATCATATTCC CGCCCGGTAG
 
Protein sequence
MADDVMGPTV TGDDLDSELR QRKSKDIFHT VTGSTRKVIA EKVALEEQDG WRVAKKNKKS 
TRLAKAKPAH EQLEDEVWCL LAQMGFQELN KGRLFTIAVE DGLNPRQIDV FAKDDETVII
VECRQKATVG RKPMADLIEK IRALREAAQK SIKLHYGSQK KLKVKFAIAT RNIIWGEADL
EKCKEYQIAV ISDQLLDYYK QLTQHLKMAA RFQFLAHMFE GQRVDGLAQT VVATRGKMGG
RPFYTFLIRP EELMKIAYVG HKGSRDIENL ETYQRMLQSD RLKGIAKYIN EGGKFPTNIV
VNLKLPGKKE PQFDKKETVG EEILGFLHLP PIYASAWVID GQHRLYGYAY ARENGGFKSD
ETVLPVLAYV NLPADEEMDL FIDINSKQVK VKTGLLVELY SDLHWKSDDV EEAFQALLSR
IAYRLNKDKA SPLFDRMVVS GTRKTNVRCL TQTSIRDGLK VARLIGSPLK GMIVPGPLST
GDPLNYDANL KKSLSVLTEC LALFANILPN QWAAGDSPTG YVCTNNGLRA LFLVIQDVAE
HVRQNSGIDL ALLNADETFK ELEPYLTALA DQLASVAPND IQAFRKIGSS LTAVKQQSFG
MEAYIQAKLS DFRPLGLQEY LASRDAAGAD AAAAKVTQIH KKLFNYVIET LKDHFGRDHK
AWWTQGVPLT IRLSCTQEWE KKNREGDEES HLYLINYQDI AVANWDLFRD TLSLGYKDPD
NKKESTKWIK VLNDIRQYTA HPEKGLLSKE QVSFVNEVYE KVEHHIPAR
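
Both sequences above are printed in space-separated blocks of 10 residues, six blocks per line, so the whitespace has to be stripped before the text is passed to other tools. The sketch below shows one way to do that and to compute a GC fraction from the result. The helper names are hypothetical, and the method used to obtain the GC content value listed in this record is not stated, so a recomputed figure may differ slightly in rounding.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-residue blocks form one continuous string."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCTGATG ACGTAATGGG CCCGACCGTC ACGGGGGATG ATCTGGATTC TGAGCTGCGT"
seq = join_blocks(first_line)
print(len(seq))  # 60: six blocks of 10 bases
```

Passing the full gene sequence block through `join_blocks` should give a string whose length can be compared against the Gene Length field.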