Gene Saro_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3931 
Symbol 
ID5077415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp105955 
End bp108180 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content65% 
IMG OID640481037 
Productphage integrase family protein 
Protein accessionYP_001165699 
Protein GI146275538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.117648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCTC TTCAGCTTCT GGCGGAGGGC GGCGATGAAC GTAATGCCCT CCCGGTGCTC 
CTGTCCGCAC CGCTGCGCAC CGGCTTCGAT CGCGCGCATA TCTCGCGATA CGGCGATCCG
GTCTGGGATC TGGCTCCCGG CGTGTTTCGC GACAACGCCC GGCGCTGCCA TATCACGGTA
CATTTTGACG GTATCGACGA CCCATCCATC GCCGATGCCC TGCGCCAGAT TCTTCATGCG
CGTCTCAATG TCGATCTACC CGGCCACCGT TCACGGCTTG AGCCGGCGGG GGTGCGCGGC
GAGGCCAACC GGACCTTGCG CTTTTTCGAC TTCGTGAAGG CAGAGCTGGG TCGTTTCGAT
CTCGGCCGGG TCGATCAGCC TTTGGCCGAT CGGTACGCCC GCTCTTTGCG CCTTGCCGGG
CTGCGTCCGG CTGCAGTCGC TACCCGGCTG CGCGTGATCT TCGACCTGCA CGAGTTGCGC
CACCATCTGA CGACCGCACG CCTATCCTTT GAGCCATGGC CCGGCCGCAG CCCGTTTTCG
GTGGCGGGAG CAAAGCATAT CGCCGGAGAG AACCGGACGC CGCGCATTCC GGAAGCGATC
ATCACCCCGC TGCTCGCCTG GTCGCTGCGC TACGTGACCT GCTATGCAGA TGACATCCTC
GCCGCGCGGG CGGAACTCGA TCGCCTCGAA GCAACGCGCG ACCGTCTGGT TGCCGCTGAG
GCGGGTCTTG ATCATGCGGA TCGACGGTCG CGGCAACGCC AACGCCTCAA CACCTATATT
GCCGCCTTGC GGCGACAACG GCGCGGGATT CCGATCTGGA CCACCCCACA CAACGGCACA
ACGCGGACAG ATCTGCAAAG CAGCGAAGTC ACGCCGCCGA TCAACTACCA CCTGATCCAC
CTCCATGCCG GCATCGATGC GCAGGCCGAA CCTGCCATGC ACCTTGGCCT CACCACCGGC
GCGCCGGACC TCATTGCCGC CGTCATCGCA GAACTCGGCA CCGAGGTCGG CGGGATGGAT
ACGCCCATTT CGGCCGATCC CGATACTGGC CTGCCCTGGC GCACCCGCTT CGATGCCAAG
GTCCTGGCTC TCGAAGAGGT CATGCTGCAG TCGGCAGCCT ACGTCGTTTG TGCCTATCTC
TCAGGCATGC GGGACAGCGA GATCCAGGCG ATGAAGCGGG GATGCCTGTC CGTCACCAGA
TCAGAAGACG GTGCGATCTT GCGGCATCGC ATCAAGTCCA CCGCTTACAA GGGCAAGCGC
GGAGGCGGCG AGGAGACCGA GTGGGTCACG ATCGCACCAG TTGCCGAAGC CATCGCCGTC
CTCGAACGGC TTTCCGCGCG GGCGAGCCTA GCGCGCGGGA CAACGACTTT GTGGCCGGTC
CTCGCGCTTC GGGTAAACAC CAAGACGCAT GTTTCTGCGG AGATCGTCCG GCAACTCAAC
CGATTTCGCG ATCATCTCAA CGACCGGTTC GGAACGGCGC AGGCACCGAT CATTCCTGCC
GGCCCCAATG GCGCGCCCTG GCGTCTGACC ACACGCCAGT TTCGCCGGAC CATTGCCTGG
CACATCGCCA ACCGGCCCTT TGGAACGATT GCCGGCATGA TCCAGTACAA GCACGCCAGC
GTTGCGGCGT TCGAAGGCTA TGCCGGCAGC AGCCGGTCCG GATTTCGGGG AGAGATCGAA
GCCCAGCGCG CGCTTGGCCA GATCGACGAT ATCCTCGTCT ACTTCGACGA TCGGCAAGGT
GGTGCGCGCC TTGGCGGACC TGCCGCGAAC AGGATCGGAG CCGCACTCGA TACTGCCGCC
CACGAGTTGG CGCCGCTACC CGCCATGATT GCCGACCGGC CACGCCTACG GACGATGCTC
GGCAGTCTCG CGCGGACATT GCATGTCGGC CCGCTGGCCG ATTGCTTCTT TGATCCGGCA
ACTGCCCTGT GCCTCAACCG CATTTCGGAA CCGGGTGCGA CCGGCCCAAT GATCGCCATG
TGCGAGCCGG TCCGCTGCCC CAATGCCTGT ATTGCCGAAC GGCATCGCCC GGCCTGGCAG
CGCGGAGCTG ACGAGGCGCG GCTGTTGCTG CGCGAGAAGC GGCTTCCAGA ACCGCAGCGG
GTAACGCTTC AGGCCGAGGT GGCAAGGATT GAGCGTGTCC TCGAACAGAT CGCGCCCAGT
GCCGCCACAC CTAGGACCGG TGTGGCAGGA GAGGGAGAGG AGGCCGGTTT TCGGGTGACG
GGCTGA
 
Protein sequence
MTALQLLAEG GDERNALPVL LSAPLRTGFD RAHISRYGDP VWDLAPGVFR DNARRCHITV 
HFDGIDDPSI ADALRQILHA RLNVDLPGHR SRLEPAGVRG EANRTLRFFD FVKAELGRFD
LGRVDQPLAD RYARSLRLAG LRPAAVATRL RVIFDLHELR HHLTTARLSF EPWPGRSPFS
VAGAKHIAGE NRTPRIPEAI ITPLLAWSLR YVTCYADDIL AARAELDRLE ATRDRLVAAE
AGLDHADRRS RQRQRLNTYI AALRRQRRGI PIWTTPHNGT TRTDLQSSEV TPPINYHLIH
LHAGIDAQAE PAMHLGLTTG APDLIAAVIA ELGTEVGGMD TPISADPDTG LPWRTRFDAK
VLALEEVMLQ SAAYVVCAYL SGMRDSEIQA MKRGCLSVTR SEDGAILRHR IKSTAYKGKR
GGGEETEWVT IAPVAEAIAV LERLSARASL ARGTTTLWPV LALRVNTKTH VSAEIVRQLN
RFRDHLNDRF GTAQAPIIPA GPNGAPWRLT TRQFRRTIAW HIANRPFGTI AGMIQYKHAS
VAAFEGYAGS SRSGFRGEIE AQRALGQIDD ILVYFDDRQG GARLGGPAAN RIGAALDTAA
HELAPLPAMI ADRPRLRTML GSLARTLHVG PLADCFFDPA TALCLNRISE PGATGPMIAM
CEPVRCPNAC IAERHRPAWQ RGADEARLLL REKRLPEPQR VTLQAEVARI ERVLEQIAPS
AATPRTGVAG EGEEAGFRVT G