Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3931 |
Symbol | |
ID | 5077415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 105955 |
End bp | 108180 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481037 |
Product | phage integrase family protein |
Protein accession | YP_001165699 |
Protein GI | 146275538 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.117648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCTC TTCAGCTTCT GGCGGAGGGC GGCGATGAAC GTAATGCCCT CCCGGTGCTC CTGTCCGCAC CGCTGCGCAC CGGCTTCGAT CGCGCGCATA TCTCGCGATA CGGCGATCCG GTCTGGGATC TGGCTCCCGG CGTGTTTCGC GACAACGCCC GGCGCTGCCA TATCACGGTA CATTTTGACG GTATCGACGA CCCATCCATC GCCGATGCCC TGCGCCAGAT TCTTCATGCG CGTCTCAATG TCGATCTACC CGGCCACCGT TCACGGCTTG AGCCGGCGGG GGTGCGCGGC GAGGCCAACC GGACCTTGCG CTTTTTCGAC TTCGTGAAGG CAGAGCTGGG TCGTTTCGAT CTCGGCCGGG TCGATCAGCC TTTGGCCGAT CGGTACGCCC GCTCTTTGCG CCTTGCCGGG CTGCGTCCGG CTGCAGTCGC TACCCGGCTG CGCGTGATCT TCGACCTGCA CGAGTTGCGC CACCATCTGA CGACCGCACG CCTATCCTTT GAGCCATGGC CCGGCCGCAG CCCGTTTTCG GTGGCGGGAG CAAAGCATAT CGCCGGAGAG AACCGGACGC CGCGCATTCC GGAAGCGATC ATCACCCCGC TGCTCGCCTG GTCGCTGCGC TACGTGACCT GCTATGCAGA TGACATCCTC GCCGCGCGGG CGGAACTCGA TCGCCTCGAA GCAACGCGCG ACCGTCTGGT TGCCGCTGAG GCGGGTCTTG ATCATGCGGA TCGACGGTCG CGGCAACGCC AACGCCTCAA CACCTATATT GCCGCCTTGC GGCGACAACG GCGCGGGATT CCGATCTGGA CCACCCCACA CAACGGCACA ACGCGGACAG ATCTGCAAAG CAGCGAAGTC ACGCCGCCGA TCAACTACCA CCTGATCCAC CTCCATGCCG GCATCGATGC GCAGGCCGAA CCTGCCATGC ACCTTGGCCT CACCACCGGC GCGCCGGACC TCATTGCCGC CGTCATCGCA GAACTCGGCA CCGAGGTCGG CGGGATGGAT ACGCCCATTT CGGCCGATCC CGATACTGGC CTGCCCTGGC GCACCCGCTT CGATGCCAAG GTCCTGGCTC TCGAAGAGGT CATGCTGCAG TCGGCAGCCT ACGTCGTTTG TGCCTATCTC TCAGGCATGC GGGACAGCGA GATCCAGGCG ATGAAGCGGG GATGCCTGTC CGTCACCAGA TCAGAAGACG GTGCGATCTT GCGGCATCGC ATCAAGTCCA CCGCTTACAA GGGCAAGCGC GGAGGCGGCG AGGAGACCGA GTGGGTCACG ATCGCACCAG TTGCCGAAGC CATCGCCGTC CTCGAACGGC TTTCCGCGCG GGCGAGCCTA GCGCGCGGGA CAACGACTTT GTGGCCGGTC CTCGCGCTTC GGGTAAACAC CAAGACGCAT GTTTCTGCGG AGATCGTCCG GCAACTCAAC CGATTTCGCG ATCATCTCAA CGACCGGTTC GGAACGGCGC AGGCACCGAT CATTCCTGCC GGCCCCAATG GCGCGCCCTG GCGTCTGACC ACACGCCAGT TTCGCCGGAC CATTGCCTGG CACATCGCCA ACCGGCCCTT TGGAACGATT GCCGGCATGA TCCAGTACAA GCACGCCAGC GTTGCGGCGT TCGAAGGCTA TGCCGGCAGC AGCCGGTCCG GATTTCGGGG AGAGATCGAA GCCCAGCGCG CGCTTGGCCA GATCGACGAT ATCCTCGTCT ACTTCGACGA TCGGCAAGGT GGTGCGCGCC TTGGCGGACC TGCCGCGAAC AGGATCGGAG CCGCACTCGA TACTGCCGCC CACGAGTTGG CGCCGCTACC CGCCATGATT GCCGACCGGC CACGCCTACG GACGATGCTC GGCAGTCTCG CGCGGACATT GCATGTCGGC CCGCTGGCCG ATTGCTTCTT TGATCCGGCA ACTGCCCTGT GCCTCAACCG CATTTCGGAA CCGGGTGCGA CCGGCCCAAT GATCGCCATG TGCGAGCCGG TCCGCTGCCC CAATGCCTGT ATTGCCGAAC GGCATCGCCC GGCCTGGCAG CGCGGAGCTG ACGAGGCGCG GCTGTTGCTG CGCGAGAAGC GGCTTCCAGA ACCGCAGCGG GTAACGCTTC AGGCCGAGGT GGCAAGGATT GAGCGTGTCC TCGAACAGAT CGCGCCCAGT GCCGCCACAC CTAGGACCGG TGTGGCAGGA GAGGGAGAGG AGGCCGGTTT TCGGGTGACG GGCTGA
|
Protein sequence | MTALQLLAEG GDERNALPVL LSAPLRTGFD RAHISRYGDP VWDLAPGVFR DNARRCHITV HFDGIDDPSI ADALRQILHA RLNVDLPGHR SRLEPAGVRG EANRTLRFFD FVKAELGRFD LGRVDQPLAD RYARSLRLAG LRPAAVATRL RVIFDLHELR HHLTTARLSF EPWPGRSPFS VAGAKHIAGE NRTPRIPEAI ITPLLAWSLR YVTCYADDIL AARAELDRLE ATRDRLVAAE AGLDHADRRS RQRQRLNTYI AALRRQRRGI PIWTTPHNGT TRTDLQSSEV TPPINYHLIH LHAGIDAQAE PAMHLGLTTG APDLIAAVIA ELGTEVGGMD TPISADPDTG LPWRTRFDAK VLALEEVMLQ SAAYVVCAYL SGMRDSEIQA MKRGCLSVTR SEDGAILRHR IKSTAYKGKR GGGEETEWVT IAPVAEAIAV LERLSARASL ARGTTTLWPV LALRVNTKTH VSAEIVRQLN RFRDHLNDRF GTAQAPIIPA GPNGAPWRLT TRQFRRTIAW HIANRPFGTI AGMIQYKHAS VAAFEGYAGS SRSGFRGEIE AQRALGQIDD ILVYFDDRQG GARLGGPAAN RIGAALDTAA HELAPLPAMI ADRPRLRTML GSLARTLHVG PLADCFFDPA TALCLNRISE PGATGPMIAM CEPVRCPNAC IAERHRPAWQ RGADEARLLL REKRLPEPQR VTLQAEVARI ERVLEQIAPS AATPRTGVAG EGEEAGFRVT G
|
| |