Gene Information |
Locus tag | Saro_1727 |
Symbol | |
ID | 3916302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1816983 |
End bp | 1820396 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444468 |
Product | glycoside hydrolase family protein |
Protein accession | YP_497001 |
Protein GI | 87199744 |
COG category | |
COG ID | |
TIGRFAM ID | |
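The length fields in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1816983
END_BP = 1820396
STOP_CODON_LENGTH = 3  # assumption: the CDS ends with one stop codon


def gene_length_bp(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length):
    """Residues encoded by an unspliced CDS that ends with a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_length - STOP_CODON_LENGTH) // 3


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 3414 1137
```

Both results agree with the Gene Length (3414 bp) and Protein Length (1137 aa) fields.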
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0544172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCC GCAACGGCCT TTCGGTCGCA ACGCTACTTG CCAGCACCAC CGTGCTCACG CCCGCTGCAC TCGCGGATAC CGCACCTGCG GTGCAGGCCG CCCCGGCGGC ATCGACACTC GACCAGCAGT TCCAGGATCC GCCGATGGAA GCGCGCCCGC GCGTCTGGTG GCACTGGATG AACGGCAACA TCACCAAGGA CGGCATTGCC AAGGACCTTG CGTGGATGAA GCGCGTCGGC ATCGGCGGCC TCCAGAACTT CGACGCGGGG CTCGATACGC CGCAAGTGGT CGACAAGCGT CTCGTCTACA TGACGCCCGA GTGGAAGGAC GCCTTCCGCT TCGCCGCGAT CGAGGCGGAC CGGCTCGGCC TCGAGCTCGC CATCGCCGCA TCCCCAGGCT GGTCGGAAAC CGGCGGCCCG TGGGTCAAGG CCGAGGACGG CCTCAAGAAG GTCGTCTGGA GCGAGACCGT CCTGAAGGGC GGCAAGCGCT TTTCCGGCAA GCTGCCGCTC CCGCCGGCCG AGACCGGGCC TTTCCTCGAC ATGCCGGAAT CCGATCCGCT TGCCGCCATT GTTGGCGATC ATGGCTTCAA GGCGCCCGCA CTATACAGCG ACGTGGCGGT TCTGGCCGTT CCGGATCTGG GCAAGCCACT GCCGGTTCCC GCTTATTCGG CCGGCGGCAA ATCGCTCGAT GCGGCAGCGC TTTCTGATGG CAGTCTCAAG ACCGGCGTGA CGCTGGCGCG CGGTCCGGTC GATGCGCCCA CGACCATCGT CGCCGCCTAT GGCGCGCCCC AGACGGTCCG CTCGGCCACC GTCTCGATCC CGGGCGCCAA GGGCACGTTC TCCGGCCCAA CCGTACAGAT GCACCTCGAA GCCAGCGACG ACGGGCAGGC GTGGCGCAAG GTGGCCGATT TCCCGGTGAC TGCGGCGCCT TCCACGGTCG GCTTCCCGGC CGTCACCGCC CGCCAGTTCC GCCTCGTCCT TGCGCCACTG CCGTTCAACG GGTCGAACAT GGGCGATCCG ATGCCGGGCC TGGCTCCGGT CGGCGGCTTC GTCGAAATGA TGGCCGCTGC CGCCAAGGCC CCCTTCGACG TCAAGCAGTT CCGCCTTTCG TCCGATGTGA TGGTCGACCA GTTCGAGAGC AAGGCCGGCT TTGCTGTCGT GCCCGACTAC TACGCCCTCG GCACGCCGGA CGCCGCCGCG GCGGGCATCG ATCCGAAGCA GGTGATCGAC CTGACCGGGC GCATGAAGCC AGATGGCACG CTCGACTGGA CGGCGCCCAA GGGCAACTGG CGGATCATCC GTCTCGGCTC CAGCCTGCTC GGCACGACCA ACCATCCCGC GCCGCCCGAG GCAACCGGCC TCGAAGTCGA CAAGTTCGAC GGCGATGCGG TTCGCCGTTA CCTTGAACAC TATATCGGCA TGTACAAGGA CGCGGCCGGT TCGGAACTCG TCGGCAAGCG CGGCGTCCGC GCGATCCTGA CCGATTCGAT CGAGGTCGGC GCGGCCAACT GGACCCCGCG CATGGTCGAG CAATTCAAGC GCCTGCGCGG CTACGATCCC ACGCCCTGGC TGCCCACGCT TGCCGGCGTT CTTGTCGGTA CGCGGGCCCA GTCCGACGGG TTCCTGCGCG ACTATCGCCA GACCCTGGCC GATCTCATGG CTTCCGAACA CTATGGCACG GTTGCCAGGG TCGCGCACGA GAACGACCTC AAGGTCTATG GAGAGGCGCT GGAAGACAAT CGTCCGTCGC TTGGCGATGA CATCGCCATG CGCAGCCACG CCGACGTTCC GATGTCGGCG 
CTGTGGACCT TCTCCCGCAA GGCAGGGCCC AATCCCAGCT ACATTGCCGA CATGAAGGGT GCTGCCTCCA CCGCCCACAT CTACGGGCAG AACCTCGTCG CTGCGGAATC GATGACTTCC GCCCTCGCTC CCTGGGCCTA TGCGCCCAAC GAGTTGCGCC GCATCATCGA CCTCGAATTT GCCAGCGGCG TGAACCGCCC GGTCGTTCAC ACCTCGGTGC ACCAGCCCGT CGACGACAAG GTGCCCGGCA TGTCGCTGAT GATCTTCGGG CAGTACTTCA ACCGGCACGA AGCCTGGGCC GAACTGGCGC GGCCCTGGGT CGACTACATG GCCCGTTCGT CGCTGTTGCT TCAGGCCGGC AGGAACGTCG CGGACGTGGC GTACTTCTAT GGCGAGGAAG GGCCGCTCAC CGCGCTCTAC GGCCGCAAGC GCATCGAAGA TGCGCCGAAG GTGCACGCCT ATGACTTCAT CAACCGCGAC GCCCTGTTCG ATGCGGTCGA AGTGCAGGGA GGCGAAGTCG TCGCGAAGGG CGGCGCGCGC TACAAGGCGA TCCAGCTCGG CGGCTCGTCC CGGCGCATGA CCTTGCCGAC GTTGCGTCGC CTCGCGGCGC TGGTCGAAGC TGGCGCGACC GTTGTCGGCA GCCCGCCGGA AGGTTCACCC GCGCTCAACG ACAGCGCGGC CGACTTCTCG GCGCTCGTCG CCAAGCTGTG GAGCGGTCAG CTGGTGACGA GGGTCGGGGC AGGGCAGGTG ATTGCCTCTG CCGACATCGA CCGCGCGCTT GCCGATGCCG GCATTGCGCC GGACCTGGTG CTCGATGGGA CGTCGGAGGA CGGCGAGGTG CTGTTCGTCC ACCGCGCCCT GCCGGACGGC GACGCATGGT TCCTCAACAA CCGCAAGGCA GCTCCGGAAG TCGTCGAGGC CCGTTTCCGC GTAACCGGCA AGCAACCGGA ACTGTGGCAC GCCGATACCG GCGAGACCGA GGCCATCTCC TACCGGATCG AGAATGGCCA GACGGTCGTA CCGCTGACGC TCGATGCCGA GGATTCGGTC TTCGTCGTGT TCCGCAAGCC GGCCTCGGCC AATTCCATGG CGATCAAGAA GGGCGTTCCA GCGACCGTCG CGACGCTCGA TGGTGCGTGG AAGGTCGCCT TCCAGCCCGG TCGCGGCGCC CCTGCATCCA TCGCTCTGCC AAGGCTGGGT TCACTCAGCG ATCAGGCCGA TCCGGGCGTG AAGTACTTCT CGGGTCTTGC CACCTACACC AGCACTTTCA CGCTGCCGAA GGGCGTGAAG CCGGGCCAGC CGCTATGGCT GAACCTCGGC CAGGTCGGTG AAATCGCCGA GGTCAGCGTC AACGGCAAGC ATGCCGGATA CGCCTGGCAC AAGCCCTATC GCGTCAACAT CGGGGCATCG GCGCGCAAGG GCGTCAATAC GCTCGAAGTG AAGGTCGCCA ACCTTTGGGT CAATCGCCTG ATCGGCGATG CTCAGCCCAA TGCGTCCCGG ATCACCTGGA CGGCGCTCCC GACCTATGGC CCCGACGCGC CTTTGCGGCC CTCGGGCCTG ATCGGGCCGG TGACGCTTGA AGCTATTGGG GCGGGGGTGC CGTCGAAGCC CTGA
Protein sequence | MNIRNGLSVA TLLASTTVLT PAALADTAPA VQAAPAASTL DQQFQDPPME ARPRVWWHWM NGNITKDGIA KDLAWMKRVG IGGLQNFDAG LDTPQVVDKR LVYMTPEWKD AFRFAAIEAD RLGLELAIAA SPGWSETGGP WVKAEDGLKK VVWSETVLKG GKRFSGKLPL PPAETGPFLD MPESDPLAAI VGDHGFKAPA LYSDVAVLAV PDLGKPLPVP AYSAGGKSLD AAALSDGSLK TGVTLARGPV DAPTTIVAAY GAPQTVRSAT VSIPGAKGTF SGPTVQMHLE ASDDGQAWRK VADFPVTAAP STVGFPAVTA RQFRLVLAPL PFNGSNMGDP MPGLAPVGGF VEMMAAAAKA PFDVKQFRLS SDVMVDQFES KAGFAVVPDY YALGTPDAAA AGIDPKQVID LTGRMKPDGT LDWTAPKGNW RIIRLGSSLL GTTNHPAPPE ATGLEVDKFD GDAVRRYLEH YIGMYKDAAG SELVGKRGVR AILTDSIEVG AANWTPRMVE QFKRLRGYDP TPWLPTLAGV LVGTRAQSDG FLRDYRQTLA DLMASEHYGT VARVAHENDL KVYGEALEDN RPSLGDDIAM RSHADVPMSA LWTFSRKAGP NPSYIADMKG AASTAHIYGQ NLVAAESMTS ALAPWAYAPN ELRRIIDLEF ASGVNRPVVH TSVHQPVDDK VPGMSLMIFG QYFNRHEAWA ELARPWVDYM ARSSLLLQAG RNVADVAYFY GEEGPLTALY GRKRIEDAPK VHAYDFINRD ALFDAVEVQG GEVVAKGGAR YKAIQLGGSS RRMTLPTLRR LAALVEAGAT VVGSPPEGSP ALNDSAADFS ALVAKLWSGQ LVTRVGAGQV IASADIDRAL ADAGIAPDLV LDGTSEDGEV LFVHRALPDG DAWFLNNRKA APEVVEARFR VTGKQPELWH ADTGETEAIS YRIENGQTVV PLTLDAEDSV FVVFRKPASA NSMAIKKGVP ATVATLDGAW KVAFQPGRGA PASIALPRLG SLSDQADPGV KYFSGLATYT STFTLPKGVK PGQPLWLNLG QVGEIAEVSV NGKHAGYAWH KPYRVNIGAS ARKGVNTLEV KVANLWVNRL IGDAQPNASR ITWTALPTYG PDAPLRPSGL IGPVTLEAIG AGVPSKP
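The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It is an illustration under stated assumptions rather than the database's own procedure: the codon table is built by hand, the names `CODON_TABLE`, `clean`, and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used. Translation table 11 assigns the same amino acids to internal codons as the standard code; it differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence, copied from the record.
first_bases = "ATGAACATCC GCAACGGCCT TTCGGTCGCA"
last_bases = "GCGGGGGTGC CGTCGAAGCC CTGA"

print(translate(first_bases))  # prints: MNIRNGLSVA
print(translate(last_bases))   # prints: AGVPSKP
```

The two outputs match the first block and the final residues of the protein sequence, and the last codon (TGA) is a stop codon.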