Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1883 |
Symbol | |
ID | 3917104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1988558 |
End bp | 1989931 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640444627 |
Product | TPR repeat-containing protein |
Protein accession | YP_497157 |
Protein GI | 87199900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00380027 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CCGATGCGCA TCTGCGGCGG GCTGTCGCGC TGCGCGATTC CCAGGACCTC GCCGGGGCGC TCGATGCCAT ACGAGAGGCA GCGAGTGCCG CGCCAGGGGA CGCCAATGTC GCTTTAGGTG TCGCTCAGGT CACATTCGAA GCCGGGCTCG ATGCGGCCGA TCTCTATGGG CGGGCGGCAG AACTTGCCCC GGACCGCCTC GACCTCCAGC GCAGCCGCGC CAGCGCGCTG GCGGCAGAGG GACGGCAGCC GGAGGCCGAG GAGCTGCTGG AAGCACTGCT GGCAAGGCAT CCCGCATGGA TCGACGGCCA TCGCTGCCTG TGCGGCATGC GGGCGACTGC CGGCGAGGTC GATTTCGCGC GGAGCTTTCG GAGCGCCGTG GCGCGCGAGC CCGAGAATTT CGGACTGCGC ATGGCATGGT TCCACGTCCT TGCCACTGCT CGGCTTTGGG ACGAGGCAAG AGCGGTCGTC GACGAGGCCG AGGCCCTGCT GGGAGAGCGG CAGGCGTCGC TTCTGGGCAA GCTGTTCATT GCCAGCGAGA GCGGGGAAGA GGCTGCAAAC CCAAGCCTTT TCGACCGGGT GGAGCACGTG CAGGACCTCG GGCTCGACAT CGCCCGGGTG CGGCATTTCC TGCGCGGCGG CCAGATCGAG CGGGCGCGGG ATCTGTGCGT GCGACACATG GGCCAGCCCA CCATGCGCGC GTTCTGGCCC TACGCCTCGC TCGCCTGGCG CCTCCTCGAC GATCCGCGCG CGCAGTGGCT CGACGCAGGC ATGCGCCATG TGCGCGCGTT CGACCTGGAC TTCCGCGCGG AGGAGCTCGC CGCGCTGGCG GAGACGCTGC GGCGGCTGCA TACGATGCGA CAGCCCTGGC ACGAACAATC CGTGCGCGGG GGCACGCAGA CCGAGAGGCC GTTGCTGCTG CGGATCGATC CGGTGATCGC CAGTGCCAGG GCGCGGATCG AGGCCGCGGT GCGGCGATAC ATCGATGAAT TGCCCGATCA CGATCCCGCC CATCCATTGC TGGCGGCACC CCGCAAAGGC CTCCTCTTTT CGGGGAGCTG GTCGGTCAGG CTGCGCCCCG GCGGCTTCCA TTCGGTGCAC ACCCATCCGA TGGGATGGCT CAGCTCGGCG CTTTACGTGA CGGTGCCAGA GCGGGAACAG CGCGGCGCAG CGCCCGCCGG GCACCTGCGC TTCGGTACGC CGCCGCCCGA ACTGGCCCTG CCGCTGGAGG CCTACGGTGA GGTAGTGCCT GTGCCGGGGC GCCTGGCCCT TTTCCCTTCG ACCATGTGGC ACGGCACCGT CCCCTTCGCG GATGGCGAAC GGATGACCAT AGCCTTCGAC ATCGTACCCA ACCTGAAAGC CTGA
|
Protein sequence | MSTADAHLRR AVALRDSQDL AGALDAIREA ASAAPGDANV ALGVAQVTFE AGLDAADLYG RAAELAPDRL DLQRSRASAL AAEGRQPEAE ELLEALLARH PAWIDGHRCL CGMRATAGEV DFARSFRSAV AREPENFGLR MAWFHVLATA RLWDEARAVV DEAEALLGER QASLLGKLFI ASESGEEAAN PSLFDRVEHV QDLGLDIARV RHFLRGGQIE RARDLCVRHM GQPTMRAFWP YASLAWRLLD DPRAQWLDAG MRHVRAFDLD FRAEELAALA ETLRRLHTMR QPWHEQSVRG GTQTERPLLL RIDPVIASAR ARIEAAVRRY IDELPDHDPA HPLLAAPRKG LLFSGSWSVR LRPGGFHSVH THPMGWLSSA LYVTVPEREQ RGAAPAGHLR FGTPPPELAL PLEAYGEVVP VPGRLALFPS TMWHGTVPFA DGERMTIAFD IVPNLKA
|
| |