Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0075 |
Symbol | |
ID | 3917674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 76481 |
End bp | 77512 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640442800 |
Product | hypothetical protein |
Protein accession | YP_495358 |
Protein GI | 87198101 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGCG GGGACCTGGC CGGAGCGCTG GCGGCAGCAC GCACCGTCGA GCCGACAGCA GGCGACCACA CAGCACCCGG TCACACCCAC GAGACTGCCT GCCTCAACTG CGGCACCGCG CTGATCGGCA GCCATTGCCA CGCCTGCGGC CAGGCCGCGC ATGTCCACAA GACGCTGGGC GCGTTCTTCC ACGACCTGCT GCACGGCGTG TTCCATTTCG AAGGGAAGAT CTGGCGCACG CTGCCCCTTC TGGTCGCCAG GCCCGGGCAA CTGACGCGCG AGTACATCGA CGGCAGGCGC GCGAGCTATA TCTCGCCGAT CGCGCTGTTC CTGTTCTGCA TGTTCCTGCT GTTCACCACG ATCAGTTCGC TCAGCGGCGA TATCGGGACC AAGGCGCAGA CATCGGTCGC GGCCGCGCTC GAACAGGAGC GCAACGACCT TCGCGAACTC GAAGCATCGC GCACGGCTTC CGCCACCGAC CCCGCCGAAC TGGCAAAAGT GGACAAGGCG CTGGCGGAAA CGCGCGACAA CATCGCCGCG CTCGAGAAGC TCCAGACCAC CGGCATTACC GTCGGCAATG TCAGCACCGA GGGGCTCGAC AACATAAGCG ACGTGCCCGC GATCGCGGAA GCGGTGCGCA TCTGGAAGGA GAACCCCCGC CTCGCGCTCT ACAAGGTGCA GACCTACAGC TACAAGCTGA GCTGGGCGCT GATCCCGATC TCGGTGCCGT TCCTGTGGCT GCTGTTCCCC TTCAGCCGCC GCTTCGGGCT CTATGACCAC ACCGTCTTCG TCACCTATTC GCTCTGCTTC ATGACGCTTT TGGCGATCCT GGCCATGCTT GGAAGCAAGG TCGGCATCCC AGGCCTCGGA CTCCTCGCCC TGGTCCCGCC GGTCCACATG CTGCTGCAAT TGCGCGGAAC CTACGGAGTC ACCTGGTTCG GCGCATGGTG GAGGACCTGG TTCCTCTCCA TGTTCGCCTT CACGGCGCTG ATGCTGTTCG GCGTGCTCAT CCTGGTGGAA GCGGGCGGCT GA
|
Protein sequence | MTGGDLAGAL AAARTVEPTA GDHTAPGHTH ETACLNCGTA LIGSHCHACG QAAHVHKTLG AFFHDLLHGV FHFEGKIWRT LPLLVARPGQ LTREYIDGRR ASYISPIALF LFCMFLLFTT ISSLSGDIGT KAQTSVAAAL EQERNDLREL EASRTASATD PAELAKVDKA LAETRDNIAA LEKLQTTGIT VGNVSTEGLD NISDVPAIAE AVRIWKENPR LALYKVQTYS YKLSWALIPI SVPFLWLLFP FSRRFGLYDH TVFVTYSLCF MTLLAILAML GSKVGIPGLG LLALVPPVHM LLQLRGTYGV TWFGAWWRTW FLSMFAFTAL MLFGVLILVE AGG
|
| |