Gene Information |
Locus tag | Saro_0477 |
Symbol | |
ID | 3918606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 520686 |
End bp | 521564 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640443207 |
Product | hypothetical protein |
Protein accession | YP_495759 |
Protein GI | 87198502 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
|
|
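The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates (the convention under which the listed numbers match) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(520686, 521564)
print(length_bp)                  # 879
print(protein_length(length_bp))  # 292
```

Both values match the Gene Length and Protein Length fields of this record.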
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAG AAGAAAAGCT GCACGAGCTT ACCAAGCGCA AGTTCTACAA GGCCGAACAG GAAGCAGGCT TCGTTGAAAG CCAGCCCCGG ACAACTCCAC AGCAACAGGA CCCGGCCTAT CGGCTGGCAT TCCGCGACAC CGACTTCCTC CTGCGCGAGG AACTTCGCCC GGTCCGCTTC CAGCTTGAGC TGTTGAAGAC CGAAATGCTG CTCGATGAAG CTGCGATCGG ATCAACGCTG GTCGTCTATG GCTCGGCCCG CATCCCCTCG CCCGACATGG CCGAAGCGGC CATGTCGACG GCGACCACGC CCGAGAAGAA GGCGGTGATC GAACGCCTCG TGGCCAAGGC GAAGTATTAC GAGGAAGCCC GCAAGCTGGC CTATACGGCC AGTTCCTGCG GGCTGGTCGA GGAAGGCAAG CGGCAATTCG TGATCTGTTC GGGCGGCGGT CCTTCGATCA TGGAGGCTGC CAACCGGGGC GCGCAGGAGG CCGGGGCTGA ATCGATCGGG CTCAACATCG TGCTTCCGCA CGAGCAGGCA CCCAATTCCT TCGTTACGCC GCACCTCTCG TTCCAGTTCC ACTACTTCGC GCTGAGGAAG ATGCATTTCC TCCTGCGCGC GCGGGCGGTC GCGGTGTTCC CCGGCGGCTT CGGCACGTTC GACGAGTTCT TCGAGATGCT GACGCTGATC CAGACCGGCA AGATGAAGCC GATCCCGATC CTGCTGTTCG GCAAGGACTA CTGGAGCCGC GTGGTCAACT TCGAGGCACT GGCCGAGGAA GGCGTGATCA ACTTCGAGGA CCTCGAGCTG TTCACCCCGG TCGAAACGGC GGACGAGGCC TGGAAGCACA TCGTCGATTT CTACGATCTC GATTGTTGA
|
Protein sequence | MTEEEKLHEL TKRKFYKAEQ EAGFVESQPR TTPQQQDPAY RLAFRDTDFL LREELRPVRF QLELLKTEML LDEAAIGSTL VVYGSARIPS PDMAEAAMST ATTPEKKAVI ERLVAKAKYY EEARKLAYTA SSCGLVEEGK RQFVICSGGG PSIMEAANRG AQEAGAESIG LNIVLPHEQA PNSFVTPHLS FQFHYFALRK MHFLLRARAV AVFPGGFGTF DEFFEMLTLI QTGKMKPIPI LLFGKDYWSR VVNFEALAEE GVINFEDLEL FTPVETADEA WKHIVDFYDL DC
|
| |
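The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It assumes the sense-codon assignments of translation table 11, which are the same as in the standard genetic code; the alternative start codons of table 11 are ignored, which is harmless here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last few codons of the record are used for brevity. Pasting the full gene sequence, spaces included, works the same way.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these sense-codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGACAGAAG AAGAAAAGCT GCACGAGCTT"))  # MTEEEKLHEL
print(translate("GATTGTTGA"))                         # DC
```

The two printed fragments correspond to the first ten and last two residues of the protein sequence listed above. Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.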