Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3103 |
Symbol | |
ID | 3918145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3320807 |
End bp | 3321937 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445887 |
Product | hypothetical protein |
Protein accession | YP_498372 |
Protein GI | 87201115 |
COG category | [S] Function unknown |
COG ID | [COG3146] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.73421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCTTA CAGCGCGCAT CCACAAGGCC GTTTCCGAAA TTCCGGCAGA GGACTGGGAC CGCCTTGCCG GACCGGGCAA TCCCTTCGTT TCGCACACTT TTCTGGCGTT GCTGGAAGAG TCGGGCTCGG TTGGCGGCCG CTCCGGATGG TCGCCCCTGC CGATCGTGAT CGACGACGGG AATGGGCGAC CGGCGGCGGC CTTGCCTGCC TATCTGAAAA GCCACAGCCA GGGCGAATAC GTGTTCGACC ATTCGTGGGC GGACGCCTGG CAGAGGGCGG GCGGCAGCTA TTACCCCAAG CTCCAGATCT GCGCGCCGTT CACCCCGGCC ACGGGGCCGC GCCTGCTGCT TGGCGACCGT CCCGACCTTG CCGGCCCGCT GCTGCGCGCC GCGGAGCAGT TGTGCGAGGG CAACGAGCTG TCCTCGGCCC ACGCGACGTT CGTCGAACCG GCGCAGTTGC CGATGTTCGA GGCCGCCGGC TGGTTGCCGA GAAGCGACAT CCAGTTCCAC TGGGAGAATC GCGGCTATGC CAGCTTTGCC GATTTTCTCG GCGCGTTGTC TTCGGAGAAG CGCAAGAACC TGCGCAAGGA ACGTGCCCGC GCCCAGGACG GGGTGGAAAT CCGCCAGCTT ACCGGCGCGG ACATTCGCCC CGAGCATTGG GATGCCTTCT GGCTGTTCTA TCAGGACACC GGCGCACGCA AGTGGGGACG CCCGTACCTG ACGCGCCGCG CGTTCGACCT GATTGGCGAG CGGATGGCGG ACAAGGTCCT GCTGGTGCTG GCGTTTCTCG ATGGCGAGCC GGTGGCGGGG GCGCTCAACT TCATCGGCGC GCAGGCGCTT TACGGGCGAT ACTGGGGCGC GCTGGTCGAG AAGCCCTTCC TGCATTTCGA GCTTTGCTAT TACCAGGCCA TCGACGCCGC GATCCGGCTT GGGCTGGATC GGGTGGAGGC GGGTGCGCAA GGCGGCCACA AGCTGGCGCG GGGCTATGAG CCGGTCAGGA CGTGGTCCGC GCACTTCATC GCGGACCCCG GATTCCGCCG GGCGGTATCT GATTTTCTGG AACGGGAGCG TGCCGGCATC GCGCAGGACC AGATGCACCT GGGCGAGCGG ACTCCGTTCC GGAAGGGATA A
|
Protein sequence | MTLTARIHKA VSEIPAEDWD RLAGPGNPFV SHTFLALLEE SGSVGGRSGW SPLPIVIDDG NGRPAAALPA YLKSHSQGEY VFDHSWADAW QRAGGSYYPK LQICAPFTPA TGPRLLLGDR PDLAGPLLRA AEQLCEGNEL SSAHATFVEP AQLPMFEAAG WLPRSDIQFH WENRGYASFA DFLGALSSEK RKNLRKERAR AQDGVEIRQL TGADIRPEHW DAFWLFYQDT GARKWGRPYL TRRAFDLIGE RMADKVLLVL AFLDGEPVAG ALNFIGAQAL YGRYWGALVE KPFLHFELCY YQAIDAAIRL GLDRVEAGAQ GGHKLARGYE PVRTWSAHFI ADPGFRRAVS DFLERERAGI AQDQMHLGER TPFRKG
|
| |