Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0660 |
Symbol | |
ID | 3918085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 694266 |
End bp | 697238 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640443391 |
Product | hypothetical protein |
Protein accession | YP_495941 |
Protein GI | 87198684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGCG TGAAGGCGGG CACTCTCGAA ATCGAGATCC TGACGAACAT GGCGCGCCTG CAGGAGGAAA TGCGCCAGAT CGAAAAGTCG GTCGGCAACA TGTCAACCGG GGTTGCGCGC TCCACCAAGG CTGCAAACGA CAATATCGGG AGTATTGGCA AGACCAGCGG CCTAGCTAAG CACCACGTGC AGAATCTGGC CTTTCAGTTC CAGGATCTCG GTATCCAGCT CACCGCCGCT GCCCAGAGCA GCAAGCCCTT CCAAGGGGCG ATGATGGCGC TATTCCAGCA GGGCACGCAG ATTCAAGGGG TGATGTCGCA GGCGGGCATA GGCGTTGGGG GCCTGATCGC CCAGATCGGG CGCTTTGTCC TCGCCGCCGC GCCCGCCATC GCCGCCATCG GTGCAATCAG TGCCGCGGTT GGTCTCGTCA CCAGCGAGAT CAACGAGAAC AGCAAGGTCA CGGTCACCTG GCAGGACACC CTGCTGGGTA CTTATGACGC ACTGAAGAAG TATCTCACCG ACCAGCTCAC CGGTGCATTC AAGGCATTCG GCCTCGAAAC TTCGAACGTC TGGAAGGACG TCGTCAACGC GGCGAAGTGG GGAATCAACT GGATCATCGG CGCGATGACG GTGGTACCGC GGGTTCTTCG CGACACCTGG AGCCTCATTC CCGCAGGCGT AGCGGACATC TTCGTGACGG CTGCCAATGG CGCCATCCGG GCAGTCAACG CGATGGTCGC GAAGACCGTC GAGATCCTCA ATGGCTTCAT CAGCAGCGCC AATGTCATCC TTGGCAAGGT CGGCCTAGAT CTTCCCAAGT TGGCAGCTCC GCAGATCGGG GAGCTCCAGA ACAGCTATGC CGGAGCCGGT GCCAAGCTCG GAACCGCGTT CATGGGCGCA ATCCGTGACA CCGTGACGCG TGACTACCTC GGAGACGCTG CCGCAGCGAT CAGTCCGTTT GCCCAGGCGC GGGCTGTGGA GCGCATGAAG AAGGACGCGA AGAAGGCGGG CAAGGAGGCA GGAAAAGCAC TCCGCGATGG CGCCGAAGAC GAGCTGAAGA AGCTCCTCGC ATCTCTGGAG GGCACCTTCG CGATCTATCG GGACGCGGCC AAGACCACCG GGAAGATGCT GGACAAGCAG CTCGATCGGG ACTGGCAACG GATGTTCGAC GAGATGGCGG ACAAGCAGCG CAAAGCCTCT CAGGCGGCAG TTGATGCTGC CGATGCAAAT GCTGCCTGGA ACGACGAATT GCGCCGCACG ATCGAGCTTC TCGATCAGAT TGGCGGGTTT GCAAGCATTC TGGCGAACAT CGGCGCAGTT ATCGAGGGCT TCTCCAGCGG CGACTTCTCG GGTGTGCGCG GTCCGCTGGG CGGCCTGCTC GGCCTGGTTG GCAGTACCGA TGAAGGCAAG CAGGCGATCA AGGACCTGGG TGTTGTGTTC CGCGATGCGC TCGACGGCGT GTTCGGCGGA GAGGGCAGCT TCACGAAGGC ACTGAAGGCG GCGAGCGTGG GTGTAGCTGC CGGGCAGATG GTGTTCGGTT CCAAGAACAG CGGCATCGGT TCAGCAGTCG GCGGGGTGCT GGGAGAAGTT GCCGGCAAGG CTATCGGCAA ATCTGTTGGT GGGCTGCTCG GCAAAGTTGG TGGTCCGCTT GGGTCCATTG TCGGTGGCAT TCTCGGCGGC GCGCTTGGCA GCCTGTTCAA GACGGTAAAG ACGGGCTTTG CCGTCGTTTC CAACACGGGC GTGACCTCAG GAGGCAGCAG CTCCGAACTT GCGTCGTCCT CGAAATCGAG CGGCACGGGA ATCCAGGCCG CAATTCAGAA CATTGCCGAT CAGCTGGGCG GGTCGGTGGG GAACTACTCG GTCTCGATCG GCAAGCGGTC TTCGGGCTGG ATCTCGGTCT CGGCGTCGGG CTCGTCGCAG GTTGCCGACA AGAACTGGAA GAAGCGCAAC GTCGGCGGTG ACTTGATCTA TGATGGCAAG GACGAGGCCG AGGCCCTTCG CGTCGCCCTG CTCAATGCCA TTCGCGACGG AGCCATTCAG GGCGTGCGCG CGGGCACTGC CGCGCTCCTC AAGAAGGATG GCGACATCGA GACCCAGTTG AACAAGGCCC TGAAGTTCGA GGGCGTGTTC ACCAGCCTCA AGCAGATGAC GGACCCGGTC GGGTATGCGC TGCAGCAGCT GACCAAGGAG TTCGAAGGCC TCAAGAAGAT CTTCGACGAG GCAGGGGCGA CAGCGGCAGA GTACGCGGAC CTCGAGAAGC TCCTCAACCT CCAGAAGCAG GAAGCCATCG ACAAGGCGGC GGCCGAGGTG CGTCAAAAGG CACTCGAGGC CGTCAACGAT CCGCTGAAGC TGCAGATCCG CATATTGGAG CTGCTCGGCA AGGGAGAGGA TGCCGTGGCT GCCGCGCGCA TCATGGAACT CGCCAGCCTC AAGTCGTCTC TTCAGCCCCT TCAAGCGATG GTCTACCAGC TGGAAGACGC GAAGGCGGTA ATCGATCAGT TCGGGCCACT GGCCGACGAT TTGCGCGCCT ATCGCAAGGA GCTGCTGGGC GGCAGCACAA CCATGGGGCT TGCTTATCTG GCGGGCCAGT TCCGCAGCAC TGCGGCCTCG GCCGCCGGCG GCGATGCCAC CGCCCTTGGA CAGTTGCGCA GCGTTTCTGG TCAATACCTC GATGCAGCCA AGGAGAACGC GGCCAGCGCG CTCGACTACC AGCGGGCTGT TTCCGAGGTT CTTGCGGCTG TGGACCGCGG GCTCTTCGCT GCCGACGCCC AGGTCGATTA CGCGCAAGCC CAGATCGACG CGATCGAGAA CACCGCGAAC ATCATGCAGT CGATGAAGAC CGAGCTGGTG ACGCTGCAGA AGCAGGTCGC TGACACCAGC GCAACCACGC TCAAGCTCTG GCAGCGGTTC GAGATCAATG GCTTGCCGGT GAACAACGAT GCCGGGCCGG TCCAGGTGGA GGTAGTCTCG TGA
|
Protein sequence | MAGVKAGTLE IEILTNMARL QEEMRQIEKS VGNMSTGVAR STKAANDNIG SIGKTSGLAK HHVQNLAFQF QDLGIQLTAA AQSSKPFQGA MMALFQQGTQ IQGVMSQAGI GVGGLIAQIG RFVLAAAPAI AAIGAISAAV GLVTSEINEN SKVTVTWQDT LLGTYDALKK YLTDQLTGAF KAFGLETSNV WKDVVNAAKW GINWIIGAMT VVPRVLRDTW SLIPAGVADI FVTAANGAIR AVNAMVAKTV EILNGFISSA NVILGKVGLD LPKLAAPQIG ELQNSYAGAG AKLGTAFMGA IRDTVTRDYL GDAAAAISPF AQARAVERMK KDAKKAGKEA GKALRDGAED ELKKLLASLE GTFAIYRDAA KTTGKMLDKQ LDRDWQRMFD EMADKQRKAS QAAVDAADAN AAWNDELRRT IELLDQIGGF ASILANIGAV IEGFSSGDFS GVRGPLGGLL GLVGSTDEGK QAIKDLGVVF RDALDGVFGG EGSFTKALKA ASVGVAAGQM VFGSKNSGIG SAVGGVLGEV AGKAIGKSVG GLLGKVGGPL GSIVGGILGG ALGSLFKTVK TGFAVVSNTG VTSGGSSSEL ASSSKSSGTG IQAAIQNIAD QLGGSVGNYS VSIGKRSSGW ISVSASGSSQ VADKNWKKRN VGGDLIYDGK DEAEALRVAL LNAIRDGAIQ GVRAGTAALL KKDGDIETQL NKALKFEGVF TSLKQMTDPV GYALQQLTKE FEGLKKIFDE AGATAAEYAD LEKLLNLQKQ EAIDKAAAEV RQKALEAVND PLKLQIRILE LLGKGEDAVA AARIMELASL KSSLQPLQAM VYQLEDAKAV IDQFGPLADD LRAYRKELLG GSTTMGLAYL AGQFRSTAAS AAGGDATALG QLRSVSGQYL DAAKENAASA LDYQRAVSEV LAAVDRGLFA ADAQVDYAQA QIDAIENTAN IMQSMKTELV TLQKQVADTS ATTLKLWQRF EINGLPVNND AGPVQVEVVS
|
| |