Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3930 |
Symbol | |
ID | 5077414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 104171 |
End bp | 105958 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481036 |
Product | hypothetical protein |
Protein accession | YP_001165698 |
Protein GI | 146275537 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCTG CTCCACGCAA AGGGCGCAAG GTCAGTTTCG AGGCCGCCGC GACTGCGCCC GAGGCCGATC CATGGTCGCA GATCAGCACG CTGCGCTTCT TGATCGATCC GCCATACGGC GGCGAGTTGC TCGTGGATTT CACCAGCTTG CGGCCCCGTG GCCTGGCTCT CGCCTTTGCA CGGGGCCTGT TCATGCTGGT CGCTCCGCGT GGCCCCATCA TGGTGCGTTC GTCGATCAAG ACATACGTCA CACAGTTGCC GAGCTTCTTT GCCTATCTTG CCGGGACAGG CGACCGGATC AACGGCCCGG CAGACCTTCG CGCCGACCAT ATCGACGGGT TCGAGAGGTG GCTCGCCGCG AAGGGCAAGT CGCGCGTGCA CGCGCCGACC TATGTCGCCA AGGTCGTCTC CGTCCTTCGC CGCATTGCGG CCGATGGTCC GGAGCTTGTC GATCCTGGCT TGCGGGAACG GCTGCGCTAC ATCAGCTCGC ACCCCCACGT TCGCCCCCGT CCACGCGATG CCTACAGTCC GTATGTCGCT CGCCAACTCC GCGATGCGGC ACGCGCCGAT CTTGTTGCGA TCGAGGCGCG CTTGCGGTCG GGGTCGCGTT TCGATGAGGT GCCGGAGATA GAAGGCGCTT ACCGCCAAGC CCACGCTGCC ATCGAAGCGC GCGGCGTGCT GCACCACCGT GATCCCGCAT GTCAGAGTCT GTATAAGCTG CGGTGGATTC GCGAACTGAG CGGCGCAAGG TTCAACCTGG GCCTCCATGC TGCCCACTAC CTTACTGCAC GCGACGTCGT GCCGCTTCTC GTTCTGCTCT CGCTGGAGAC GGGGCTTGAG CTGGAGTGCT GCAAGTCGCT CACCATCGAT TGTTTGCGCA ACGCTTCGGG CGGCACCGTC GATGTCGCCT ACACCAAGCT TCGCGCCCAC GGTGCCGAGC ACAAGACCAT CCGGGTCCGT GATGGCGGCT CATCGACACC GGGCGGCCTG ATCCGGCGGA TTATCGCACT CTCCGCCAAG GCCAGGGTGC ATAATTCCAG CGACAACCTT TGGGTCTATT ATCAGACCAA CGAGATCACT GCTGGTATCC GCCATCCGCG CACGACGATC GACGCGTGGA CGCTGCGTCA CGGCATTGTC GATGACGCCG GTCGACCGCT ACTCCTGCGC CTTTCGCAGC TACGCAAGAC CCACAAGGCC CTGTGGTACC TCAAGACCGA AGGGCACATG GCCCGCTTCG CTGTTGGTCA TACCGTCGAG ATCGCGGCAC GACACTACGC CGATATTCCG GCACTCAGAC CGCTGCATGA GCAGGCCGTG GCGGACGCCC TCGAAGAAGC GCTGACCGGC CCGAGGATCC TTCCTCCTCC CGACGAGGAG CGTCTGCGCG GCGCGTTGGC GAGCCCCGCG CCGGATGATG AAGGCGTCTC GCGCGCACTT CTCGACGGCG AACAGGACGT CTGGCTTGCC AGCTGCGGCA ACTTCTATTC CAGTCCCTTT GCATCTGCCG GTACCGCGTG CCCCACACCC TTCTGGGGTT GCCTCGATTG CCGCAACGCG GTCATCACCG CGCGCAAGCT GCCCGCCATC CTCGCGTTCC TCTCCTTCGT CGATGATCAG CGAGCCGGCC TGAGCGCGGC CGACTGGGCA GCCAAGTTCG GCCATGCCCG TGACCGGATT GTTCAGCAGA TCCTGCCTGC CTTCGGCGAC GACGTTGTGG CAAGGGCCCG GGTGCAGGTC GCGGCCGAAC CTCCCACGGT CTATCTGCCG CCCGAGGCCC GCGCATGA
|
Protein sequence | MTAAPRKGRK VSFEAAATAP EADPWSQIST LRFLIDPPYG GELLVDFTSL RPRGLALAFA RGLFMLVAPR GPIMVRSSIK TYVTQLPSFF AYLAGTGDRI NGPADLRADH IDGFERWLAA KGKSRVHAPT YVAKVVSVLR RIAADGPELV DPGLRERLRY ISSHPHVRPR PRDAYSPYVA RQLRDAARAD LVAIEARLRS GSRFDEVPEI EGAYRQAHAA IEARGVLHHR DPACQSLYKL RWIRELSGAR FNLGLHAAHY LTARDVVPLL VLLSLETGLE LECCKSLTID CLRNASGGTV DVAYTKLRAH GAEHKTIRVR DGGSSTPGGL IRRIIALSAK ARVHNSSDNL WVYYQTNEIT AGIRHPRTTI DAWTLRHGIV DDAGRPLLLR LSQLRKTHKA LWYLKTEGHM ARFAVGHTVE IAARHYADIP ALRPLHEQAV ADALEEALTG PRILPPPDEE RLRGALASPA PDDEGVSRAL LDGEQDVWLA SCGNFYSSPF ASAGTACPTP FWGCLDCRNA VITARKLPAI LAFLSFVDDQ RAGLSAADWA AKFGHARDRI VQQILPAFGD DVVARARVQV AAEPPTVYLP PEARA
|
| |