Gene Information |
Locus tag | Sros_3961 |
Symbol | |
ID | 8667251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4411513 |
End bp | 4412763 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Glycosyltransferase-like protein |
Protein accession | YP_003339614 |
Protein GI | 271965418 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
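The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4411513, 4412763)
print(length, protein_length_aa(length))  # prints "1251 416"
```

Under these assumptions the computed values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields of the record.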
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.606556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.103628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGGGG CGGTCAGGCG CGGGGCACTG TGGCTGCTGA GGAGACTCAA CGGGAGACGG GCACGGCGTG CGCCCGCGCC GGACACGGGA ACGGTCCGCA TCGTGCTGCA GCACGCCTAC GGGATGGGCG GAACCATCCG GACCGTGCTC AACCTCGCCG CGTACCTGGC GCGGGAGCGC GACGTGGAGA TCGTCAGCGT GGTCCGGACG GCGGAGGAGC CCTTCTTCCC GATGCCTCCC GGCGTGCGGG TGTCGTTCCT GGATGACAGG ACCAGGCCCC GCGGCCGGCT CGCCCGGATG CTGTCGCGCT TCCCGAGCCT GCTGGTCCCG GAGCAGGAGA ACGTCTACCA CGCGCTCACC CTCTGGACCG ACCTGCGCCT GGTCGCCTTC CTGCGGTCCA TGCGCACCGG CGTGGTGATC TCCACCCGGC CGGGGTTCAA CCTGGTCACC GCGCTGTTCG CGCCTCCGGG AGTCCTCACG GTCGGCCAGG AGCACGTGGC CCTGGACGTC CACTCCCCCG AGATCCGGCG GCTCATCAAG AGGCGGTACG GCAGGCTGGA CGCCTTCGTG ACCCTCACCG ACGCCGATCT GCGCCAGTAC GCCAGGACGC TGCGGGCCGA TCCGCCGGGA CGGCTGCTGC GCATCCCCAA CGCCCTGCCC CACCTGGCCG GTGACATCTC CCCTCTGGAG GAGAAGGTCG TCATCGCCAT CGGCAGGCTG GTGCACGCCA AGGGCTTCGA CCGTCTGGTC AGGGCGTGGG CACACGTCGC GGCGGCGCAT CCGGACTGGG TGCTGCGCAT CTACGGCGGC GGTACGGCGA AGGCCGAGAC GAAGCTCCGC ACCAGGATCG AGGACGCGGG TCTGGCGGAC CGGGTGTTCC TGATGGGCAG TTCTCCGGAG ATCGGGGTCG AGCTGGCCAA GTCGTCGATC TACGTCGTCA GCTCGCGCTA CGAGGGGTTC GGCATGACGA TCCTGGAGGC GATGAGCAAG GGCGTGCCCG TGGTGAGCTT CGACTGCCCG CACGGTCCCA GGGAGATCAT CACCGACGAG CACGACGGGC TGCTCGTGCG TACCAAGAAG GCACAGGATC TGGCCGAGGC CGTCTGCCGC CTGATCGAGG ACCGGCGGTT GCGCGGCACG CTGGGCGGCA ACGCGGTGCG CACGGCCGCC CGTTACGACC TCGACGCCGT GGGCGCGCGC TGGGACGCCC TGCTGGCCGA CCTCGCCGGC GACGCGTCGC CGCCCGCCTG A
|
Protein sequence | MRGAVRRGAL WLLRRLNGRR ARRAPAPDTG TVRIVLQHAY GMGGTIRTVL NLAAYLARER DVEIVSVVRT AEEPFFPMPP GVRVSFLDDR TRPRGRLARM LSRFPSLLVP EQENVYHALT LWTDLRLVAF LRSMRTGVVI STRPGFNLVT ALFAPPGVLT VGQEHVALDV HSPEIRRLIK RRYGRLDAFV TLTDADLRQY ARTLRADPPG RLLRIPNALP HLAGDISPLE EKVVIAIGRL VHAKGFDRLV RAWAHVAAAH PDWVLRIYGG GTAKAETKLR TRIEDAGLAD RVFLMGSSPE IGVELAKSSI YVVSSRYEGF GMTILEAMSK GVPVVSFDCP HGPREIITDE HDGLLVRTKK AQDLAEAVCR LIEDRRLRGT LGGNAVRTAA RYDLDAVGAR WDALLADLAG DASPPA
|
| |