Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6168 |
Symbol | |
ID | 8669470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 6767682 |
End bp | 6769331 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Urease |
Protein accession | YP_003341641 |
Protein GI | 271967445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.308489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0462908 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGT ACGGCCCGCG TAAGGGAGAC CGGGTCAGGC TCGGCGACTC CGGCCTGGTG GTCGAGGTCG AGCACGACTC GCAGAAGGCC GGGGAGGAGT TCCTGGCCGG GTTCGGCAAG ACCGCCCGGG ACGGCCTGCA CCTGAAGGCC GCCTCGATCA GGGACACCTG CGACCTGGTG ATCAGCAACG TGCTGATCCT CGACCCGATC CTGGGCGTGC GGACCGCGTC GATCGGCGTC CGGGAGGGGC GCGTCCACGC CGTCGGCCGG GCCGGCAACC CCGACACCCT GGACGGGGTG GACGTGGTGG TCGGCACCGG GACCACGATC GTGTCCGGTG AGGGGCTCAT CGCGACGGCG GGGGCGATCG ACACCCACGT CCACCTGCTC AGCCCGCGCA TCATGGAGGC CTCCCTGGCC TCCGGCGTCA CCACGATCAT CGGCCAGGAG TTCGGGCCGG TCTGGGGCGT GGGGGTGAAC TCGCCGTGGG CCCTGCGGCA CGCGTTCAAC GCCTTCGACG CCTGGCCGGT GAACATCGGC TTCCTGGCGC GCGGCTCGTC CTCACACGAG GCTCCGCTGG TCGAGGCGCT GGTCGAGGGC GGTGCGAGCG GCTTCAAGGT GCACGAGGAC ATGGGGGCCC ACACCCGCGC GCTGGACACC GCGCTGCGGG TCGCCGAGGA GCACGACGTG CAGGTGGCCC TGCACACCGA CGGTCTCAAC GAGTGCCTGT CGGTGGAGGA CACCCTCGCC GTGCTCGGAG GCCGGACGAT CCACGCCTTC CACATCGAGG GCTGCGGCGG CGGTCACGTG CCGAACGTGC TCAAGCTGGC CGGGGTGGCC AACGTCATCG GCTCCTCCAC CAACCCGACG CTGCCCTTCG GCCGGGACGC CGTCGCCGAG CACCACGGGA TGATCGTCTC GGTGCACGGG CTCCGGCCCG AGCTGCCGGG GGACGCCGCG CTGGCCAGGG ACCGGATCCG GGCCGGCACC ATGGGCGCCG AGGACGTGCT GCACGACCTC GGGGTCATCG GCATCACCTC CTCCGACGCG CAGGGCATGG GCCGGGCGGG CGAGACGGTC CGCCGGACGT TCGCAATGGC CGGGAAGATG AAGGGGGAGC TGGGCGCGCC GGAGCGCAAC GACAACGAGC GGGTGCTGCG CTACCTGGCC AAACTGACGA TCAACCCGGC CATCGCGCAC GGCCTGGCCC ACGAGGTCGG CTCGCTGGAG CCGGGCAAGC TGGCCGACAT CGTGCTGTGG CGGCCGGACC ACTTCGGCGC CAAGCCGCAG CTCGTGCTGA AGGCGGGCTT CCCCGCCTAC GGCGTGACCG GGGACCCGAA CGCCTCCACC GACACCTGCG AGCCGCTGGT GCTGGGCCCG CAGTTCGGCG CGTACGGCGC GACCGCCGCG GACCTGTCGG TGGCCTTCGT CAGCGGGGCG GCCGCGGACG CGGCCGACGA CCGCATGACG ACCCGCCGCC GCCGGGTGGG GGTGCGCGGC ACCCGCGGCA TCGGCCCCGG CGACCTGGTC CTCAACTCCC GGCTCGGGTC CGTCGAGGTG GACACTCTCG GCCAGGTCAC CCTCGACGGC GACCCGGTCC GGTCGGCCCC CGCCGACTCG GTCTCGCTGA GCCGCCTGTA CTTCCTGTGA
|
Protein sequence | MSMYGPRKGD RVRLGDSGLV VEVEHDSQKA GEEFLAGFGK TARDGLHLKA ASIRDTCDLV ISNVLILDPI LGVRTASIGV REGRVHAVGR AGNPDTLDGV DVVVGTGTTI VSGEGLIATA GAIDTHVHLL SPRIMEASLA SGVTTIIGQE FGPVWGVGVN SPWALRHAFN AFDAWPVNIG FLARGSSSHE APLVEALVEG GASGFKVHED MGAHTRALDT ALRVAEEHDV QVALHTDGLN ECLSVEDTLA VLGGRTIHAF HIEGCGGGHV PNVLKLAGVA NVIGSSTNPT LPFGRDAVAE HHGMIVSVHG LRPELPGDAA LARDRIRAGT MGAEDVLHDL GVIGITSSDA QGMGRAGETV RRTFAMAGKM KGELGAPERN DNERVLRYLA KLTINPAIAH GLAHEVGSLE PGKLADIVLW RPDHFGAKPQ LVLKAGFPAY GVTGDPNAST DTCEPLVLGP QFGAYGATAA DLSVAFVSGA AADAADDRMT TRRRRVGVRG TRGIGPGDLV LNSRLGSVEV DTLGQVTLDG DPVRSAPADS VSLSRLYFL
|
| |