Gene Information |
Locus tag | Sros_3823 |
Symbol | |
ID | 8667113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4263111 |
End bp | 4264238 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | hydrogenase expression/formation |
Protein accession | YP_003339486 |
Protein GI | 271965290 |
COG category | |
COG ID | |
TIGRFAM ID | |
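
The coordinate and length fields above can be cross-checked with a short sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon in the CDS; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4263111, 4264238)
print(length, protein_length(length))  # 1128 375
```

Both results agree with the Gene Length and Protein Length fields reported for Sros_3823.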
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.786204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0677682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCG TCGATGAGTA CCGGGACGCG GAGAAGGCCC GGACGCTGGC CGCGCAGATC GCCGCGCTGT GCGAGCCCGG CCGCGCCTAC AAGTTCATGG AGGTGTGCGG CGGGCACACC CACACGATCT ACAAGCACGG GCTGGAGGAC TACCTGCCCG AGGCCGTCAC GCTCGTGCAC GGTCCGGGCT GCCCCGTCTG CGTCATCCCC ATGGGGCGGG TGGACGACGC CATCCACATC GCCGAGCAGC CCGACGTGAT CATGACGTCG TTCGGCGACA TGATGCGCGT GCCCGGGGGC GGCGGCTCCT TCCTCGACGC CAAGGCGCGG GGCGCCGACA TCCGCATGGT CTACTCCCCG CTGGACGCTC TGAAGATCGC GCGGGAGAAC CCCGCCAGAC GCGTGGTGTT CATGGCGATC GGGTTCGAGA CCACCGCCCC CTCGACGGCG ATGACCGTCC TGCGCGCGGC CGCCGAGGGG ATCGAGAACT TCTCGGTCTT CTGCAACCAC GTGACGATCA TCCCCGCGAT CAAGGCCATC CTGGACTCTC CCGACCTGCG CCTGGACGGC TTCGTCGGGC CGGGTCACGT CTCGGCCGTC ATCGGCTGCC GGCCGTACGG GTTCATCGCG CGTGACTACG GAAAACCCCT GGTGGTGGCC GGGTTCGAGC CGCTCGACGT GCTGCACACG GTCTACCGGA TCCTCGCGCA GCTGGCCGAG GGCCGGGTGG AGGTGGAGAA CCAGTACGCC CGGGTGGTGC CGTGGGAGGG CAACCCGAAG GCCCTGGGCG TCATCAACCA GGTGATGGAG CTGCGGCCGT ACTTCGAGTG GCGGGGGCTG GGGTTCATCT CGCACTCGGC GCTCAGGATG GGGGAGCGCT ACGCCGCCTT CGACGCCGAA CGGATCTTCC AGATCCCCGG CGGGCGGGTG GCCGACCCCA AGGCGTGCCA GTGCGGCGAG GTGCTCAAGG GCGTGCTCAA GCCGTGGGAG TGCAAGGTGT TCGGCACCGC CTGCACGCCG GAGACCCCGA TCGGCACCTG CATGGTGTCG TCGGAGGGCG CCTGCGCGGC CTACTACAAC TTCGGCCGCT TCTCCCGTGA GCGGGTCAAG GAGGCGACCC ACCGGTGA
|
Protein sequence | MRFVDEYRDA EKARTLAAQI AALCEPGRAY KFMEVCGGHT HTIYKHGLED YLPEAVTLVH GPGCPVCVIP MGRVDDAIHI AEQPDVIMTS FGDMMRVPGG GGSFLDAKAR GADIRMVYSP LDALKIAREN PARRVVFMAI GFETTAPSTA MTVLRAAAEG IENFSVFCNH VTIIPAIKAI LDSPDLRLDG FVGPGHVSAV IGCRPYGFIA RDYGKPLVVA GFEPLDVLHT VYRILAQLAE GRVEVENQYA RVVPWEGNPK ALGVINQVME LRPYFEWRGL GFISHSALRM GERYAAFDAE RIFQIPGGRV ADPKACQCGE VLKGVLKPWE CKVFGTACTP ETPIGTCMVS SEGACAAYYN FGRFSRERVK EATHR
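
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a minimal sketch. It assumes the space-separated 10-character grouping used in this record and builds the codon-to-residue mapping of translation table 11, which for elongation codons is the same as the standard code. The helper names are hypothetical.

```python
BASES = "TCAG"
# Residues in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGCGCTTCG TCGATGAGTA CCGGGACGCG"))  # MRFVDEYRDA
print(translate("GAGGCGACCC ACCGGTGA"))               # EATHR
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content value listed in the record. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that case does not arise.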