Gene Information |
Locus tag | Sros_3728 |
Symbol | |
ID | 8667016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4135822 |
End bp | 4139013 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003339394 |
Protein GI | 271965198 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
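The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 4135822
end_bp = 4139013
reported_gene_length = 3192      # bp
reported_protein_length = 1063   # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon that is not counted in the protein length.
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("leftover bases:", remainder)
print("protein length:", protein_length, "aa")
print("matches record:", gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.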
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0214561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0673651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACACG GGGGATGGAT CTCCCGCTGG CCGGTGATCC GCCAGCTCAG GGGAGAGGAT GCCAGGGGCG ACGCCGCCCG GTCGGCGCGG ACCGACGCGC TGCGGCCCCG CACCGAGGAC GCCGACACGG TCGCGCGATC GATCTGCCCC TACTGCGCGG TCGGATGCGG CCAGCTCGTC TACGTCAAGG ACGGCGAGGT CACCCAGATC GAGGGCGACC CCGACTCGCC GATCTCGCGC GGACGGCTCT GCCCCAAGGG ATCGGCGAGC AAGCAGCTCG TCACCCATCC CGGCAGGCAG ACGCGCGTGC TCTACCGGCG GCCGCACGGC ACCGAGTGGG AGCCGCTGGA CCTCGAGACC GCCATGGACA TGATCGCCGA CCGGGTGGTC CGGACCCGGC GGGAGACCTG GCAGCAGGAG GAGGACGACA AGGTCGTACG GCGCACCATG GGCATCGCCG GCCTGGGCGG GGCGACGCTG GACGACGAAG AGAACTACCT GATGAAGAAG CTGTACACCG CCCTCGGCGC GATCCAGGTG GAGAACCAGG CGCGTATTTG ACACTCCTCC ACCGTCCCCG GTCTGGGGAC CAGCTTCGGG CGTGGCGGCG CGACGGACTT CCAGCAGGAC CTGGTGGAGT CCGACTGCAT CGTCATCCAG GGCTCCAACA TGGCCGAGTG CCATCCGGTG GGGTTCCAGT GGGTGGTCGA GGCCAGGGCG CGGGGAGCGA AGGTCTTCCA CGTCGATCCG CGCTACACGC GCACCAGCGC GCTCGCCGAC AAGCACGTGC CGATCCGGGC CGGGAGTGAC ATCGTCCTGC TCGGCGCGCT GGTCAACCAC GTGCTCGTCA ACGAGCTCGA CTTCCGCGAG TACGTCCTGG CCTACACCAA CGCCTCGACG ATCATCTCCG AGCACTTCCG GGACACCGAG GACCTGGACG GCCTGTTCTC CGGTTTCGAC GCCGAGCACC GCAAGTACGA CCCGTCGAGC TGGGCCTACG AGGGCGCCCG GGAGGAGGCC GCCGCCGGGC TGCGCAGCGA GCAGCTGGAG GGCGGGAAGG AACACGCCGA GGTCGCGGGG GCCGAGGAGT ACGGCTCGGG CGGCGCCGCG ATGCTCGGCA GGCCGCGCAC CGATCCGACG CTGACCCATC CGCGCTGCGT CTACCAGATC CTCAAGCGGC ACTTCGCCCG CTACACCCCG GACCTGGTGG CCGAGCTGTG CGGCATCTCC CGTGAGGACT TCGCCGAGCT CGCCGACGCG ATCACCGCGA ACTCCGGCCG GGAGCGCACC ACCGCCTGGG CCTACGCGGT CGGCTGGACC CAGCACTCGG TGGGCGTGCA GTACATCCGC ACGGCCGCGA TCCTGCAACT GCTGCTCGGC AACATGGGTC GCCCCGGCGG CGGCATCATG GCGCTGCGCG GCCACGCCAG CATCCAGGGC TCCACCGACA TCCCGACGCT GTTCAACATC CTGCCGGGAT ACCTGCCGAT GCCCCACGCC CACCGGCACC AGAGGCTGGA CGACTACCTC GCCGAGGCCG GAGCCGACAC CGGGTTCTGG GGCAGGAAGC GCTCCTACAT GGTGAGCCTG CTCAAGGCGT GGTGGGGCGA GGCGGCCACC GAGGAGAACG ACTTCTGCTT CGGTCACCTG CCGCGGCTCA CCGGCGACCA CGGCCACTAC ACGACCGTGA TGGCGCAGAT CGAGGGCACC GTGAAGGGCT ACTTCGTGGT GGGGGAGAAC CCCGCGGTCG GCTCCTCCGG AGGCCGGGCG CAGCGCCTCG GGCTGGCCAA CCTCGACTGG 
CTGGTGGTGC GCGACCTGAC GCTGGTGGAG ACGGCCACGT TCTGGAAGGA CGGCCCCGAG CTGGAGACCG GGGAGATGCG CACCGAGGAC ATCGCCACCG AGGTGTTCTT CCTGCCCGCC GCGAGCCACG TGGAGAAGGA GGGCACCTTC ACCAACACCC AGCGGCTGCT GCAGTGGCGG GAGAAGGCGC TCGACCCGCC GGGCGACTGC CGTAGCGACC TGTCGTTCTA CTACCACCTG GGCAAGCGCA TCAGGCAGCG GCTGTCCGAC GACGAGATCG ACCGGCCACT GCGCGAGCTG ACCTGGGACT ACCCCGAGGA GGGCGAGCAC CGGGACCCCT CGGCTGAGGC CGTGCTGCGC GAGATCAACG GCACCGGCCC CGGCGGGCGG GCGCTGTCGG CCTACACCGA GCTCAAGCCC GACGGCTCCA CCCGGTGCGG CTGCTGGATC TACTGCGGGG TCTACGCCGA CGAGGTCAAC CAGGCGGCCC GGCGCAAGCC GGGGCGCGAG CAGAACCGGG TCGCGCTCGA ATGGGGCTGG GCGTGGCCGG CCAACCGCCG CATCCTCTAC AACCGCGCCT CCGCCGATCC CGAGGGCCGC CCGTGGAGCG AGCGCAAGGC CTACGTCTGG TGGGACGCGG AGAAGGGGGA GTGGACCGGG GACGACGTGC CGGACTTCGA GAGGGACAAG CCGCCGGACT ACCTGCCGCC CGAGGGGGCC AGAGCCGAGG AGGCGCTGGC CGGGACCGAC CCGTTCATCA TGCAGGCCGA CGGCAAGGGC TGGCTGTTCG CTCCCGCCGG GCTGGCCGAC GGGCCGCTGC CGGCGCACTA CGAGCCGCAC GAGTCGCCGG TGCACAACCC GCTGTACGGC CAGCAGGCCA ACCCCGCGCG CAAGGTCTAC CGGAGCCCCG CGGCCCCCTA CAACCCGCCG GAGTCGCCGC GCTTCCCGTA CGTGCTCACC ACCTACCGGC TCACCGAGCA CCACACGGCG GGCGCCATGA GCCGGCCGCT GTCCCACCTC GCCGAACTGC AGCCCGAGCT GTTCTGCGAG ATCTCCCCGC AGCTCGCCGC CGAGGTCGGG GTGGTCAACG GGGGCTGGGC GACGATCGTG ACGACGCGTA CCGCGATCGA GGCCCGCGTC CTGGTGACCG AGCGCGTCCG CCCGCTCCGG GTCGAGGGCA GGCTGATCCA CCAGGTCGGC CTGCCGTACC ACTGGGCGTG GGGCAGCGGG GGCCTGGTTG TCGGCGACGT CACCAACGAC CTGATGCCCC TGGTGCTCGA CCCGAACGTC TACATCCAGG AGGGCAAGGC CGTGACCTGT GATCTGCGAG CCGGCAGGCG TCCCCGGGGG AAGGCCCTGC TCGACCTGAT CGAGGAGTAC CGCCATGACT GA
|
Protein sequence | MAHGGWISRW PVIRQLRGED ARGDAARSAR TDALRPRTED ADTVARSICP YCAVGCGQLV YVKDGEVTQI EGDPDSPISR GRLCPKGSAS KQLVTHPGRQ TRVLYRRPHG TEWEPLDLET AMDMIADRVV RTRRETWQQE EDDKVVRRTM GIAGLGGATL DDEENYLMKK LYTALGAIQV ENQARIUHSS TVPGLGTSFG RGGATDFQQD LVESDCIVIQ GSNMAECHPV GFQWVVEARA RGAKVFHVDP RYTRTSALAD KHVPIRAGSD IVLLGALVNH VLVNELDFRE YVLAYTNAST IISEHFRDTE DLDGLFSGFD AEHRKYDPSS WAYEGAREEA AAGLRSEQLE GGKEHAEVAG AEEYGSGGAA MLGRPRTDPT LTHPRCVYQI LKRHFARYTP DLVAELCGIS REDFAELADA ITANSGRERT TAWAYAVGWT QHSVGVQYIR TAAILQLLLG NMGRPGGGIM ALRGHASIQG STDIPTLFNI LPGYLPMPHA HRHQRLDDYL AEAGADTGFW GRKRSYMVSL LKAWWGEAAT EENDFCFGHL PRLTGDHGHY TTVMAQIEGT VKGYFVVGEN PAVGSSGGRA QRLGLANLDW LVVRDLTLVE TATFWKDGPE LETGEMRTED IATEVFFLPA ASHVEKEGTF TNTQRLLQWR EKALDPPGDC RSDLSFYYHL GKRIRQRLSD DEIDRPLREL TWDYPEEGEH RDPSAEAVLR EINGTGPGGR ALSAYTELKP DGSTRCGCWI YCGVYADEVN QAARRKPGRE QNRVALEWGW AWPANRRILY NRASADPEGR PWSERKAYVW WDAEKGEWTG DDVPDFERDK PPDYLPPEGA RAEEALAGTD PFIMQADGKG WLFAPAGLAD GPLPAHYEPH ESPVHNPLYG QQANPARKVY RSPAAPYNPP ESPRFPYVLT TYRLTEHHTA GAMSRPLSHL AELQPELFCE ISPQLAAEVG VVNGGWATIV TTRTAIEARV LVTERVRPLR VEGRLIHQVG LPYHWAWGSG GLVVGDVTND LMPLVLDPNV YIQEGKAVTC DLRAGRRPRG KALLDLIEEY RHD
|
| |
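The protein sequence above contains a U (selenocysteine) at residue 187, which lines up with an in-frame TGA codon in the gene sequence. The sketch below illustrates this on a 30-nucleotide fragment copied from the gene sequence (nucleotides 541 to 570, residues 181 to 190). The function name translate and its recode_tga flag are hypothetical names chosen for this example. The codon table is the standard amino acid assignment, which translation table 11 shares; a plain translation reads TGA as a stop.

```python
# Standard codon assignments (shared by translation table 11), built from the
# conventional T, C, A, G ordering of first, second and third codon positions.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, recode_tga=False):
    """Translate an in-frame DNA string codon by codon.

    recode_tga is a hypothetical switch for this example: when True, TGA is
    written as U (selenocysteine) instead of * (stop).
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if recode_tga and codon == "TGA":
            protein.append("U")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# Nucleotides 541-570 of the gene sequence, which start on a codon boundary.
fragment = "GAGAACCAGGCGCGTATTTGACACTCCTCC"

print(translate(fragment))                   # ENQARI*HSS
print(translate(fragment, recode_tga=True))  # ENQARIUHSS
```

The second output matches residues 181 to 190 of the protein sequence in the record. Applying recode_tga=True to the full gene would also turn the terminal TGA into U, so a real pipeline would need to treat the final codon separately.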