Gene Sros_3728 details

Gene Information

Locus tag: Sros_3728
Symbol:
ID: 8667016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4135822
End bp: 4139013
Gene Length: 3192 bp
Protein Length: 1063 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_003339394
Protein GI: 271965198
COG category:
COG ID:
TIGRFAM ID:
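
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4135822, 4139013)
print(length)                  # 3192
print(protein_length(length))  # 1063
```

Both results agree with the Gene Length and Protein Length fields listed in the record.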


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0214561
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0673651
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCACACG GGGGATGGAT CTCCCGCTGG CCGGTGATCC GCCAGCTCAG GGGAGAGGAT 
GCCAGGGGCG ACGCCGCCCG GTCGGCGCGG ACCGACGCGC TGCGGCCCCG CACCGAGGAC
GCCGACACGG TCGCGCGATC GATCTGCCCC TACTGCGCGG TCGGATGCGG CCAGCTCGTC
TACGTCAAGG ACGGCGAGGT CACCCAGATC GAGGGCGACC CCGACTCGCC GATCTCGCGC
GGACGGCTCT GCCCCAAGGG ATCGGCGAGC AAGCAGCTCG TCACCCATCC CGGCAGGCAG
ACGCGCGTGC TCTACCGGCG GCCGCACGGC ACCGAGTGGG AGCCGCTGGA CCTCGAGACC
GCCATGGACA TGATCGCCGA CCGGGTGGTC CGGACCCGGC GGGAGACCTG GCAGCAGGAG
GAGGACGACA AGGTCGTACG GCGCACCATG GGCATCGCCG GCCTGGGCGG GGCGACGCTG
GACGACGAAG AGAACTACCT GATGAAGAAG CTGTACACCG CCCTCGGCGC GATCCAGGTG
GAGAACCAGG CGCGTATTTG ACACTCCTCC ACCGTCCCCG GTCTGGGGAC CAGCTTCGGG
CGTGGCGGCG CGACGGACTT CCAGCAGGAC CTGGTGGAGT CCGACTGCAT CGTCATCCAG
GGCTCCAACA TGGCCGAGTG CCATCCGGTG GGGTTCCAGT GGGTGGTCGA GGCCAGGGCG
CGGGGAGCGA AGGTCTTCCA CGTCGATCCG CGCTACACGC GCACCAGCGC GCTCGCCGAC
AAGCACGTGC CGATCCGGGC CGGGAGTGAC ATCGTCCTGC TCGGCGCGCT GGTCAACCAC
GTGCTCGTCA ACGAGCTCGA CTTCCGCGAG TACGTCCTGG CCTACACCAA CGCCTCGACG
ATCATCTCCG AGCACTTCCG GGACACCGAG GACCTGGACG GCCTGTTCTC CGGTTTCGAC
GCCGAGCACC GCAAGTACGA CCCGTCGAGC TGGGCCTACG AGGGCGCCCG GGAGGAGGCC
GCCGCCGGGC TGCGCAGCGA GCAGCTGGAG GGCGGGAAGG AACACGCCGA GGTCGCGGGG
GCCGAGGAGT ACGGCTCGGG CGGCGCCGCG ATGCTCGGCA GGCCGCGCAC CGATCCGACG
CTGACCCATC CGCGCTGCGT CTACCAGATC CTCAAGCGGC ACTTCGCCCG CTACACCCCG
GACCTGGTGG CCGAGCTGTG CGGCATCTCC CGTGAGGACT TCGCCGAGCT CGCCGACGCG
ATCACCGCGA ACTCCGGCCG GGAGCGCACC ACCGCCTGGG CCTACGCGGT CGGCTGGACC
CAGCACTCGG TGGGCGTGCA GTACATCCGC ACGGCCGCGA TCCTGCAACT GCTGCTCGGC
AACATGGGTC GCCCCGGCGG CGGCATCATG GCGCTGCGCG GCCACGCCAG CATCCAGGGC
TCCACCGACA TCCCGACGCT GTTCAACATC CTGCCGGGAT ACCTGCCGAT GCCCCACGCC
CACCGGCACC AGAGGCTGGA CGACTACCTC GCCGAGGCCG GAGCCGACAC CGGGTTCTGG
GGCAGGAAGC GCTCCTACAT GGTGAGCCTG CTCAAGGCGT GGTGGGGCGA GGCGGCCACC
GAGGAGAACG ACTTCTGCTT CGGTCACCTG CCGCGGCTCA CCGGCGACCA CGGCCACTAC
ACGACCGTGA TGGCGCAGAT CGAGGGCACC GTGAAGGGCT ACTTCGTGGT GGGGGAGAAC
CCCGCGGTCG GCTCCTCCGG AGGCCGGGCG CAGCGCCTCG GGCTGGCCAA CCTCGACTGG
CTGGTGGTGC GCGACCTGAC GCTGGTGGAG ACGGCCACGT TCTGGAAGGA CGGCCCCGAG
CTGGAGACCG GGGAGATGCG CACCGAGGAC ATCGCCACCG AGGTGTTCTT CCTGCCCGCC
GCGAGCCACG TGGAGAAGGA GGGCACCTTC ACCAACACCC AGCGGCTGCT GCAGTGGCGG
GAGAAGGCGC TCGACCCGCC GGGCGACTGC CGTAGCGACC TGTCGTTCTA CTACCACCTG
GGCAAGCGCA TCAGGCAGCG GCTGTCCGAC GACGAGATCG ACCGGCCACT GCGCGAGCTG
ACCTGGGACT ACCCCGAGGA GGGCGAGCAC CGGGACCCCT CGGCTGAGGC CGTGCTGCGC
GAGATCAACG GCACCGGCCC CGGCGGGCGG GCGCTGTCGG CCTACACCGA GCTCAAGCCC
GACGGCTCCA CCCGGTGCGG CTGCTGGATC TACTGCGGGG TCTACGCCGA CGAGGTCAAC
CAGGCGGCCC GGCGCAAGCC GGGGCGCGAG CAGAACCGGG TCGCGCTCGA ATGGGGCTGG
GCGTGGCCGG CCAACCGCCG CATCCTCTAC AACCGCGCCT CCGCCGATCC CGAGGGCCGC
CCGTGGAGCG AGCGCAAGGC CTACGTCTGG TGGGACGCGG AGAAGGGGGA GTGGACCGGG
GACGACGTGC CGGACTTCGA GAGGGACAAG CCGCCGGACT ACCTGCCGCC CGAGGGGGCC
AGAGCCGAGG AGGCGCTGGC CGGGACCGAC CCGTTCATCA TGCAGGCCGA CGGCAAGGGC
TGGCTGTTCG CTCCCGCCGG GCTGGCCGAC GGGCCGCTGC CGGCGCACTA CGAGCCGCAC
GAGTCGCCGG TGCACAACCC GCTGTACGGC CAGCAGGCCA ACCCCGCGCG CAAGGTCTAC
CGGAGCCCCG CGGCCCCCTA CAACCCGCCG GAGTCGCCGC GCTTCCCGTA CGTGCTCACC
ACCTACCGGC TCACCGAGCA CCACACGGCG GGCGCCATGA GCCGGCCGCT GTCCCACCTC
GCCGAACTGC AGCCCGAGCT GTTCTGCGAG ATCTCCCCGC AGCTCGCCGC CGAGGTCGGG
GTGGTCAACG GGGGCTGGGC GACGATCGTG ACGACGCGTA CCGCGATCGA GGCCCGCGTC
CTGGTGACCG AGCGCGTCCG CCCGCTCCGG GTCGAGGGCA GGCTGATCCA CCAGGTCGGC
CTGCCGTACC ACTGGGCGTG GGGCAGCGGG GGCCTGGTTG TCGGCGACGT CACCAACGAC
CTGATGCCCC TGGTGCTCGA CCCGAACGTC TACATCCAGG AGGGCAAGGC CGTGACCTGT
GATCTGCGAG CCGGCAGGCG TCCCCGGGGG AAGGCCCTGC TCGACCTGAT CGAGGAGTAC
CGCCATGACT GA
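
The sequence above is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, assuming the sequence has been pasted in as plain text. It shows how the layout might be stripped and how a GC fraction, like the one in the GC content field, could be computed. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display; uppercase the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, used here only as sample input.
first_line = "ATGGCACACG GGGGATGGAT CTCCCGCTGG CCGGTGATCC GCCAGCTCAG GGGAGAGGAT"
print(len(clean_sequence(first_line)))   # 60 bases per display line
print(round(gc_content(first_line), 2))  # GC fraction of this line only
```

Running `gc_content` over the full gene sequence rather than a single line is what would be comparable to the 71% figure reported in the record.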
 
Protein sequence
MAHGGWISRW PVIRQLRGED ARGDAARSAR TDALRPRTED ADTVARSICP YCAVGCGQLV 
YVKDGEVTQI EGDPDSPISR GRLCPKGSAS KQLVTHPGRQ TRVLYRRPHG TEWEPLDLET
AMDMIADRVV RTRRETWQQE EDDKVVRRTM GIAGLGGATL DDEENYLMKK LYTALGAIQV
ENQARIUHSS TVPGLGTSFG RGGATDFQQD LVESDCIVIQ GSNMAECHPV GFQWVVEARA
RGAKVFHVDP RYTRTSALAD KHVPIRAGSD IVLLGALVNH VLVNELDFRE YVLAYTNAST
IISEHFRDTE DLDGLFSGFD AEHRKYDPSS WAYEGAREEA AAGLRSEQLE GGKEHAEVAG
AEEYGSGGAA MLGRPRTDPT LTHPRCVYQI LKRHFARYTP DLVAELCGIS REDFAELADA
ITANSGRERT TAWAYAVGWT QHSVGVQYIR TAAILQLLLG NMGRPGGGIM ALRGHASIQG
STDIPTLFNI LPGYLPMPHA HRHQRLDDYL AEAGADTGFW GRKRSYMVSL LKAWWGEAAT
EENDFCFGHL PRLTGDHGHY TTVMAQIEGT VKGYFVVGEN PAVGSSGGRA QRLGLANLDW
LVVRDLTLVE TATFWKDGPE LETGEMRTED IATEVFFLPA ASHVEKEGTF TNTQRLLQWR
EKALDPPGDC RSDLSFYYHL GKRIRQRLSD DEIDRPLREL TWDYPEEGEH RDPSAEAVLR
EINGTGPGGR ALSAYTELKP DGSTRCGCWI YCGVYADEVN QAARRKPGRE QNRVALEWGW
AWPANRRILY NRASADPEGR PWSERKAYVW WDAEKGEWTG DDVPDFERDK PPDYLPPEGA
RAEEALAGTD PFIMQADGKG WLFAPAGLAD GPLPAHYEPH ESPVHNPLYG QQANPARKVY
RSPAAPYNPP ESPRFPYVLT TYRLTEHHTA GAMSRPLSHL AELQPELFCE ISPQLAAEVG
VVNGGWATIV TTRTAIEARV LVTERVRPLR VEGRLIHQVG LPYHWAWGSG GLVVGDVTND
LMPLVLDPNV YIQEGKAVTC DLRAGRRPRG KALLDLIEEY RHD
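
The protein sequence contains a `U` at residue 187 (`ENQARIUHSS`). `U` is the one-letter code for selenocysteine, and the corresponding codon in the gene sequence is an in-frame `TGA`, which translation table 11 would otherwise read as a stop. The following is a minimal sketch, with a hypothetical helper name, of how in-frame stop codons could be located in a reading frame. It uses the fourth display line of the gene sequence's second block of 180 codons as input, that is, the first 30 bases after codon 180.

```python
from typing import List

STOP_CODONS = {"TAA", "TAG", "TGA"}


def stop_codon_positions(seq: str) -> List[int]:
    """Return 1-based codon indices of stop codons, reading in frame from base 1."""
    seq = "".join(seq.split()).upper()
    return [
        i // 3 + 1
        for i in range(0, len(seq) - 2, 3)
        if seq[i:i + 3] in STOP_CODONS
    ]


# Codons 181-190 of the gene (the first 30 bases of the tenth display line).
fragment = "GAGAACCAGG CGCGTATTTG ACACTCCTCC"
local = stop_codon_positions(fragment)
print(local)                    # [7]
print([180 + p for p in local])  # [187]
```

Position 187 matches the location of `U` in the protein sequence, which is consistent with this `TGA` being read as selenocysteine rather than as a stop.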