Gene Sros_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3809 
Symbol 
ID8667099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4251400 
End bp4253727 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content74% 
IMG OID 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_003339472 
Protein GI271965276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0483302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0563214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG GAGCGGCGCG GACCGAGCCT CGGACGCGGG TCAGGATCCG GGTCGAAGGG 
ATCGTGCAGG GGGTGGGTTT CCGTCCCCAC GTCCACTCCC TGGCCCGGCG GCTGGCCCTG
TCGGGGTGGG TGGGGAACGA CAGCGAGGGC GTGTTCATCG AGGCGGAGGG GACGCGGGAG
AACGTCACGC GTTTCCAGGA GGATCTGCGG GGCCACGCGC CGCCGCTGGC GGTGATCCAC
CGCATCACCG TCACCGCCGT GCCCGCCCTC GGTGACCACG CTTTCCTCAT CGCCGGCAGC
AGGGAGGGGG GCGCGCGGGC GGCACTGATC TCGCCCGACG TCGCGACGTG CGCGGACTGC
CTGGCCGAGC TGTCCGACCC CGCCGACCGC CGTTACCGCC ATCCCTTCAT CAACTGCACC
AACTGCGGGC CGAGGTTCAC CGTCATCCGC GACGTCCCCT ACGACCGGCC GGCCACGACG
ATGGCCGGGT TCGCCATGTG CGAGGAGTGC ACGGCCGAAT ACCTCGATCC GCACGACCGC
AGGTTCCACG CCCAGCCGAC CTGCTGCCCG GCCTGCGGTC CGGCGCTGCG GCTGCTCGGC
GCCGGCGGCC GCGCCCACCA CCCCGGCGGC GCCGGGGCCG ACGGCGGCGG CGGTCGCGAT
CCGATCGGGC TCGCGGCCGA GGTGGTGCGG TCGGGCGGCG TCCTGGCCGT CAAGGGCCTC
GGCGGCTATC ACCTGGCGGC CGACGCCACC GACGAGGCCG CCGTGCGGGC CCTGCGGTCG
CGCAAGCACC GCGAGGACAG GCCGTTCGCC GTCATGGTCG CCGACCTCGA CGCCGCGCGC
ACGCTGTGCG AGCTGGACCA GGTGGCCGAA CGGCTGCTCA CCGGCCCCCG GCGGCCGATC
GTGCTGCTGC CCCGCCGCCC CGGCGCCCCG CTGGCCGAGG CCGTGGCACC GGGCGACCGC
CGCCTCGGCG TCATGCTGCC CTACACGCCC ATGCACCACC TTCTCGCCGG AGAGATCGCC
CGTCCGTACG TCCTGACCAG CGGGAACCTC TCCGACGAGC CCATCGCCTA CCGGGACCGC
GACGCGTTCA CCCGCCTCGC CGGCATCGCC GACCGCTTCC TCACCCACGA CCGCCCGATC
CACGTCCGCA CCGACGACTC GGTGGTCCTC GCGGCAGGGG GGCGCGAGCT CCCGCTGCGG
AGATCACGGG GATACGCCCC CGAGCCGCTC CGGCTCCTCC GTCCCGTACG GCGGGCGGTG
CTGGCGTGCG GCGCGGAGCT GAAGAACACC TTCTGCCTCG CCGAGGGAGG ACGCGCCTTC
GTCTCCCACC ACATCGGCGA CCTGGAGAAC TACGAGACGC TGCGCTCCTT CAGCGAGGGG
ATCGACCATT TCCGGCGGCT GTTCGACATC ACGCCCCGGG TTGTCGCGCA CGACCTCCAC
CCGGAGTACC TGTCCACCAA ATACGCCCAC GACATGGACG GAGTCGACCT GGTCGGGGTC
CAGCACCACC ACGCGCACAT CGCCTCCTGC CTGGCCGACA ACCAGGAGGC CGGCCCAGTC
ATCGGAGTGG CCTTCGACGG CCTCGGCTAC GGCGCCGACG GCACCCTGTG GGGCGGCGAG
CTCCTCGTGG CCGACCTGAC CGGCTTCACC CGGGCCGGAT GCCTGGCACC GGTGCCGCTG
CCCGGCGGCA CGGCGGCGAT CAGGCAACCC TGGCGGATGG CCGCCGCGCA TCTGGACGCG
GCCTACGACG GCACGCCACC CGGCGATCTC CAGGTGATCT CCCGCCACCG GGACTGGGAC
GACGTGGTGG CCGTGGCCAG ATCCGGCGTG AACTCCCCCC TCACCTCCAG CGCCGGGCGG
CTGTTCGACG CCGTCGCGGC GATCCTCGGA CTCCGCGACA CCGTCACCTA CGAGGGGCAG
GCCGCGATCG CGCTGGAACA GCGGGCCGAT CCCGCCGAGG AGTCCGCCTA CCCGGCCCGG
CTCCACGGCG GCGACGGCGA GCTGCTGACG ATCCGGACCG GCGACCTCAT CCGGGCCGTC
GTGGAGGACC TGCGTGCCGG CGCCGACCCG GCGGTCGTCT CCGCCCGCTT CCACAACGGG
CTCGCCGCCG CCACCGCCGC GAGCTGCGCG CGACTCCGCT CGTCCACCGG CGTCGGCACG
GTCGCCCTGT CCGGCGGCGT CTTCCAGAAC CAGCTACTTC TCGGCAGGCT CGTCCAGGCG
CTCCGGCTCC GGGACTTCCG GGTGCTGACC CACCATCGCG TCCCGCCCAA CGACGGGGGC
ATCAGCTTCG GCCAGGCCGC CGTGGCCGCC GCGCGCGATC TCCTCTGA
 
Protein sequence
MSTGAARTEP RTRVRIRVEG IVQGVGFRPH VHSLARRLAL SGWVGNDSEG VFIEAEGTRE 
NVTRFQEDLR GHAPPLAVIH RITVTAVPAL GDHAFLIAGS REGGARAALI SPDVATCADC
LAELSDPADR RYRHPFINCT NCGPRFTVIR DVPYDRPATT MAGFAMCEEC TAEYLDPHDR
RFHAQPTCCP ACGPALRLLG AGGRAHHPGG AGADGGGGRD PIGLAAEVVR SGGVLAVKGL
GGYHLAADAT DEAAVRALRS RKHREDRPFA VMVADLDAAR TLCELDQVAE RLLTGPRRPI
VLLPRRPGAP LAEAVAPGDR RLGVMLPYTP MHHLLAGEIA RPYVLTSGNL SDEPIAYRDR
DAFTRLAGIA DRFLTHDRPI HVRTDDSVVL AAGGRELPLR RSRGYAPEPL RLLRPVRRAV
LACGAELKNT FCLAEGGRAF VSHHIGDLEN YETLRSFSEG IDHFRRLFDI TPRVVAHDLH
PEYLSTKYAH DMDGVDLVGV QHHHAHIASC LADNQEAGPV IGVAFDGLGY GADGTLWGGE
LLVADLTGFT RAGCLAPVPL PGGTAAIRQP WRMAAAHLDA AYDGTPPGDL QVISRHRDWD
DVVAVARSGV NSPLTSSAGR LFDAVAAILG LRDTVTYEGQ AAIALEQRAD PAEESAYPAR
LHGGDGELLT IRTGDLIRAV VEDLRAGADP AVVSARFHNG LAAATAASCA RLRSSTGVGT
VALSGGVFQN QLLLGRLVQA LRLRDFRVLT HHRVPPNDGG ISFGQAAVAA ARDLL