Gene Sros_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3814 
Symbol 
ID8667104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4256676 
End bp4258484 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content66% 
IMG OID 
ProductCytochrome-c3 hydrogenase 
Protein accessionYP_003339477 
Protein GI271965281 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.344269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0147903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCT CACCGAGATC GGGGCAGAGC GGCCCGCAGG ACCAGCCACG CGAGCTGGTC 
GAGATGTCGT GGGACCCCAT CACGCGCATC GTGGGGAGCC TGGGCATCTA CTGCAAGGTC
GACTTCAAGA ACCGGGAGGT GGCCGAGTGT TACAGCACCT CGTCGATCTT CCGCGGCTAC
AGCATCTTCA TGAAGGGCAA GGATCCGCGC GACGCGCACT TCATCACCAG CCGGATCTGC
GGGATCTGCG GTGACAACCA CGCCACCTGT TCGGTCTACG CCCAGAACAT GGCCTACGGC
GCCCGCCCGC CGGCCCTGGG GGAGTGGATC CTCAACTGCG GCGAGGCCGC CGAGTACATG
TTCGACCACA ACATCTACCA GGAGAACCTG GTCGGGGTCG ACTACTGCGA GAAGATGGTG
CGCGAGACCA ACCCCGGCGT GCTGGCCAAG GCCGAGCGGA CCGAGGCCCC GCACGCGGCC
GACCACGGCT ACAAGACCAT CGCCGACATC ATGCGCTCCC TCAACCCGCT CGAAGGCGAG
TTCTACCGCG AGGCCCTGGC GGTGAGCCGT ACCACCAGGG AGATGTTCTG CCTGATGGAG
GGGCGGCACG TGCACCCCTC CACCCTCTAC CCCGGCGGGG TGGGCACGGT CGCGACGGTG
CAGCTCTTCA CCGACTACCT CAGCCGGCTG ATGCGCTACG TGGAGTTCAT GAAGCGGGTC
GTGCCCATGC ACGACGACCT GTTCGACTTC TTCTACGACG CGCTGCCCGG CTACGAGGAG
GTCGGCCGCA GGCGCGTGCT GCTGGGCTGC TGGGGCAGCT TCCAGGACCC GGAGCACTGT
GACTTCACCT ACGAGAACAT GGCCTCCTGG GGTAAGCGGA TGTTCGTCAC CCCGGGGGTG
ATCGTCGACG GCAGGCTCGT CACGGACGAC CTGGTCGACA TCAACCTCGG CATCCGGATC
CTGCTGGGCA GCTCCTACTA CGACGACTGG CAGGGGCAGG AGCCGTTCGT CACCCACGAC
CCGCTGGGCA ACCCGGTCGA CATCCGTCAC CCGTGGAACC AGCACACCAT CCCGCGCCCG
CAGAAGCGCG ACTTCACCGA CAAGTACAGC TGGGTGATGT CACCGCGCTG GTTCGACGGC
AAGGACATGC TGGCGCTGGA CACCGGCGGC GGCCCGATCG CGCGCCTGTG GTCGACCGCG
CTGTCGGGGA AGGTCGACAT CGGCTACGTC AGGGCGACTG GGCACAGCGT GGAGATCAAC
CTGCCGAAGA CGGCCACCAA GCCCGAGAAG ACGTTCGAGT GGAAGATCCC CGAGCGCAAC
GGCGAGCTGA TGTCCAACGC GCTGGAACGC AACCGGGCCC GCACCTACTT CCAGGCCTAC
GCGGCGGCGT GCGCGCTGTA CTTCGCCGAG CAGGGCCTGG CCGAGGTGCG CGCGGGACGG
ACCCAGACCT GGACGCCGTT CACGGTGCCG GAGGAGGCGA TCAGCTGCGG GTTCACCGAG
GCGGTGCGGG GCGTGCTGTC GCACCACATG GTGATCAGGA ACGGCAAGAT CGCCAACTAC
CACCCGTATC CGCCCACGCC CTGGAACGCC AGCGTCCGCG ACGTCAACGG GGTGCCGGGG
CCGTACGAGG ACGCGGTGCA GAACACCCCG ATCTTCGAGG AGAACTCCCC GGAGAACTTC
AAGGGCATCG ACATCATGCG CACCGTGCGT AGCTTCGACC CCTGCCTACC GTGCGGGGTC
CACATGTATC TCGGTGACGG CAAGGAACTA CGCAAGCTGC ACAGCCCCCA TGCCGCCAGC
ACCCTGTGA
 
Protein sequence
MKTSPRSGQS GPQDQPRELV EMSWDPITRI VGSLGIYCKV DFKNREVAEC YSTSSIFRGY 
SIFMKGKDPR DAHFITSRIC GICGDNHATC SVYAQNMAYG ARPPALGEWI LNCGEAAEYM
FDHNIYQENL VGVDYCEKMV RETNPGVLAK AERTEAPHAA DHGYKTIADI MRSLNPLEGE
FYREALAVSR TTREMFCLME GRHVHPSTLY PGGVGTVATV QLFTDYLSRL MRYVEFMKRV
VPMHDDLFDF FYDALPGYEE VGRRRVLLGC WGSFQDPEHC DFTYENMASW GKRMFVTPGV
IVDGRLVTDD LVDINLGIRI LLGSSYYDDW QGQEPFVTHD PLGNPVDIRH PWNQHTIPRP
QKRDFTDKYS WVMSPRWFDG KDMLALDTGG GPIARLWSTA LSGKVDIGYV RATGHSVEIN
LPKTATKPEK TFEWKIPERN GELMSNALER NRARTYFQAY AAACALYFAE QGLAEVRAGR
TQTWTPFTVP EEAISCGFTE AVRGVLSHHM VIRNGKIANY HPYPPTPWNA SVRDVNGVPG
PYEDAVQNTP IFEENSPENF KGIDIMRTVR SFDPCLPCGV HMYLGDGKEL RKLHSPHAAS
TL