Gene Sros_9074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9074 
Symbol 
ID8672420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10012076 
End bp10013989 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content73% 
IMG OID 
Product2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1 7-dioic acid hydratase (catechol pathway)-like protein 
Protein accessionYP_003344440 
Protein GI271970244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCTTCG TGACCTATGC CGCCGAGGAC GGTGACCGTG CCGGCGTCCT CGACGGCGAC 
CTGATCCACG CCTTTCCCCC GGGGACGACC CTGCTCGGAC TGCTCGGCTC CGGGCTGCGG
CAGGCGGGCG AGCAGGCGCT CGCCGAACCG GACGAGGTCG TGCCGCTGTC CGACGTCATG
GTGCGGGCCC CCATCCCCAG GCCCCCGTCC ATCCGGGACT GCCTGTGCTT CCTCGACCAC
ATGCGCGGCT GCCTGAAAGC CACCGGTGGC ACGGGGGACC TGGAGCCCAC CTGGTACCAG
ATCCCGGCTT TCTACTTCGC CAACCCCGCC ACCGTCATCG GACCCCACGA CGACGTACCC
ATCTCGCCGG GCAGCGCATG GTTCGACTTC GAGCTGGAGA TCGGCGCGGT GATCGGCACC
GCGGGCCGCG ACCTGACCCC CGAGCAGGCC GAGGAGCACA TCGCCGGCTA CACCCTGATG
TGCGACTGGA GCGCCCGCGA CCTGCAGGGC CTGGAGAGCC AGCTCAAGAT CGGCCAGGCG
AAGGGCAAGG ACGGGGCGAC CACGCTCGGC CCCTGGCTCG TCACCCCCGA CGAGCTGCCC
GCCGGCCTCG CCGTCGCGGT GCGCGCCGAG GTCAACGGCG TCACCGTCGG CGAAGGCCGC
GCCGACGCGA TGGACTGGTC CTTCGGCGAG GTGATCTCCT ACGCCTCCCG CGGCGCCGAG
CTCCAGCCGG GCGACGTCTT CGGCTCCGGC ACCGTGCCCG GCTGCTGCCT CATCGAGCAC
CTCAGCTTCG CCGACCTGGC CGCCTTCCCC GGCTGGCTCA AGGACGGCGA CGTGGTACGG
CTGAGCGCCG AGGGACTGGG CGAGATCCGG CAGACCGCGC GGGCTTCGGC CGCACCGTAC
CCGCTGGCCG CCCGGCCCGA TCCCGCCGCC GGACCACGAC GACCGCGCCG CAACCCGGCC
CCCTCCGCCC TGCCGTACAC GGCGGGGCTC CACCAGGTCG GCGAGGGCGT CTGGGCGTGG
CTGCTACCCG ACGGCGGCTA CGGCAGGAGC AACGCGGGCC TGGTCACGGG CGAGGGTGCG
TCCCTGCTGG TCGACACCCT CTACGACCTG TCACTGACCG GGGAGATGCT GGACGGCATG
CGGGTCGTCA CCGATCGGCA CCCGCTGACC CACGCCGTGC TCACCCACGC CGACGGCGAC
CACACCCACG GCGGCGGGCT CCTGCCCGCC CAGGTGCGCG TGATCACCGC CGAGGGGACC
GCGCATGGGA TGCGCACCGA GATGCCGCCG GAGCTGACGG CGGCGCTGCA GGTGATGGAC
CTCGGGCCGG TGCTCACGCC GTACATGCGC GAGCGCTTCG GCGGCTTCGA CTTCGGGGGC
ATCCGCCTGC GCGAGCCTGA CCAGACCTTC CAGCGGCGGC TCACCCTCGA CGTGGGCGGC
CGCGAGGTGC GCCTGCTCGA CCTCGGGCCG GCCCACACCG AGGCCGACAC GGTGGTCCAC
GTACCGGACG CGGGCGTGCT GTTCGCCGGC GACCTGCTGT TCATCGGATG CACGCCGATC
GTGTGGAGCG GCCCCATCGC CAACTGGATC TCCGCCTGCG ACACGATGCT CGCCCTGGAC
GCGCCGACCG TCGTCCCCGG CCACGGCCCG GTGACGGACC CGGACGGGAT CCGCGCCGTG
CGCGCCTACC TCGCCCACGT CGTCGAGCAG GCCGACCTCG CCCACGCCAA GGGCCTGAAC
CTGCGAGAGG CGGCGTTCGC CGCCGACCTG GCCGACTACG CCTCCTGGCT GGACGCCGAG
CGGATCGTGG TCAACATCTA CCGGCGCTAC CGGGAGATCG ATCCCGAGCA GCCCGTGCTC
GACAGGTTCG CGCTGTTCGC CCTGATGGCC GAATGGGAGG CCGCCCGCTC ATGA
 
Protein sequence
MRFVTYAAED GDRAGVLDGD LIHAFPPGTT LLGLLGSGLR QAGEQALAEP DEVVPLSDVM 
VRAPIPRPPS IRDCLCFLDH MRGCLKATGG TGDLEPTWYQ IPAFYFANPA TVIGPHDDVP
ISPGSAWFDF ELEIGAVIGT AGRDLTPEQA EEHIAGYTLM CDWSARDLQG LESQLKIGQA
KGKDGATTLG PWLVTPDELP AGLAVAVRAE VNGVTVGEGR ADAMDWSFGE VISYASRGAE
LQPGDVFGSG TVPGCCLIEH LSFADLAAFP GWLKDGDVVR LSAEGLGEIR QTARASAAPY
PLAARPDPAA GPRRPRRNPA PSALPYTAGL HQVGEGVWAW LLPDGGYGRS NAGLVTGEGA
SLLVDTLYDL SLTGEMLDGM RVVTDRHPLT HAVLTHADGD HTHGGGLLPA QVRVITAEGT
AHGMRTEMPP ELTAALQVMD LGPVLTPYMR ERFGGFDFGG IRLREPDQTF QRRLTLDVGG
REVRLLDLGP AHTEADTVVH VPDAGVLFAG DLLFIGCTPI VWSGPIANWI SACDTMLALD
APTVVPGHGP VTDPDGIRAV RAYLAHVVEQ ADLAHAKGLN LREAAFAADL ADYASWLDAE
RIVVNIYRRY REIDPEQPVL DRFALFALMA EWEAARS