Gene Sros_8972 details

Gene Information

Locus tag: Sros_8972
Symbol:
ID: 8672314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9918636
End bp: 9920084
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: pyruvate dehydrogenase E2
Protein accession: YP_003344346
Protein GI: 271970150
COG category:
COG ID:
TIGRFAM ID:
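
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 9918636
END_BP = 9920084


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1449
print(protein_length(length_bp))  # 482
```

Under those assumptions both results match the Gene Length and Protein Length fields.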


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGT TCAAGCTCCC CGACGTGGGT GAAGGCCTGA CCGAGGCGGA GATCGTCCGC 
TGGCACGTGA AGGCCGGCGA CCCCGTCAAG GTCAACCAGA TCATCGTGGA GATCGAGACC
GCCAAGGCCG TCGTCGAGCT GCCCTGCCCG TTCGAGGGCG TGGTGGCGGC CCTGATGGCC
GACGAGGGCG AGACCGTGGA CGTCGGCAGG CCGATCATCT CGGTCGACGA CGGGACCGGC
ACGGACCCCG CTCCCTCCGC CGCCCCCGGC CCCGCCCCCG AACGCGGGCA GGCCCTGGCC
GAGGACATGG TGCCCGCCCT CCCGAAGGAG GAGCGCCAGC CGGTCCTGGT GGGCTACGGC
GTCAAGATGG GCGCGGCCAA ACGCCGTCCC CGCAAATCGG CGCCCACCCC GCCCGCGGGC
TCGCCGGCTC CGCGGAGCGT CCAGCCGTCC GCGCGGGAGG ACGCCGGGCC CGCCGCCGGC
GAGGCCGCCG GGCCGTTCAC CGGGGAGAGC GCCGCGCCCT CCGTGCGGGA GGACGCCCGG
GAGAACGGGA CTGCCGCCGG TGCCGCGCCC GCCTCCGGGG GACGCGTCGC GACGCTGGCC
AAGCCTCCCG TGCGGAAGCT GGCCAAGGAT CTCGGAGTCG ACCTGACCAC GCTCACCGGG
AGCGGCCCGC AGGGCTCGAT CACCCGGGAC GACGTCCAGT CGGCGGTCGG GGCGGTCTCC
GCTCCGGTTG CGGTGCCGGC CGTACGGGCG GGTGAGGAGC GGATCCCGGT CAAGGGTGTG
CGCAGGGCGA CCGCGCAGGC CATGGTGGCC TCGGCGTTCA CCGCGCCGCA TGTCACCGAG
TTCCTCCAGG TGGACGTCAC CGAGACCATG GACGCGGTCG GGCGGCTGCG ACGGTTGCCC
GACTTCGCCG AGGTCAAGGT GTCGCCGTTG CTCCTGGTGG CGAAGGCGGT GCTCGTCGCC
GCGCGCCGGT ATCCGATGAT CAACTCGGCG TGGGACGAGG CCGCGCAGGA GATCGTGGTC
AAGCACTACG TGAACCTGGG CATCGCCGCG GCCACCCCGC GCGGCCTGCT CGTCCCCAAC
GTCAAGGACG CCCACGCGAT GTCCCTCCCG GACCTCGCCA GGGCCCTCGG CGCCCTCGCC
GAGACCGCCC GTGCGGGCCG CACCCAGCCC GCCGACATGG CCGGCGGCAC GATCACGATC
ACCAATGTGG GCGTGTTCGG GGTGGACGCG GGCACTCCGA TCCTCAACCC CGGGGAGTCG
GTCATCCTGG CCTTCGGGCA GGTCAGGGAC ATGCCGTGGG TGGTGGACGG GCAGATCGTG
CCACGCAGGG TCTGCACGCT GGCGCTGTCG TTCGACCACC GGGTCGTGGA CGGGGAGCTC
GGCTCGCTCT TCCTCCGTGA CGTCGGCGCC ATGCTGGAGG ACCCGCTCCG CATGCTCGCC
TGGAACTGA
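
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input; pasting the full sequence works the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases), used only as sample input.
FIRST_LINE = "ATGAAGCAGT TCAAGCTCCC CGACGTGGGT GAAGGCCTGA CCGAGGCGGA GATCGTCCGC"

print(len(clean_sequence(FIRST_LINE)))  # 60
print(f"{gc_fraction(FIRST_LINE):.0%}")
```

The value for a single line will differ from the whole-gene figure, since GC content varies along the sequence.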
 
Protein sequence
MKQFKLPDVG EGLTEAEIVR WHVKAGDPVK VNQIIVEIET AKAVVELPCP FEGVVAALMA 
DEGETVDVGR PIISVDDGTG TDPAPSAAPG PAPERGQALA EDMVPALPKE ERQPVLVGYG
VKMGAAKRRP RKSAPTPPAG SPAPRSVQPS AREDAGPAAG EAAGPFTGES AAPSVREDAR
ENGTAAGAAP ASGGRVATLA KPPVRKLAKD LGVDLTTLTG SGPQGSITRD DVQSAVGAVS
APVAVPAVRA GEERIPVKGV RRATAQAMVA SAFTAPHVTE FLQVDVTETM DAVGRLRRLP
DFAEVKVSPL LLVAKAVLVA ARRYPMINSA WDEAAQEIVV KHYVNLGIAA ATPRGLLVPN
VKDAHAMSLP DLARALGALA ETARAGRTQP ADMAGGTITI TNVGVFGVDA GTPILNPGES
VILAFGQVRD MPWVVDGQIV PRRVCTLALS FDHRVVDGEL GSLFLRDVGA MLEDPLRMLA
WN
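
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI amino-acid string for the standard code; translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons. Because this gene starts with ATG, no special start-codon handling is included, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAGCAGT TCAAGCTCCC CGACGTGGGT GAAGGCCTGA CCGAGGCGGA GATCGTCCGC"
last_line = "TGGAACTGA"

print(translate(first_line))  # MKQFKLPDVGEGLTEAEIVR
print(translate(last_line))   # WN
```

The two outputs correspond to the first twenty and the last two residues of the protein sequence listed above. The last line can be translated on its own because the 24 preceding lines hold 1440 bases, a multiple of three.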