Gene Sros_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4604 
Symbol 
ID8667898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5122430 
End bp5123761 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID 
Productcytochrome d ubiquinol oxidase subunit I 
Protein accessionYP_003340208 
Protein GI271966012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0125829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATCT TGGATCTGGC CCGCCTGCAG TTCGCCGTCA CGACGGGGGT GCACTGGCTG 
TTCGTCATCC TCACCCTCGG CCTCGTGCCG CTCGTCGCGA TCATGCAGAC GCGCTCCTTG
CGGGCACGGG ATCGAGTACG GCGCGCGGCA CTGGACCGGA TGACCCGGTT CTGGGGCCAG
ATCTATGTGA TCAACTACGC GCTCGGCATC GTCACCGGTC TTGTGATGGA GTTCCAGTTC
GGCCTGACCT GGAGCGGGCT CGGGAAATAC ACGGGCAACG TCTTCGGCGC CCCGCTGGCC
CTGGAGACGC TCATCGCGTT CTTCGCCGAG TCGACCTTCC TCGGCATGTG GATCTTCGGC
TGGGACCGTC TGCGGCCCGG ACTGCACGTC ACGCTGATCT GGCTGGTCAC GGCCACCGCC
TACACCTCGG CCTTCTGGGT GCTGGTCGCC AACGGCTACA TGCAGGCCCC GGTCGGCTCG
GTGGTCAGGG ACGGCGTGGC CTACCTCACC GACTTCGGCG CGCTGCTCAC CAATCCGAGC
GCGCTGGTGC CGCTGGCCCA CGTGAGTCTC GCCGCGCTCC TGACCGGTGG CCTGTTCGTG
GCCGGGGTCA GCGCCCACCA CCTGCGTCGC GGCAGGGAGG ATTTCCGCGG TCCGCTCCGG
ACCGGCGTCC TCGTGGCCGC CGTCGTCACC TTCCCCGTCT ATGCCGCCGG CGGCCTGCAA
TACCCGATCA TCGCCGCCAC GCAACCGGCC AAGACGGCCA TGATGGACGA GCCCGGCTGG
GTCCTCTGGG GGCAGTACGT CATGATCGGA CTCGGTTACC TGCTCGGCAT CCTCGCGCTC
GCGGCCTTCC TGGTCGTCTT CCGCGAACCG CTGCTGCGGC GCCGGCTACC CGCCGTGGTC
GCTGTCGCGC TGACCGCCTA CCCGGCCGGC GAGTATTTCG GCGGCTTCCT GTTCAACGAG
CCGGGGCCGT ACCGCGGGCC GATCTACCTG CTCTGGTTCC TGATCATGGG CGCAGTCCTG
CTGTCGCGGC CCGTCGGCCC GCTGCTGCGG CTACTGCCGA TCCTCATCCC GCTGCCTCTC
GTCGCCTCCC TCGGCGGCTG GATCTCCCGC GAGATCGGCC GGCAGCCCTG GATCGTGTAC
GGCAGGCTCA CCACGGCCCA GGCCCTGTCG CCCGGCCTGA CCCGGACCAT GATCATCAGC
TCGCTGGCCG GCTTCGTCGT CGTGCTCGGC GCGCTGGCCG TCACCAACTG GGTCCTGATC
GCGCGCACCG CCCGCCGCGG GCTGGAACCC GCGCCCACCG AACCGGCCGC CGAACCCGTA
CCGGCATTCT GA
 
Protein sequence
MDILDLARLQ FAVTTGVHWL FVILTLGLVP LVAIMQTRSL RARDRVRRAA LDRMTRFWGQ 
IYVINYALGI VTGLVMEFQF GLTWSGLGKY TGNVFGAPLA LETLIAFFAE STFLGMWIFG
WDRLRPGLHV TLIWLVTATA YTSAFWVLVA NGYMQAPVGS VVRDGVAYLT DFGALLTNPS
ALVPLAHVSL AALLTGGLFV AGVSAHHLRR GREDFRGPLR TGVLVAAVVT FPVYAAGGLQ
YPIIAATQPA KTAMMDEPGW VLWGQYVMIG LGYLLGILAL AAFLVVFREP LLRRRLPAVV
AVALTAYPAG EYFGGFLFNE PGPYRGPIYL LWFLIMGAVL LSRPVGPLLR LLPILIPLPL
VASLGGWISR EIGRQPWIVY GRLTTAQALS PGLTRTMIIS SLAGFVVVLG ALAVTNWVLI
ARTARRGLEP APTEPAAEPV PAF