Gene Sros_3322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3322 
Symbol 
ID8666610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3617945 
End bp3619234 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339004 
Protein GI271964808 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.928519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.276614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAACGG CAGTCGCCCC CCGGCTGGGC CCGCGGCTCT GGACCGCCCT GATCGTGCTC 
GGCTTCGTCG GTCAGGTGGC CTGGACCGTC GAGAACATGT ACCTCAATGT TTTCGTCTAC
GACACGATCA GCGACGACCC CGGAGCCATC GCGACGATGG TCGCGGCCAG CGCGCTGGCC
GCGACGCTGG CGACGTTGCT GATCGGCGCC GTCTCCGACC GCACCGGCCG CCGCAGGGTG
TTCGTCTCGG TGGGGTACGT GCTGTGGGGG CTGGTCACCG CCGCGTTCGG GTTCGTCACC
GTGGACGCGG TCGCGGGCGT CGTCTCCGCG GCCGGCGCCG TCCTCGTCGC GGTGGTCGCC
GTCATCGCCC TCGACTGCCT CATGTCCTTC CTCGGGGCCG GGGCCAACGA CGCGGCCTTC
CAGGCCTGGG TCACCGACGT GACGCATCCG GGCAACCGCG GCAGGGTCGA GTCGGTCCTG
GCGACCATGC CGCTGGTGTC GATGCTGGCG GTCTTCGGCG GCTTCGACGC TCTCACCCGG
GCGGGGAACT GGCGGCTGTT CTTCCTGCTC ATCGGGGCGG CGATCGTCAT GGTCGGGGTG
GCCTCCTGGT TCCTCGTGCG GGACCGGCCC ACTCCGGCCC GGCAGGAGGG CGGCTACCTG
AGCTCCCTCG TCCACGGCCT GCGCCCGGCC GTGATGCGCG CCAACCCGGG CCTCTACCTC
GCGCTGGCGG CGTGGTCGAT CTGGGGGATC TCCACGCAGG TCTTCCTGCC CTATCTGATC
ATCTATGTGC AGCGGTACCT GCACATCGAG GGCTACGCCG TCGTCCTGGC GGTCGTCCTG
ACCGGTGCCT CGGCCGTCAG CGTCGTCTGC GGCCGGTACG TCGACCGGAT CGGCAAGATC
CGTTTCCTGC TGCCGGCGGT CGCGGTGTAC GGGGCGGGGC TGCTGCTGAT GACCGTCGCG
CGCGGCATGA TCCCGGTGAT CGCGGCCGGA CTGGTGATGA TGTCCGGCTT CATGCTCGTC
CTGGCGCCCA TCGGCGCGAT CGTCCGCGAC TACTCGCCGC CGGGCCGGGC CGGGCACGTC
CAGGGGCTGC GGATGGTCTT CGCGATCCTC ATCCCCATGA TCGTCGGGCC GTCCCTCGGC
GCCGCCGTGA TCAAGGGGGC CGACGAACAC TACGAGGAGC TGGGTGTGCT CAAACAGGTC
CCGACGCCGG CCGTCTTCCT GGCCGCCGCG GCCGTACTGG TGCTCATCGT CGTGCCCGTG
CTGGCGCTGC GCCGGAGGGA GGGCCGATGA
 
Protein sequence
MTTAVAPRLG PRLWTALIVL GFVGQVAWTV ENMYLNVFVY DTISDDPGAI ATMVAASALA 
ATLATLLIGA VSDRTGRRRV FVSVGYVLWG LVTAAFGFVT VDAVAGVVSA AGAVLVAVVA
VIALDCLMSF LGAGANDAAF QAWVTDVTHP GNRGRVESVL ATMPLVSMLA VFGGFDALTR
AGNWRLFFLL IGAAIVMVGV ASWFLVRDRP TPARQEGGYL SSLVHGLRPA VMRANPGLYL
ALAAWSIWGI STQVFLPYLI IYVQRYLHIE GYAVVLAVVL TGASAVSVVC GRYVDRIGKI
RFLLPAVAVY GAGLLLMTVA RGMIPVIAAG LVMMSGFMLV LAPIGAIVRD YSPPGRAGHV
QGLRMVFAIL IPMIVGPSLG AAVIKGADEH YEELGVLKQV PTPAVFLAAA AVLVLIVVPV
LALRRREGR