Gene Sros_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0407 
Symbol 
ID8663675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp406495 
End bp407646 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscription termination factor Rho 
Protein accessionYP_003336179 
Protein GI271961983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG AAACCACCAC CAAGAAGCGG CCACGTGCCG CACGCCCTCC GCGTCCTCGC 
GAGAGCGACG CCTACCTGGA GACCGTGGCC GGGCTGCTCG ACGTCCGCGA CAAGACGGGC
TACATACGCA CCCACGGCTA CCTCCCCGGG GTGGACGACG TGCGCGTGCC CCACGCCCAG
ATCAGGCAGT ACGGCCTGCG TCCCGGCGAC CACGTCGTCG CCACCACGCG CAAGCCGTAC
GAGAGGCTGG CCGAGGTGGA GAGCGTCAAC GGCTCCACCG ACTGGCGGAA CAGGCCCGAC
TTCGCCGACA TGACGCCGAT CCACCCGCGC GAGCGGCTCC GTCTGGAGAC CGAGTCGGTG
ACCAGCAGGG TCATCGACCT GTTCGCGCCG ATCGGCAAGG GCCAGCGCGG CCTGATCGTC
GCCCCGCCGA AGGCGGGCAA GACCATGGTC CTGCAGGACC TGGCCGCCGC GATCACGCGC
AACCATCCGG ACTGTCACCT CATGGTCGTG CTCGTCGGCG AGCGCCCCGA GGAGGTCACC
GAGATGCGCG AGTCCATCCA CGGCGAGGTC GCCGCGTCCA CATTCGACCG CCCCGACCGC
GACCACACCG CCCTCGCCGA ACTCGCCGTC GAGCGCGCCA AGCGCCTCGC CGAGAGCGGG
CACGACGTCG TCGTCCTGCT CGACTCCCTG ACCCGCCTGG GCCGCGCCTA CAACAACCTC
GCCCCCGGCG GCGGACGCAC CCTCGCCGGC GGCCTCGACG CCGCGGCCCT GCTCCCCCCG
CGCCGCTTCT TCGGCGCCGC GCGCAACCTG CGTGACGGCG GCTCGCTGAC GATCCTCGCC
ACCGCCCTGG TCGAGACCGG CTCGCGGATG GACGACAACC TCTTCGAGGA GTTCAAGGGC
ACCGGCAACA TGGAGCTGCG CCTCAGCCGC GCGCTGGCCG ACAAGCGCCT CTACCCCGCC
GTCGACCTCG ACGCCTCCGG CACCCGCCGC GAGGAGATCC TGCTCGACCC GCAGGAGCAC
CAGCTCACCT GGCGCCTGCG CCGTACCCTC GGCGGCCTGG AGAAGCAGCA GGCCCTGGAA
CTGCTCACCG ACAGGCTCCG GGAGACCCCT TCCAACGCCG CCTTCCTCCA GCAGGTCCGG
CAGACCACCT GA
 
Protein sequence
MTIETTTKKR PRAARPPRPR ESDAYLETVA GLLDVRDKTG YIRTHGYLPG VDDVRVPHAQ 
IRQYGLRPGD HVVATTRKPY ERLAEVESVN GSTDWRNRPD FADMTPIHPR ERLRLETESV
TSRVIDLFAP IGKGQRGLIV APPKAGKTMV LQDLAAAITR NHPDCHLMVV LVGERPEEVT
EMRESIHGEV AASTFDRPDR DHTALAELAV ERAKRLAESG HDVVVLLDSL TRLGRAYNNL
APGGGRTLAG GLDAAALLPP RRFFGAARNL RDGGSLTILA TALVETGSRM DDNLFEEFKG
TGNMELRLSR ALADKRLYPA VDLDASGTRR EEILLDPQEH QLTWRLRRTL GGLEKQQALE
LLTDRLRETP SNAAFLQQVR QTT