Gene Sros_3653 details

Gene Information

Locus tag: Sros_3653
Symbol:
ID: 8666941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4048685
End bp: 4049908
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339325
Protein GI: 271965129
COG category:
COG ID:
TIGRFAM ID:
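
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4048685
END_BP = 4049908
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 4049908 - 4048685 + 1 gives 1224 bp, and 1224 / 3 - 1 gives 407 aa, matching the record.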


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTATC CCCCGCAGGG TCGGCAGCCG CATGGTCAGC GCCCGTGGCA GCCCTACAAC 
AGCCAACCGT CCCAGCACGG CCCCTACCCT TCCCAGGGCT ACCCGCAGCA GCCGACGGCA
TACAGGCAGG TACAGCCGCG GCTGTATGCG ACTAACGGCT ATCCGATGCA ACAACCGGTG
CCGCAATCAC AGCAGCGGCG TGGCGGGTCG GCCCTCTCCG TGGCGACTGT CGCGCTGGCC
GTCATCGCGC TGCTCGGTGT TGCCGTCGTC ACGACCCTCC TCGTACTCAA CTCCGCCACC
TCGACCGGAT CATCCGAGGG AGCGAGAGTG GCCCTGGTGG ACCTGCGGGC GCAGGAGCAG
ACCGAGCCCG GGGTACGCAC TGCCGCGCAA CAGGCGTTCG ATCTGTACTC GGCCGGCTCC
TATGGCGAGT TCTGGGATCG CTGGTCTGCG CAGTCGCAGT CCCTGATGCC GCGCGACGAC
TACATCAGCA TGTTCGAACA GTGCCCGCAG GCCGCGCAGA ATCTGCGGTT CACAATCAGC
TCAGTCGCTG TCAACGGCAC CGGCGCGAAG GTGAACGCCA ACCGTTTGAT CGCAGCGTTC
ACCTTCGACT TCACCTATGA AGGCCAGGCG TGGCGGTATG TCCTACCTGC CGATCAGCAG
CAGGAGTACC GCACCAAGAG CCTCGATCAG ATCGTGCAGG AGCGGCGGGC CTCCAAGGTC
TGCGGTGGAC AGGATGGCGG CTTACGGCTC ACCCCCGTGC CGACTCAGCC CCTGATCGCA
CAGCCGCCGA CAGCGCAAGC GCAGACAGTG ACGGTGGCGA AGGTCGGCGA GACCATCACC
GTTAAGGGGC TACAGCCCGG CGTTGAAGTC GCTGTCACCC CCAATCGGGT CATTGACAAC
GCGACCTCCG GCAACCAGTT CCTGAAGCCG AAAGACGGCA ACCGCTACAT CGCTGTCGAA
CTCACCTTGA AGAATGTCGG CCAGGAGATC TACACCGACT CACCGGCTGT CGGCGGGACG
TTGATCGACG CCGAAGGGCA ACAGCATCGG CCAACGTTCG CGGAAGTGAC GGAGGGAGCC
GCGTTCGGCG GATCGGTCAC TGTGAACCGC GGCGATACCC GAAAAGGCCT GATCGTATTC
GAAGTTCCCG CCTCGGCGAC GCCTGCCAAA CTGCAGTTCG GGGTCATGTT TGGTCAGCAG
AAGGGCGAAT GGGCGCTGTC TTAG
 
Protein sequence
MTYPPQGRQP HGQRPWQPYN SQPSQHGPYP SQGYPQQPTA YRQVQPRLYA TNGYPMQQPV 
PQSQQRRGGS ALSVATVALA VIALLGVAVV TTLLVLNSAT STGSSEGARV ALVDLRAQEQ
TEPGVRTAAQ QAFDLYSAGS YGEFWDRWSA QSQSLMPRDD YISMFEQCPQ AAQNLRFTIS
SVAVNGTGAK VNANRLIAAF TFDFTYEGQA WRYVLPADQQ QEYRTKSLDQ IVQERRASKV
CGGQDGGLRL TPVPTQPLIA QPPTAQAQTV TVAKVGETIT VKGLQPGVEV AVTPNRVIDN
ATSGNQFLKP KDGNRYIAVE LTLKNVGQEI YTDSPAVGGT LIDAEGQQHR PTFAEVTEGA
AFGGSVTVNR GDTRKGLIVF EVPASATPAK LQFGVMFGQQ KGEWALS
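
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method the database used: the codon-to-amino-acid mapping is the standard one (translation table 11 assigns the same amino acids as the standard code), and the first codon is reported as M when it is ATG, GTG, or TTG, which is only a subset of the initiation codons table 11 allows. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGACGTATC CCCCGCAGGG TCGGCAGCCG CATGGTCAGC GCCCGTGGCA GCCCTACAAC"
print(translate_cds(first_line))  # MTYPPQGRQPHGQRPWQPYN
```

The output for the first 60 bases matches the first 20 residues of the protein sequence, including the GTG start codon being read as M.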