Gene Sros_9064 details

Gene Information

Locus tag: Sros_9064
Symbol:
ID: 8672410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 10003973
End bp: 10005268
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003344433
Protein GI: 271970237
COG category:
COG ID:
TIGRFAM ID:
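
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (a common convention for records like this, but not stated on the page), and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(10003973, 10005268)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Both values agree with the Gene Length and Protein Length fields of this record.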


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGGA CCGCCCAAGT CACCTCCTTC GAGGTGTTCG AGCGCGTCGC CCAAGGCCTG 
CGCATGCCCG ACCCCGCCCG CATGGCCCTG GGCCTGGCCC CTACCCGGCC GGCGGCACCA
CGAGTGATCG ACGCCGTTAC CCATCCCGCC TCAGTCATCC CGTCTCGGGA TGGAACGCCA
ACAAGTGCCC TGCTTAGCGT GGAGAGCGCT GTTGCCGTTG GCCAGCCTCC GAGGGACGTC
GACGTTCTGA CCCTGGCCTG GATTGTGGGA AGGCTGGACT CTCACATGGA CCGCCGAACG
ATGCTCATCC TCGCCGCCGG AATGACCGCC GAAACCGCGG CCACCATCGC CGACCCCTGG
GAGCGCCTGT CCCGCGCGCT GACCGGACCA CAGACACTCG ACGAAGACAC CATCGAACGC
CTCGAAGCCC GCACCATCGG CTTCCACCGC CTGGAGTACG TGCTCCCCGC CCGAGCCATC
TACCAAGGGC TCACCACCCA CATCAACGAA CTGAGCAACC TGCTCCAGAG CGGCCCGCCC
GACCGCTTCC GCCGACGCCT GGCCGCGACC GCCGGCGAAG CCGCCACCCT CGCCTCCTGG
ATCGCCTGGG ACCTCAAGCA GCCCGGCCAG TCCGCCTCAT TCGAGCGCGT CTCCGCCCTG
GCCGCCAAGG AGAGCGGGCA CCCGATCATC CAGGCGTGCA CCTACGCCTA CAGGTCCAAT
GCCGCCGAAG GCCACACCGC CTACGAGGCC GTACGGCAGG CACAGCAGTT CCTCCCCGCC
CAGGGCGACG ACGCCACCCG GGCATGGCTG CTCAGCAGGG AAGCCGAAGA GTTGGCTGCC
CTCGGTGACC GCCGCGCCGT CGACCTGTTG CACCAGGCCG AAGAGGCCTA CGGTCGAGCA
CGACCCCACC GCGAACGCGC CTGGACCCGC TTCCTCGACC CCGGCCGCAT GGCCGCCTTT
CAACTGTCCA CCTACGTACG ACTCGGCGAC GAACGTCAGG TGATCGAGGC CGGCCAGGCC
GCGCTGTCGG CCGTCGCCCA GGACGCCGAC CACAAGAAAG TGGCCGTCAT CTACGCCGAC
ATCGCCCAGG CCCAGCTCCA GATAGGCGAC GTTGCCGAAG GAATCGCCTA CGCCCGTCGG
GCGCTCGACG CCGCCCAGCG CGGCGAATCG ACCTGGGGAC TCCAGCACCT CACGACGGTG
GAGAAGGCCC TTTCCACCCA GCAGGACCAG GCCGCCCGGG ACCTGCTCGG CGACATCGTC
TCTACGCGCC GGACACTCGG GCCGTCTCCC GCCTGA
 
Protein sequence
MKGTAQVTSF EVFERVAQGL RMPDPARMAL GLAPTRPAAP RVIDAVTHPA SVIPSRDGTP 
TSALLSVESA VAVGQPPRDV DVLTLAWIVG RLDSHMDRRT MLILAAGMTA ETAATIADPW
ERLSRALTGP QTLDEDTIER LEARTIGFHR LEYVLPARAI YQGLTTHINE LSNLLQSGPP
DRFRRRLAAT AGEAATLASW IAWDLKQPGQ SASFERVSAL AAKESGHPII QACTYAYRSN
AAEGHTAYEA VRQAQQFLPA QGDDATRAWL LSREAEELAA LGDRRAVDLL HQAEEAYGRA
RPHRERAWTR FLDPGRMAAF QLSTYVRLGD ERQVIEAGQA ALSAVAQDAD HKKVAVIYAD
IAQAQLQIGD VAEGIAYARR ALDAAQRGES TWGLQHLTTV EKALSTQQDQ AARDLLGDIV
STRRTLGPSP A
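
The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch of that step, not the pipeline used to produce this record. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence shown above.
print(translate("ATGAAGGGGA CCGCCCAAGT CACCTCCTTC"))          # MKGTAQVTSF
print(translate("TCTACGCGCC GGACACTCGG GCCGTCTCCC GCCTGA"))   # STRRTLGPSPA
```

The two outputs match the first ten and the last eleven residues of the protein sequence listed above.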