Gene Sros_7654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7654 
Symbol 
ID8670975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8446758 
End bp8447888 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content69% 
IMG OID 
Productcellulose biosynthesis protein CelD 
Protein accessionYP_003343070 
Protein GI271968874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATCA CTGTGGTGCG CCCGCAGGAC CTCGGCGAGG CCGAGTCCCA CCGATGGCGC 
GAGATCCAGA AAGCCTCTCC CAGCCTCGAC AACCCCTTTC TCTCCGTGGA CTTCACCCTG
GCCATGGGCA GGCTCCGTGA CCACGTCAGG GTCGCGGTGA TCGAGGACGG CGGCGAGATC
GCGGGATTCC TCCCCCACGA GCGGCACGGC TTCGGCGTCG GCAGGCCGCT GGGCGGCTAC
CTCACCACCT GCCAGGGGCT GGTCTCGGTC CCCGAGCTGA AGATCGACCC ACGTGACCTG
CTCCGGGCCT GCGGGCTGTC GGCCATCGAC TTCGACCACC TGGTCGCCGG CCAGCCCACG
TTCGCGCCCT ACGAGACGGA CGTACGGCCC GCCCCCGTCA TGGACCTCAG CGGCGGTTTC
GACGCCTACG TCGAGCGGGT GCGCGCCGGC TCGGCGAAGA ACTACAAGAC CGTCCGCTAC
AAGGAGCGCA AGCTCGGCCG CGAACGGGGC GAGATCCGGT TCGAGTGGGA CTCCGCCGAC
ATCGGGACGC TGCGCGCGGT CATGGCCTGG AAATCGGACC AGTACCGGCG GACCGGACGG
GTGGACCGCT TCGCCCAGCC GTGGATCGTG CGGCTCGTCG AGGAGCTGCA CTCCCGGCGC
TCCGACGACT TCGCCGGCGT GCTCACCATG GTCTACGCCG GAGACACCCC CGTCGCCGGG
CACTTCGGCC TCCGTACGGC GCACACCCTG GTGGGCTGGT TCCCCGCCTA CGACCCGGCC
TTCGCCCGCT ACTCCCCCGG GATCATGCAC CACCTGCACA TGGCCGAACA CGCCGCGAAC
GCGGGGCTGC ACCAGGTGGA CATGGGGAAA GGCGGCCGCG AATACAAGGA ATGGCTTAAA
ACCGGCGTTT TGATGATCGC CGAGGCACGC ATCTCGCGTC CGTCTCCGGT GGCCGCCGCC
CAGTGGCTGG GCCGGGTCCC CATCAGCAGA CTCCGCGCCG TCGTAGTGGA CAACCCCTCC
CTTTTCCGAG CCGCGGACCG GCTACTCAAG GGCTACGGCA GAGCGAGATC CTCTCTCCTG
TCCCGCCCCA TATCCCCTCC CACCGCAGAA CGATCACCCG AGGCACAGTA A
 
Protein sequence
MKITVVRPQD LGEAESHRWR EIQKASPSLD NPFLSVDFTL AMGRLRDHVR VAVIEDGGEI 
AGFLPHERHG FGVGRPLGGY LTTCQGLVSV PELKIDPRDL LRACGLSAID FDHLVAGQPT
FAPYETDVRP APVMDLSGGF DAYVERVRAG SAKNYKTVRY KERKLGRERG EIRFEWDSAD
IGTLRAVMAW KSDQYRRTGR VDRFAQPWIV RLVEELHSRR SDDFAGVLTM VYAGDTPVAG
HFGLRTAHTL VGWFPAYDPA FARYSPGIMH HLHMAEHAAN AGLHQVDMGK GGREYKEWLK
TGVLMIAEAR ISRPSPVAAA QWLGRVPISR LRAVVVDNPS LFRAADRLLK GYGRARSSLL
SRPISPPTAE RSPEAQ