Gene Sros_3212 details

Gene Information

Locus tag: Sros_3212
Symbol:
ID: 8666500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3500148
End bp: 3501692
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003338898
Protein GI: 271964702
COG category:
COG ID:
TIGRFAM ID:
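
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 3500148
END_BP = 3501692


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
aa_len = protein_length(gene_len)
print(gene_len, "bp;", aa_len, "aa")
```

Under these assumptions the result agrees with the Gene Length (1545 bp) and Protein Length (514 aa) fields.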


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0502803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCGGG CGGCGGCCGC GCTCGCGCTG GTGACCATGG CCACGCTGGC CGCCTGCTCC 
TCCGGCGACG TGCCCGGGGA CGGCCCGGAC GTGCTGCGGG TGCTGGCCGG CAGCGAGGTC
AAGGACCTGG AGCCGCTGCT GCGGAGGTCC GGGGTGAAGG TGAGGATCTC CTACACCGGC
ACGCTGGACG GCGCCGAGCA GGTGGCGGGC GGCGGCGCGG ACGGCTCCTA CGACGCGATC
TGGTTCTCCT CCAACCGCTA CCTGTCGCTG ATCGACGGCG CGACCGCACG GCTGTCCACC
GAGACGAAGA TCATGGTCTC CCCGGTGGTG CTCGGGCTCA CGACCGCCAA GGCCAGGGAG
CTCGGCTGGG AGGGCAGGCC GGTCACCTGG GAGCAGATCG CGACCGCCGC CCGCGAGAAG
CGGTTCACCT TCGGCATGAC CAACCCCGCC TCCTCCAACT CGGGTTTCTC CGCACTGGTC
GGGGTGGCCG CCGCGTTGTC GGACGCCGGG GAGGCGCTGA GCGGCGAGCA GATCACCGCG
GTGACGCCCA GGCTCAAGGA GTTCTTCTCC GCCCAGCGGC TGACCTCGGG GTCCTCCGGC
TGGCTGGCCG ACGCCTACTC CCGGGAGGGC GGGGTGGACG GCATCGTGAA CTACGAGTCG
GTGCTGCTCG GCATGGGCGG GCTGAGCCTG GTCCGCCCGA GCGACGGGGT GGTCACCGCC
GACTACCCGC TGACCCTGCT GGCCTCGGCC CCGCGGGAGA AGAAGGAGCT GTACGGCAGG
CTGACGGCCT GGCTGCGGAC GCCGGACGTG CAGCGGGAGA TCATGACCGG CACCCACCGG
CGGCCGATCG TCCCCGGCGT CCGGCCGGGG CCGGAGTTCG GCACGGCGCC CCTGCTGGAG
CTGCCGTTCC CCAACCGCAG GGCCGCCGCC GACGGGCTGA TCACCGCCTA TCTCGACGAG
GTGCGCGTCC CTGCGCGGGC GCTGTTCGTG CTGGACACCT CGGGGTCGAT GGAGGGCGAG
CGGATCGAGG CGCTGCGCCA GGCACTGGTC ACGCTCACCG GCGCCGACAC CTCGGCCTCC
GGCACGTTCT CCCGCTTCCG CAGCCGGGAG AACGTGATCA TGATCCCGTT CGGCGGCTCG
GCCGGGCTGC CGCAGCCGTT CATCCTCCCC GAACGTGACC CGCAGCCGGC CCTGGCGCAG
ATCAGAGCCT ACGCCGAACG GCTCCGGGCG GCCGGCGGGA CCGCGATCTA CGACGGGCTG
CGCGCCGCCT ACGGGCAGGC CGGCGACGCC GGCCGGGACC ACTACACCTC GATCGTGCTG
ATGACCGACG GCGAGAACAC CGACGGCTCC TCCTACGAGG ACTTCGAGGC GTACTACCGG
TCGCTGCCCG AGGCGCGGCG GCAGGTCCGG ACGTTCGTCG TGCTGTTCGG CGAGAGCGAC
GCGGACGAGA TGGAGAGGAT CGCCACGCTG ACCCGGGGCG CCGTCTTCGA CGCCCGCACC
GGATCGCTCG CCTCGGCGTT CAAGGAGATC CGTGGCTACC AGTGA
 
Protein sequence
MRRAAAALAL VTMATLAACS SGDVPGDGPD VLRVLAGSEV KDLEPLLRRS GVKVRISYTG 
TLDGAEQVAG GGADGSYDAI WFSSNRYLSL IDGATARLST ETKIMVSPVV LGLTTAKARE
LGWEGRPVTW EQIATAAREK RFTFGMTNPA SSNSGFSALV GVAAALSDAG EALSGEQITA
VTPRLKEFFS AQRLTSGSSG WLADAYSREG GVDGIVNYES VLLGMGGLSL VRPSDGVVTA
DYPLTLLASA PREKKELYGR LTAWLRTPDV QREIMTGTHR RPIVPGVRPG PEFGTAPLLE
LPFPNRRAAA DGLITAYLDE VRVPARALFV LDTSGSMEGE RIEALRQALV TLTGADTSAS
GTFSRFRSRE NVIMIPFGGS AGLPQPFILP ERDPQPALAQ IRAYAERLRA AGGTAIYDGL
RAAYGQAGDA GRDHYTSIVL MTDGENTDGS SYEDFEAYYR SLPEARRQVR TFVVLFGESD
ADEMERIATL TRGAVFDART GSLASAFKEI RGYQ
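
The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as an initiator codon. The following is a minimal sketch of that translation, not the tool used to produce this record. It assumes the standard amino acid assignments (which table 11 shares with table 1) and the table 11 initiator set, reads any listed initiator in first position as Met, and stops at the first stop codon. The function name translate_table11 is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons assumed for table 11; each is read as Met in first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str, is_cds_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "GTGCGGCGGG CGGCGGCCGC GCTCGCGCTG GTGACCATGG CCACGCTGGC CGCCTGCTCC"
last_line = "GGATCGCTCG CCTCGGCGTT CAAGGAGATC CGTGGCTACC AGTGA"

print(translate_table11(first_line))                      # MRRAAAALALVTMATLAACS
print(translate_table11(last_line, is_cds_start=False))   # GSLASAFKEIRGYQ
```

Both outputs match the corresponding ends of the protein sequence listed above. The last line starts in frame because the preceding 1500 bases are a whole number of codons.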