Gene Sros_6106 details

Gene Information

Locus tag: Sros_6106
Symbol:
ID: 8669404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6696957
End bp: 6699314
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: GTP diphosphokinase
Protein accession: YP_003341580
Protein GI: 271967384
COG category:
COG ID:
TIGRFAM ID:
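
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6696957
end_bp = 6699314
gene_length_bp = 2358
protein_length_aa = 785

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.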


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.364718
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.643206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCGTG ATGTGGTCGT CCCCGACGTG ACCGCCGACG GCGGCGGTCC CGTGACGGCC 
GTCCCGGCCG GCGATGCCGG TTCCGCCATG TCCGGAGCCA CCGAAGAGAA GCCTGCGGTG
AGGCGCAGAC TCGCGCGCTT TGGGGGGCAA TGGGGTGGCG CCATGAACCC GGTGCTGGAG
CCGCTGTTCC GGACGGTCCG TGCGACCCAT CCGAAGGCAG ACCTGCGGCT GATCGAGCGT
GCCTACGATG TGGCCGCCTA CCACCATCGC GATCAGAAGC GCAAGAGCGG CGACCCCTAC
ATCACCCACC CCCTGGCGGT GGCGACGATC CTGGCCGAGC TCGGCACCGA CGACGAGACG
CTGTGCGCCG CGCTGCTGCA CGACACCGTC GAGGACACCG CCTACGGCCT GGACGAGCTG
CGCTCGGACT TCGGCGACAA CATCGCGCTG CTGGTCGACG GCGTCACCAA GCTCGACAAG
GTCAAGTTCG GTGACGCCGC GCAGGCCGAG ACCGTGCGCA AGATGGTCGT GGCGATGTCC
CGCGACATCC GGGTGCTGGT GATCAAGCTC GCCGACCGGC TGCACAACAT GCGCACCATG
CGCTACCTCC CCGAGCACAA GCGGCAGCAG AAGTCCCGCG AGACGCTGGA GATCTTCGCG
CCGCTCGCCC ACCGGCTGGG CATGAACACC ATCAAGTGGG AGCTGGAGGA CCTCGCGTTC
GCCATGCTCT ACCCCAAGCG CTACGACGAG ATCGCCAGGA TGGTGTCGGA GCGGGCCCCG
CGCAGGGACC TGTTCCTGCA GGAGGTCATC GAGAAGGTCT CCGGCGACCT GCGCGAGGCC
AAGATCCGCG CGGTGGTCAA GGGACGCCCC AAGCACTACT ACTCGGTCTA CCAGAAGATG
ATCGCCAGGG ACGTCGCCTT CGACGACATC TACGACCTGG TCGGCATCCG GGTGCTGGTC
GACACGGTCC GCGACTGCTA TGCCGCCCTC GGAACGATCC ACGCGCGATG GAACCCGGTG
CCCGGCCGGT TCAAGGACTA CATCGCGATG CCCAAGTTCA ACATGTACCA GTCGCTGCAC
ACCACGGTGA TCGGCCCCGA GGGCAAGCCG GTGGAGCTGC AGATCCGCAC CCACGCCATG
CACCACAGGT CCGAGTACGG CGTGGCCGCG CACTGGAAGT ACAAGGAGGA CATGACGGCC
GCCGGTCCTC CCGGAGCGAA GCTGAAGCCC GGCAACGACA TGGCGTGGCT CCGCCAGCTC
CTGGACTGGC AGAAGGAGAC CGCCGACCCG GGGGAGTTCC TGGAGTCGCT CAGGTTCGAC
CTGTCGGTCT CGGAGGTGTA CGTCTTCACC CCGCGGGGCC AGGTGATCGC CCTCCCCGAG
GGTGCGACGT CGGTCGACTT CGCCTACGCC GTCCACACCG AGGTCGGGCA CCGCTGCATC
GGGGCCCGGG TCAACGGCCG CCTGGTGCCG CTGGAGTCGC GGCTGGGCAA CGGCGACACC
GTCGAGATCT TCACCTCCAA GTCGCCCGAC GCGGGCCCGT CGCGTGACTG GCTCAAGTTC
GTCAAGTCCG GCCGGGCCCG CAACAAGATC CGTCAGTGGT TCTCCAAGGA GCGCCGCGAG
ACCGCGATCG AGGCGGGCAA GGAGGCCATC GGCCGGGCCA TGCGCAAGCA GGGCCTGCCG
CTGCAGCGCA TGATGTCCGG AGAGTCCCTC CTGACCCTCG CCAGGGACCT GCGCTATCCC
GACGTCTCCG CGCTCTACGC GGCCGTTGGA GAGGGCCACA TCGCCGCCCA GGCGGTCGTG
CAGAAGCTGG TGCACTCCCT CGGCGGGGTG GACGGCGCGG AGGAGGACAT CGCCGAGGCC
TCGGTGCCCA CGAAGGTGCG GGGCCGGCCC CGCGGCAGCG GCGGCGCGGG CGTGGTGGTG
GCGGGTGACT CGGACGTGTG GGTACGGCTG TCGCGCTGCT GCACCCCCGT GCCCGGTGAC
GAGATCATCG GCTTCGTCAC CCGTGGGCAC GGCGTGTCGG TGCACCGCAC CAACTGTCCC
AACGTGGAGC AGCTGAAGTC CCAGCCGGAC CGGCTGGTCG AGGTGGCCTG GTCGGCCGCG
GACGACTCGG TGTTCCTGGT CGCCCTGCAG GTCGAGGCGC TCGACCGGCC ACGTCTGCTG
TCGGATGTGA CCCGGACCCT GTCGGACCAG CACGTGAACA TCCTGTCGGC GTCGGTGACG
ACGTCCAGGG ACCGGGTGGC GATCAGCAAG TTCACCTTCG AGATGGGCGA CCCCAAGCAC
CTGGGGCACG TCCTGAAGGC CGTGCGCAAC ATCCCCGGTG TCTACGACGT CTACCGGGTG
AGCGGCGCCG GAGCCTGA
 
Protein sequence
MPRDVVVPDV TADGGGPVTA VPAGDAGSAM SGATEEKPAV RRRLARFGGQ WGGAMNPVLE 
PLFRTVRATH PKADLRLIER AYDVAAYHHR DQKRKSGDPY ITHPLAVATI LAELGTDDET
LCAALLHDTV EDTAYGLDEL RSDFGDNIAL LVDGVTKLDK VKFGDAAQAE TVRKMVVAMS
RDIRVLVIKL ADRLHNMRTM RYLPEHKRQQ KSRETLEIFA PLAHRLGMNT IKWELEDLAF
AMLYPKRYDE IARMVSERAP RRDLFLQEVI EKVSGDLREA KIRAVVKGRP KHYYSVYQKM
IARDVAFDDI YDLVGIRVLV DTVRDCYAAL GTIHARWNPV PGRFKDYIAM PKFNMYQSLH
TTVIGPEGKP VELQIRTHAM HHRSEYGVAA HWKYKEDMTA AGPPGAKLKP GNDMAWLRQL
LDWQKETADP GEFLESLRFD LSVSEVYVFT PRGQVIALPE GATSVDFAYA VHTEVGHRCI
GARVNGRLVP LESRLGNGDT VEIFTSKSPD AGPSRDWLKF VKSGRARNKI RQWFSKERRE
TAIEAGKEAI GRAMRKQGLP LQRMMSGESL LTLARDLRYP DVSALYAAVG EGHIAAQAVV
QKLVHSLGGV DGAEEDIAEA SVPTKVRGRP RGSGGAGVVV AGDSDVWVRL SRCCTPVPGD
EIIGFVTRGH GVSVHRTNCP NVEQLKSQPD RLVEVAWSAA DDSVFLVALQ VEALDRPRLL
SDVTRTLSDQ HVNILSASVT TSRDRVAISK FTFEMGDPKH LGHVLKAVRN IPGVYDVYRV
SGAGA
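
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any analysis. The sketch below is one way to do that in Python, and to compute a GC fraction and a translation from the cleaned sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon assignments are the standard ones (which translation table 11 shares), and the set of start codons translated as methionine is a common bacterial subset chosen for this example.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs in start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a subset of the start codons allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "GTGCCCCGTG ATGTGGTCGT CCCCGACGTG"
print(translate(fragment))
print(round(gc_fraction(fragment), 2))
```

For this 30-base fragment the translation is MPRDVVVPDV, which matches the first ten residues of the protein sequence listed above. The same functions can be applied to the full gene sequence once it is pasted in as a single string.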