Gene Sros_0496 details

Gene Information

Locus tag: Sros_0496
Symbol:
ID: 8663765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 499607
End bp: 501940
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Inorganic diphosphatase
Protein accession: YP_003336264
Protein GI: 271962068
COG category:
COG ID:
TIGRFAM ID:
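
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(499607, 501940)
print(length)                     # 2334
print(protein_length_aa(length))  # 777
```

Both values agree with the Gene Length and Protein Length fields listed in the record.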


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.801124
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTGTGC TCCATCTAGC CGTTGCCGAA GAGCCAGCGG TATCCCTGAG CGGTTCCCAT 
TTCAACATAG TGATAGTGGT GGCCGTCGTG GCCCTGCTCG CACTCGCCGT CGCCGGCGTT
CTCGTGCGGG AGGTGCTCGC CGCCGGTCAA GGCACCGAGC GTATGCAGAA CATCGCGCGA
GCCGTACAGG AAGGGGCCTC CGCGTATTTG GCGCGGCAGT TCCGGACGCT CGCGGTGTTC
GTCATCCTGA TCCCTTTCCT GCTCTACCTG CTCCCCGCCG AATCGACCGG TGTGGCCATC
GGCCGCTCGG TTTTCTTCGT GGTGGGCGCG GTCTTCTCCG CACTCACCGG CTTCATCGGC
ATGTGGCTGG CGGTCCGCGG AAACGTCCGG GTCGCCGCGG CCGCCCGTGA GTCGGGCGAG
AAGGTCGCGA TGCGGATCGC CTTCCGCACC GGCGGCGTGG CCGGCATGAT CACCGTCGGT
CTCGGCCTGC TCGGCGCGGC CATCGTGGTG ATCGTCTACG GAGGCGACGC CCCGGCCGTG
CTGGAGGGCT TCGGCTTCGG CGCCGCCCTG CTCGCCATGT TCATGCGGGT CGGCGGCGGC
ATCTTCACCA AGGCCGCCGA CGTCGGCGCC GACCTGGTCG GCAAGGTCGA GCAGGGCATC
CCCGAGGACG ACCCGCGCAA CGCCGCCACC ATCGCCGACA ACGTGGGCGA CAACGTCGGC
GACTGCGCGG GCATGGCCGC CGACCTGTTC GAGTCCTACG CGGTCATGCT GGTCGCCAGC
CTGATCCTGG GCAAGGTGGC CTTCGGCACC GAGGGCCTGG TCTTCCCGCT GATCGTGCCG
ATGGTCGGCG TGATCACCGC GATCATCGGC ATCTTCACCA CCTCCCCGCG GAAGGGCGAC
CGCAACGGCA TGGCCGCGAT CAACCGCGGC TTCTTCATCT CGGCCGTCAT CTCGGCGGTC
CTCGTCGGCG TCGCGGTCTT CCTCTACCTG CCGAGCAGCT TCGCCGGGCT GACCGGGGTG
AGCCCGGAGA TCGCCGCGAT CACCTCCGAC CCGCGCCTGA TCGCCATCGG GGCCGTGCTC
ATCGGCCTGG TGCTGGCCAG CGCCATCCAG ATCCTCACCG GTTACTTCAC CGAGACGAAC
CGCCGCCCGG TGAAGGACAT CGGGGAGAGC GCCCGGACCG GTCCCGCCAC GGTCATCCTG
TCCGGCATCA GCGTCGGCCT GGAGTCGGCG GTCTACTCGG CGCTCATCAT CGGCGCCGCC
GTCTACGGGG CGTTCCTGCT CGGCTTCGGC AACGTCACCA TCGCCCTGTT CGCCGTGGCC
CTGGCGGGCA CCGGCCTGCT GACCACCGTC GGCGTGATCG TGTCGATGGA CACCTTCGGC
CCGGTCTCCG ACAACGCGCA GGGCATCGCC GAGATGTCCG GGGACGTCGA CGGGGAGGGC
GCCCGGGTGC TCACGTCGCT GGACGCCGTG GGCAACACCA CCAAGGCCAT CACCAAGGGC
ATCGCGATCG CGACGGCGGT GCTCGCCGCG ACGGCGCTGT TCGGCGCGTT CCGGACGGCG
ATCGAGACCC AGCTCGCCAA CGCGTCCCAG GGCGTCAAGG ACGTGCTCGG CTCGTTCGGC
ACGTTCAGCC TGAGCGTCGA CTCCCCGAAC GTGCTGGTCG GTCTGATCAT CGGCGCGGCG
GTGGTGTTCA TGTTCTCCGG CCTGGCCATC ATGGCGGTCG GCAGGGCGGC CGGGCGGGTG
GTCTTCGAGG TGCGCGAGCA GTTCCGCACC AAGCCGGGGA TCATGGCGGG TACGGAACTG
CCCGACTACG GCAGGGTCGT GGACATCTGC ACCCGTGACT CGCTGCGGGA GCTGGCCACG
CCCGGCCTGC TCGCCGTACT CACGCCGATC GCCGTCGGCT TCGCCCTCGG CTACGCGCCG
CTCGGCGCGT TCCTCGCCGG GGCCATCGCC TGCGGCACGC TGATGGCGGT GTTCCTGGCC
AACTCCGGCG GTGCCTGGGA CAACGCCAAG AAGCTGGTCG AGGACGGCCA CCACGGGGGC
AAGGGATCGT CCGCCCACGA GGCCACCATC ATCGGTGACA CCGTCGGCGA CCCGTTCAAG
GACACCGCGG GCCCGGCGAT CAACCCGCTG CTGAAGGTGA TGAACCTGGT GGCGCTGCTG
ATCGCTCCGG CGGTGGTCAC CTACGCCGAC AACGTGGCGC TGCGGATCGG CGTCACCGTG
GTCGCGGCCG GCATCGTCGT CGCCGCGGTG GTCGTCTCCA AGCGCCGGTC GGCGAGCATC
GCCCCGACCG AGGAGAACAA CGTCAAGCGC GAGCGAGAGC CGATCTCGGG CTGA
 
Protein sequence
MSVLHLAVAE EPAVSLSGSH FNIVIVVAVV ALLALAVAGV LVREVLAAGQ GTERMQNIAR 
AVQEGASAYL ARQFRTLAVF VILIPFLLYL LPAESTGVAI GRSVFFVVGA VFSALTGFIG
MWLAVRGNVR VAAAARESGE KVAMRIAFRT GGVAGMITVG LGLLGAAIVV IVYGGDAPAV
LEGFGFGAAL LAMFMRVGGG IFTKAADVGA DLVGKVEQGI PEDDPRNAAT IADNVGDNVG
DCAGMAADLF ESYAVMLVAS LILGKVAFGT EGLVFPLIVP MVGVITAIIG IFTTSPRKGD
RNGMAAINRG FFISAVISAV LVGVAVFLYL PSSFAGLTGV SPEIAAITSD PRLIAIGAVL
IGLVLASAIQ ILTGYFTETN RRPVKDIGES ARTGPATVIL SGISVGLESA VYSALIIGAA
VYGAFLLGFG NVTIALFAVA LAGTGLLTTV GVIVSMDTFG PVSDNAQGIA EMSGDVDGEG
ARVLTSLDAV GNTTKAITKG IAIATAVLAA TALFGAFRTA IETQLANASQ GVKDVLGSFG
TFSLSVDSPN VLVGLIIGAA VVFMFSGLAI MAVGRAAGRV VFEVREQFRT KPGIMAGTEL
PDYGRVVDIC TRDSLRELAT PGLLAVLTPI AVGFALGYAP LGAFLAGAIA CGTLMAVFLA
NSGGAWDNAK KLVEDGHHGG KGSSAHEATI IGDTVGDPFK DTAGPAINPL LKVMNLVALL
IAPAVVTYAD NVALRIGVTV VAAGIVVAAV VVSKRRSASI APTEENNVKR EREPISG
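
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal illustration in plain Python. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code, and it ignores the alternative start codons that table 11 permits, since this gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Only the first and last lines of the gene sequence are embedded here, and both are in frame because each preceding line holds 60 bases.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


FIRST_LINE = "ATGTCTGTGC TCCATCTAGC CGTTGCCGAA GAGCCAGCGG TATCCCTGAG CGGTTCCCAT"
LAST_LINE = "GCCCCGACCG AGGAGAACAA CGTCAAGCGC GAGCGAGAGC CGATCTCGGG CTGA"

print(translate(FIRST_LINE))  # MSVLHLAVAEEPAVSLSGSH
print(translate(LAST_LINE))   # APTEENNVKREREPISG
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.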