Gene Sros_3839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3839 
Symbol 
ID8667129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4276888 
End bp4278852 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339501 
Protein GI271965305 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.254825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0525361 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTAGTC CTGTCCTGCC CTCCTCGGGC GCGCTGTCCC CTCTGGGGCT CGATTCGGTA 
CGGCTGACGC CGGGTTTCTG GGGCGACCGC GTCGCGCTCA ACCGCGAGGT CATCATCGCC
CACTGCCAGG AGTGGATGGA GCGGGCCGGC TGGATCGGCA ACTTCCGCAG GCTCGGCGGG
GACCCGGCCC GCCGCGACCC GGTCCTCCGG CAGGGCCGGG AGTTCAGCGA CTCGGAGATC
TACAAGCTGC TGGAGGGGAT GGCCTGGGCC GGCCACCCCG CCCTGCCGGA GCTGGCCGCG
ACCGTGGCAC GGGCGCAGGA GGAGGACGGC TACCTCAACA CCCGCTGGTA CGGCGACCGC
TACACCGACT TCGAGTGGGG CCACGAGCTC TACTGCTACG GCCACCTGAT CCAAGCCGGC
GTCGCACGGC TCCGCACGCA CGGCGAGGAC GAGCTGACCG GCGTCGTCCG GCGGGCGGCC
GACCACATCT GCCGGCGCTT CATGGACACC TCCGAGACGT GCGGCCATCC CGTGGTGGAG
ATGGCCCTGG TCGAGCTGTA CCGGGCGACC GGCGCCGAGC GCTACCTGGA GATGGCCCGC
CGGTTCGTCG AGCGTCGCGG GCTGCCCGCG CTGGACGACA TCGAGTTCGG CCGCGCCTAC
TTCCAGGACG ACCTCCCGGT ACGGCGGGCA CGGGTGTTCC GCGGCCACGC CGTACGGGCC
GTCTACCTCG CCTCGGGAGC GGTCGACGTC GCCGTGGAGA CCGGTGACGC CGAGCTGCTG
TCGGCGATCG AGGCGCAGTG GGAGCGCACC GTCGCCCGGC GCGTCCACCT GACCGGCGGC
ATGGGCTCGC GCCACTCCGA CGAGGCCTTC GGCGACGACT TCGAGCTGCC GCCCGACCGG
GCCTACTCCG AGACCTGCGC CGGGATCGGC TCGATCATGC TCGCCCACCG GCTGCTGCTG
GCCACCGGCG ACGTGCGCTA CGCCGACCTG GCCGAGCGGA CCATGTTCAA CGTGCTGGCC
ACCTCACCCG CGCTGGAGGG CCGCTCGTTC TTCTACGCCA ACCCGCTGCA CGTCCGCGTG
CCCGCCGCGC CGCCGGAGGG GATGAACCCG GCCGCCGAGG GCGGCCTGCG CTCGCCGTGG
TTCACCGTGT CGTGCTGCCC CAACAACATC GCCCGCACCT ACGCCTCCCT GGCCGCCTAC
GTCGCGACCT CCGACGCCTC CGGCGTGCAG ATCCACCACC ACACCCCCGC CGAGATCCAC
CACGAAGGCC TCGTCCTGCG CGTCGAGACC GGCTACCCGT GGTCGGGCGA GGTGACCGTC
CGGGTGGTCA GGGGCGGATC GGGGCGGATC TCCCTGCGGG TCCCGCCGTG GGCCTCCGGT
GCGCGGATCT CCCACGGCGG GACCACCCGC CCGGTGCCCG CGGGCTACGC GGTCGCCGAA
GGGCGCTGGC GGCCGGGCGA CGAGATCCGC CTCCACCTGC CGATGACGCC CCGCTGGACC
TACCCGGACC GCCGGGTGGA CGCCGTACGC GGCTGCGCGG CGGTCGAGCG CGGCCCGCTC
GTCTACTGCG CCGAGTCGGT GAAGGACGAG CCCCCGCTCG CCCTCGTCGA GGCGCGGGTC
TCGCCCCCGG TCGAGCACCT CGTCGACGGC GTCGTCGAGC TCGACGTCGA GGCCGTGCTC
GTCTCCCCCG GGGCGGACGC CTGGCCGTAC GCCTCCTCGC CGCGGACGGG CGGCCCGGCG
CGGGCGGAGG ACGCGGCGAC GCCGCCGGCG CCGCCGGCGC CGGGAAACGC GGACACGCCC
GGACCGGCGG GGAAAGCGGA CACGCCCGAG CCGGCGGGGA AAGCGGGAAC CCCGGAGGAG
CCTGTCCCGG TCCCCTCGGG AGAGCCCCTC AGGCTCGTGC CCTACCACCG CTGGGGCAAC
CAGGGCCCCG CCACCATGCG CGTCTGGCTG CCCACCGCCG GCTGA
 
Protein sequence
MASPVLPSSG ALSPLGLDSV RLTPGFWGDR VALNREVIIA HCQEWMERAG WIGNFRRLGG 
DPARRDPVLR QGREFSDSEI YKLLEGMAWA GHPALPELAA TVARAQEEDG YLNTRWYGDR
YTDFEWGHEL YCYGHLIQAG VARLRTHGED ELTGVVRRAA DHICRRFMDT SETCGHPVVE
MALVELYRAT GAERYLEMAR RFVERRGLPA LDDIEFGRAY FQDDLPVRRA RVFRGHAVRA
VYLASGAVDV AVETGDAELL SAIEAQWERT VARRVHLTGG MGSRHSDEAF GDDFELPPDR
AYSETCAGIG SIMLAHRLLL ATGDVRYADL AERTMFNVLA TSPALEGRSF FYANPLHVRV
PAAPPEGMNP AAEGGLRSPW FTVSCCPNNI ARTYASLAAY VATSDASGVQ IHHHTPAEIH
HEGLVLRVET GYPWSGEVTV RVVRGGSGRI SLRVPPWASG ARISHGGTTR PVPAGYAVAE
GRWRPGDEIR LHLPMTPRWT YPDRRVDAVR GCAAVERGPL VYCAESVKDE PPLALVEARV
SPPVEHLVDG VVELDVEAVL VSPGADAWPY ASSPRTGGPA RAEDAATPPA PPAPGNADTP
GPAGKADTPE PAGKAGTPEE PVPVPSGEPL RLVPYHRWGN QGPATMRVWL PTAG