Gene Sros_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1858 
Symbol 
ID8665136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1974026 
End bp1975216 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content76% 
IMG OID 
ProductDNA uptake Rossmann fold nucleotide-binding protein 
Protein accessionYP_003337589 
Protein GI271963393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.306084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00651172 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCGACC GGCTCGCGCG GGTCACGTTG ATGCGGGTGG CCGAGCCCGG CGACGCCATG 
ATGGGCAGGC TCGTCGCCGC TCACGGCCCG CAGGCGGCCG TCGCGCGCAT CCGCGAGGGA
CGGCCGGAGC CGGAGCTCGC CCGGTGGTTC GCCGACTCCC CCCGCCGGGA GTCGGCCGGC
TCGCGGGACG AGCGGCTCAC GGCCAGGCTC GGGCGGATGT TCGCCTCCTG GGCGGCCCGG
CTGGAGACGG CCGACCCCGT CCGCGACCTC GACGAGGGGG AGCGCCGTGG GGCGCGGCTG
GTGGTCCCCG GCGACTCCGA GTGGCCGACC CAGCTGGACG ACCTCGGCGA GTCCCGTCCC
CACGCCTTGT GGCTCCACGG TGAGGCCGAC CTGCGCTTCT CCTGCCTGCG CTCCGTCGCG
GTCGTCGGCT CCCGGGCCGC CACGCCGTAC GGCACGCATG TCGCGGCGGA GTTCGGAGCC
GGGCTGGGCG AGCGGGGCTG GGTCGTGATC TCGGGCGGCG CCTACGGCAT CGACGGGGCG
GTCCACCGCG GCGCCCTCGC GGGGGAGACG CCGACCGTCG CGGTGCTGGC CTGTGGCGCC
GACGTCGCCT ATCCCAGCGC GCACCATTCG CTGTTCGCGG CCGTGCGATC CCAGGGAGTG
CTGGTGAGCG AGTGTCCGAT GGGCGCCACC CCGACCCGGC CGCGTTTCCT GATCCGCAAC
CGGCTCATCG CCGCGCTGTC GCGGGGCACC GTGGTGATCG AGGCGGCGGT GCGCAGCGGG
GCGCTCAACA CCGCGGGACA CGCGGTCTCG CTGAACCGCC ATCTGGCCGC CGTGCCGGGG
CCGGTGACCT CGGAGACCTC CGCGGGGTGC CATCGGTTGA TCCGGCAGGG GAGGGCCATC
TGCGTGACCA CTCCCGAGGA GATGATCGAG CTCGTCGGTG CGATGGGCGG CGACCTGGCC
CCCGAACCAC GCGGCCCGGT GCTCCCGCGT GACCGGTTGA GTCCCGAGAT CCGCAGGGTT
CTCGAGGCCG TGCCCGCCCG GACCGGGACG GGCCCGGCGA CGATAGCGGT GGCAGCGGGC
GTCGACCTGG ACACCGTGCT GTCCTGCCTC GGCGCCCTGG CCGCCGCCGG ATACGTCGAG
CGCGCCCCCC GCGGCTGGCG CCTACGCCCT GACGGACCCC CTGCCGGGTA A
 
Protein sequence
MSDRLARVTL MRVAEPGDAM MGRLVAAHGP QAAVARIREG RPEPELARWF ADSPRRESAG 
SRDERLTARL GRMFASWAAR LETADPVRDL DEGERRGARL VVPGDSEWPT QLDDLGESRP
HALWLHGEAD LRFSCLRSVA VVGSRAATPY GTHVAAEFGA GLGERGWVVI SGGAYGIDGA
VHRGALAGET PTVAVLACGA DVAYPSAHHS LFAAVRSQGV LVSECPMGAT PTRPRFLIRN
RLIAALSRGT VVIEAAVRSG ALNTAGHAVS LNRHLAAVPG PVTSETSAGC HRLIRQGRAI
CVTTPEEMIE LVGAMGGDLA PEPRGPVLPR DRLSPEIRRV LEAVPARTGT GPATIAVAAG
VDLDTVLSCL GALAAAGYVE RAPRGWRLRP DGPPAG