Gene Sros_4694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4694 
Symbol 
ID8667988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5218651 
End bp5219862 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003340285 
Protein GI271966089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.275408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCG ACTACGCGGT GTTTGTCGGC TTAGACGTGG GCAAGGGCGA ACAGCACGCC 
TGCGCCCTGG ACCCTGGCGG CAAGAAACTC CACGACAAGC CCCTGCCCAA CGACGAACAG
CGCCTGCGCG CGCTGTTCGG CAAGCTCAAC ACCCACGGGC CAGTGCTCGT CGTGGTCGAC
CAGCCCGCCT CCATCGGCGC ACTGCCCGTC GCCGTGGCTC GCGCCGAAGG CTGCCAGGTC
GCCTACCTGC CCGGCCTGAC CATGCGCCGC CTGGCCGACC TGCACCCCGG CAACGCCAAG
ACCGACGCCC GCGACGCCTT CATCATCGCC GACGCCGCTC GCACCTTGCC GCACACACTT
CGACGGGTCG ACGCCGGCGA CGAAGCCCTC GCCGAGCTGG AGGTTCTCGT CGGATTCGAC
GATGACCTGG CCGCCGAGGC GACCCGGGTG ACCAACCGGA TCCGCGGACT GCTCGTCACC
ATCCACCCGG CGCTGGAGCG GGCGCTCGGG CCGCGCCTGC ACCATCCCGC CGCCCTGGAG
CTCCTGGCAC GCTTCGGCGG CCCGAACGGC CTGCGAGACG CCGGACGCGA GCAGCTCCTG
ACAGTCGCCC GGCCACTGGC TCCGCGCATG GCGGGCCGTA TGGTCGACGA CGTCTGGGCC
GCGCTCGAAG CCCAGACAGT CCTCGTCCCC GGAACCAGCG CGGCCGAGAC CGTCCTGCCC
CGCCTGTCAC AATCGCTACG AAGCGTGCTC GACCAGCGCA AACAGGTCGC GGCCGAGGTA
GAGGCGATGC TTGATGCCCA CCCTCTCGCC AAGGTCCTGA TCACCATGCC CGGGCTCGGG
ATCAGGACCA CCGCACGGCT CCTGCTGGAG ATCGGCGACA TCTCCGCCTT CGCCACCCCC
GGGCACCTCG CCGCCTACGC CGGGCTCGCC CCGGTGACCC GCCGCTCCGG TTCGTCGATC
AAGGGTGAAC ACCCGCCCAA GGGCGGCAAC AAGGCACTGA AACGGGCCAT GTTCCTCGCC
GCGTTCGCAT CTCTGTCCGA CCCCGAAAGC AGGGAGTACT ACGACAAGAA GCGCGCCGAG
GGCAAGAAGC ACAACGCCGC CCTGATCTGC CTCGCCCGTC GCCGTTCAGA CGTCATCTAC
GCCATGCTCC GCGACCGCAA GCCCTACCAA CCCCGCCGGA AGAACCGCAC CCGAAAGCCC
TCGGCCGCTT GA
 
Protein sequence
MSVDYAVFVG LDVGKGEQHA CALDPGGKKL HDKPLPNDEQ RLRALFGKLN THGPVLVVVD 
QPASIGALPV AVARAEGCQV AYLPGLTMRR LADLHPGNAK TDARDAFIIA DAARTLPHTL
RRVDAGDEAL AELEVLVGFD DDLAAEATRV TNRIRGLLVT IHPALERALG PRLHHPAALE
LLARFGGPNG LRDAGREQLL TVARPLAPRM AGRMVDDVWA ALEAQTVLVP GTSAAETVLP
RLSQSLRSVL DQRKQVAAEV EAMLDAHPLA KVLITMPGLG IRTTARLLLE IGDISAFATP
GHLAAYAGLA PVTRRSGSSI KGEHPPKGGN KALKRAMFLA AFASLSDPES REYYDKKRAE
GKKHNAALIC LARRRSDVIY AMLRDRKPYQ PRRKNRTRKP SAA