Gene Sros_4421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4421 
Symbol 
ID8667715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4933708 
End bp4934991 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content67% 
IMG OID 
Productputative IS4 family transposase 
Protein accessionYP_003340034 
Protein GI271965838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0265932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTGG GCCACAGCGT AGACCCTGCG GTCTGGCAGA TCACCTTCGA TGAACTGATG 
ACCCGGATCG CCGGCCGGTT CGGCCGGGTG GAGCCTCGCC GTACGGCACG GGCCTACCTG
TCCGGGCTGC TGTCGGATAC CGAGCGCAAA AACTGCTGGT GGCTGGCCGA GCATGCTGGA
CACGCCGGAC CGGAGGCGAT GCAACGCCTC CTTCGCACGA CCTGCTGGCA AGCCGATGAA
ATCCGCGATG ACGTACGAGA TTACGTGATC GAACAGCTTG GTCACCCCAG CGGGGTGCTG
ATCGTTGATG AGACGGGATT CATCAAGAAG GGCACCGGCT CGGCCGGTGT GCAGCGGCAA
TACACCGGCA CAGCGGGAAA GATCGAAAAC AGCCAGATCG GCGTGTTCCT CGCCTACGCC
TCACCGAGAG GACGGGCACT GATCGACCGG CGGCTCTACC TGCCGAAAAC CTCCTGGCTG
GCTGATGCTC CCCGCTGCGC GGTGGCCAAA GTACCCGATC AGGCCGCTTT CGCAACCAAG
CCCGCCCTGG CCGGGCAGAT GATCGCCGCC GCTCTTGATG CCGCGGCTCC GGCCGCCTGG
GTGAGTGGGG ATGAGGTCTA CGGCCAAGAC CCGCACCTGC GCCACCTGCT GGAAGAGCGC
GCCGTCGGGT ACGTTTTGGC GATCGCGGGC AACCGGCGGG TGAACCTGGA AGGCACCGAC
CTGCCGGCGG CCCGGATCAG CGCGAAAGTG GCCGATCGAC ACTGGCACCA CTACAGCGCC
GGCGCCGGCG CCAAGGGCCC GCGCTACTAC GCCTGGGCGT GGGCGCGGAT CGACGCCGAC
CGCAGCGGGC ACCACTGGCT GTTGATCCGG CGCAACACCA CCACCGGTGA GCTGGCGTTC
TACCGCTGTT ATGCGCCCGC GCCGATGCCG CTGCTCACCC TGGTGCGCAT CGCCGGTGTC
CGCTGGGCGG TGGAGGAGTC CTTTCAAGCG GCAAAAGGCC AGGTCGGGCT CGACCACTAC
CAGGTACGCA CCTGGACCGG CTGGCACCGG CATATCACGC TGGCCATGCT CGCCCTGGCC
TTCCTTGCCG CCATTGCCGC GGCTCAGGGC CCGGCCGATG ACCGGCAGAT CCCGTTGACG
ATGCCCGAAA TCCGGCGTTT GCTCGCCGTG ATCGTTCTTA GTCCGCCTCG GTCGATCGGT
GAGACCTTGC GCTGGTCACA GTGGCGACGC CGACATCAAG CCCGCGCCCA GCAGGCCCAC
TACCAGCGTC GATCCCAACC ATGA
 
Protein sequence
MAVGHSVDPA VWQITFDELM TRIAGRFGRV EPRRTARAYL SGLLSDTERK NCWWLAEHAG 
HAGPEAMQRL LRTTCWQADE IRDDVRDYVI EQLGHPSGVL IVDETGFIKK GTGSAGVQRQ
YTGTAGKIEN SQIGVFLAYA SPRGRALIDR RLYLPKTSWL ADAPRCAVAK VPDQAAFATK
PALAGQMIAA ALDAAAPAAW VSGDEVYGQD PHLRHLLEER AVGYVLAIAG NRRVNLEGTD
LPAARISAKV ADRHWHHYSA GAGAKGPRYY AWAWARIDAD RSGHHWLLIR RNTTTGELAF
YRCYAPAPMP LLTLVRIAGV RWAVEESFQA AKGQVGLDHY QVRTWTGWHR HITLAMLALA
FLAAIAAAQG PADDRQIPLT MPEIRRLLAV IVLSPPRSIG ETLRWSQWRR RHQARAQQAH
YQRRSQP