Gene Sros_4029 details


Gene Information

Locus tag: Sros_4029
Symbol:
ID: 8667323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4484876
End bp: 4487152
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: ATPase-like protein
Protein accession: YP_003339680
Protein GI: 271965484
COG category:
COG ID:
TIGRFAM ID:
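
The length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon, which encodes no residue


length = gene_length(4484876, 4487152)
print(length)                  # 2277
print(protein_length(length))  # 758
```

Both values agree with the Gene Length and Protein Length fields of the record.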


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00633058
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker


Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.086403
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGCGTGGCC GGGGGCCTGA GTGGGCGGTG ATCCTCAGAC TGCTGGGGAC CACCGCGGAC 
ACGACCGGGA GCATGGTGCT CGTGGAGGGC GAGCCGGGGA TGGGCAAGAG CCTCCTGCTC
GCGGAGGCGG CGACCGTGGC CGCCGCGCGC GGGACGGCGG TCGCGACCGG CCGGGCGGAC
GAGCTGGGAC GGCTGACGCC ACTGGGACCG CTGCTGTCCG CCCTGGCGGA GTGGCCGCCC
CCCGTGGCCC TCGCCAGGGG CGACTCCCCC GACGGGCCGG GGCTGCTGTT GTGGCTGGCC
GAGCAGGTGC GGACCACGCT GGAGAGGCGG ACGGAGTCGG AGCCGCTGCT GGTGACCCTG
GACGATCTCC ACTGGGCCGA CCCCGCCACG CTCATGGCCC TGCGGACACT GCCCTCGCGG
CTGGCCTCCC ACCCGCTGGC ATGGCTGCTG GCCCGGCGTA CCACCTCCGA GCACGACGAC
TCCGGCCGGC TGTTCGACCT CCTGGAGGAG GAGGGCGCCA CGCGGATCCG CATGCTCCCC
CTCGACGACG GCGCCGTGGC CGAGGTGATC GCCGACACCC TGGACGCGGC TCCCGGGGAG
GAGCTGGCGG CGCTGTCCGC CCAGGCGGGC GGCAACCCGT TCCTCCTCCT CGAACTTCTC
ACCGGACTGC GAGAGGAGGA CGCCGTCCGG GTGAGCTCGG GCGTCGCCCG GCTGACCGGG
ACGGCGCTGC CCCAACGCGT CCACACCGCC GTCCGGCGGC GCCTCGGCGA GGTGAGCACC
AAGACCCGGC ACCTGCTGGA GGCGGCAGCG GTGCTCGGGC GCTCCTTCTC CCCCGCGTAC
GCGGCCGAGA TGCTCGGCGA GACGCCGGCC GCGCTGCTCC CCGCCCTGGA GGAGGCGATC
GCGGCAGGCG TGCTCGCCGC GACCTCCGAC GAGCTGACGT TCCGGTACGA GCTGGTGTGG
CGGTCGATCG TCGAGACGGT GCCCGCGCCG GTACGGCAGG CGCTGCATCG TCAGATCGAG
CGGAACCCGC CTGCCCGGCA ACCCCCTCGC CACCAGCGCG CGGACGGCTC GTCCGCCGAG
ACCCTCGTGC TGCTGTCGCT GCTCGCGTGG GACGAGGGCC GGCTCTCGCG CGGGCTGGAC
CTGGCCCGCG AGGCCGCGGA GATGCCCGGC AGCGGGCGCC ATCCACAGCC GCGCGCCGTC
CTGACGACGA TGCTGACCGA CCTCTGGCTG CTCGACGAGG CGGAGACGGC GCTGGCGGGC
GCCGCCGAGG AGGCGGTGGG GCAGCCCGAG TGGACCGCGG AGATCCCCGT TCTCCGCGCC
CGCCTCGATC TCACCGCCGG CCGGCTCGGC AGCGCGATCG AGCACGCGGA GGCGGGACTG
ACCGCCGCCG ACACCCTGGG CGTCCCGGCC CTCGCCGCCT CGGCCCTGTC GGTGCTCGGC
GCCGCTTCAC TGCGGGCGGG CGACCTTCCC CGGGCGATCA GGTATCTGAA GAACGATCTG
GCCCGGGGGC CCCAGGGCGC GCCGCCGTAC GTGCGGCTGC GGTCGGAGCT GGCGGCCGGG
CGGATCGACG AGGCGCGCGG CGGGGCGGCG AGCGGGATGA GCTCGCTCAG CCGGGTCTAC
GACGCGCTGC CCCGGCACAC CGGGGCGCTG ACCAGCGAGC CCACCGCCGC CGCATGGCTG
GTGCGGGTGG CGATGGCGGT ACGGGACGAG CGCCGGGCCG ATGCCGTGGT GGACGCGGCC
GAGGGCATCG CCTGGCGCAA TCCCTGCCTG CTCAACCCCT CCGTGGCCGC CGCCCACGCG
CGCGGCGTGC GTGACCGCGA CCCGGAGGCG CTGGTCCGCG CCATGACCGA GCACACCGAT
CCATGGGCCA GAGCCTCGGC GGCGGAGGAT CTCGGCGTGC TGCTCAGCGC GGACTTCGAC
GCCCGCGACC TGGCCGTCGA CAGTCTCAAC AGCGCGCTGG TCTGCTACGG ATCGGTGGAG
GCGGCCCGCG ACGCGGCACG GGTCCGGGGC AGGCTGCGGG GGCTGGGCGA GCGCCGCCGC
CACTGGGTCC GGACCGATCA TCCCGTGTCG GGGTGGGCCA GCCTGACCGG CACCGAGCGG
GCCGTCTGCG ATCTCGTCGC CCAGGGGCTG ACCAACCGGC AGGCGGCCGA GCAGATGTTC
ATCAGCGAGC ACACCGTGGC CTTCCACCTG CGCCAGGTCT TCCGCAAGCT GGGCATCCAC
TCGCGTGTCG AGCTGGCCCG GCTCGCCGCG CAGCAGAGCG TCCGGCCGGA TCACTGA
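
The sequence above is displayed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The following is a minimal sketch of that step, with a GC-fraction helper and a start/stop codon check; the helper names are hypothetical, and only the first and last displayed lines are used as sample input, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11

# First and last lines of the gene sequence as displayed above (a fragment only).
first_line = clean_sequence(
    "ATGCGTGGCC GGGGGCCTGA GTGGGCGGTG ATCCTCAGAC TGCTGGGGAC CACCGCGGAC"
)
last_line = clean_sequence(
    "TCGCGTGTCG AGCTGGCCCG GCTCGCCGCG CAGCAGAGCG TCCGGCCGGA TCACTGA"
)

print(first_line[:3])                 # ATG
print(last_line[-3:] in STOP_CODONS)  # True
```

Passing the whole Gene sequence block through `clean_sequence` should yield 2277 bases (37 lines of 60 plus a final line of 57), and `gc_fraction` on that string can then be compared with the GC content field.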
 
Protein sequence
MRGRGPEWAV ILRLLGTTAD TTGSMVLVEG EPGMGKSLLL AEAATVAAAR GTAVATGRAD 
ELGRLTPLGP LLSALAEWPP PVALARGDSP DGPGLLLWLA EQVRTTLERR TESEPLLVTL
DDLHWADPAT LMALRTLPSR LASHPLAWLL ARRTTSEHDD SGRLFDLLEE EGATRIRMLP
LDDGAVAEVI ADTLDAAPGE ELAALSAQAG GNPFLLLELL TGLREEDAVR VSSGVARLTG
TALPQRVHTA VRRRLGEVST KTRHLLEAAA VLGRSFSPAY AAEMLGETPA ALLPALEEAI
AAGVLAATSD ELTFRYELVW RSIVETVPAP VRQALHRQIE RNPPARQPPR HQRADGSSAE
TLVLLSLLAW DEGRLSRGLD LAREAAEMPG SGRHPQPRAV LTTMLTDLWL LDEAETALAG
AAEEAVGQPE WTAEIPVLRA RLDLTAGRLG SAIEHAEAGL TAADTLGVPA LAASALSVLG
AASLRAGDLP RAIRYLKNDL ARGPQGAPPY VRLRSELAAG RIDEARGGAA SGMSSLSRVY
DALPRHTGAL TSEPTAAAWL VRVAMAVRDE RRADAVVDAA EGIAWRNPCL LNPSVAAAHA
RGVRDRDPEA LVRAMTEHTD PWARASAAED LGVLLSADFD ARDLAVDSLN SALVCYGSVE
AARDAARVRG RLRGLGERRR HWVRTDHPVS GWASLTGTER AVCDLVAQGL TNRQAAEQMF
ISEHTVAFHL RQVFRKLGIH SRVELARLAA QQSVRPDH