Gene Sros_2858 details


Gene Information

Locus tag: Sros_2858
Symbol:
ID: 8666144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3104380
End bp: 3106818
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Transglutaminase-like protein protein-like protein
Protein accession: YP_003338558
Protein GI: 271964362
COG category:
COG ID:
TIGRFAM ID:
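
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3104380
end_bp = 3106818

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.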


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.775139
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.338432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACTTC CCCTGGCGTC GGGAGCGGCC ACGGGAGCGG TCGCGATCGC GCTCTATCCC 
CTGTTCCAGG GCGGTGCCTG GTTCTGGACA TGCCTGGGGG CCATCCTGGT CGTCACCGGG
ATCGGCATGC TGGGCAGCCG CTACACCCTG CCGAACTGGC TGGTGCCGCC GGCCCAGCTC
GTCGCGGTGT GGATCTACCT GACGGCGGTG TTCACCGGCG ACAAGGCGTG GATCGGCCTC
CTGCCCACCA AGGAGTCGGT CATCGGGCTG GCGAAGCTGC TGGGCACCGG CTTCGCCGAC
ATCCAGCGGT TCGCCGCCCC CGTCCCGTCC GGGACCGGCA TCACCCTGCT GACCTGTGGC
GGCGTCGGCC TGATCGCGAT CCTCGTCGAC CTGCTGGCCT CCCGGCTGCG CCGGGCCGCG
CTCACCGGGC TGCCGCTGCT GGCGCTGTTC ACCGTTCCGG CCGCGGTGGC CTCCGAGCCG
ATCATGTGGC CGATGTTCAT CATCGCCGCC CTGGGCTACC TGGGGCTGCT GGCCGCCGAC
GGGCGGGAGC GGATCACCCA CTGGGGCAGG GCGGTGCTGG TGCGCCGGGC CCCGTCCTTC
GTGGCCCCGA AGCCCGCCTC CGACGCCGGG ACCCTGCGGC TGTCGGGCAA GCGCATCGGA
TTCGCCGCGA TCGCGCTGGC CATCCTGCTG CCCGCGCTGC TGCCCACCCT GGAGCCGAAC
CCGCTGTTCG GTTTCGGCGT GGGCAACGGC AAGGGCAGGG GCGGCAACTC GATCAGCATC
CCGAACCCGA TAGTCAACCT GCGGGGCCAG CTCTCGCTGC CGAACAACTC CACCGTCCTG
ACCTACACCG GCAGCGACAG CTTCCCGCGC TACCTGCGGC TGTACGCGCT GGACATCTTC
GACGGCGAGC AGTGGACGAT GAAGGGGCCG CAGGGGCATC CCGAGGACCG GATCTCCGAG
GGACCGATGC CGCCGGCCCC CGGCCAGAGT CCGAGCATGC CCGTCACGCA GGCGACCCTC
AAGATCAAGG TGAGCGACCA GTTCAGGGAG CCGCTGAAGT TCCTGCCGCT GCCCTACCCC
GCCGGCCGGG TCCGCATCGA GGGAGACTGG CGGCCAGACC GGGACACGCT CATGGTGTTC
TCCACTCTGG ACACCGCCGG AGGGCTGTCC TACGAGGTCG TCACCAACGA GCCCCAGCCC
ACGGCGGAGA CGCTCAAGGC GGCGCCGCCG GCTCCCCCCG AGATCAACGA GCGCTACCTG
CAGCTGCCGG AGAACCTGCC CCCGGAGATC AGGGATCTGG CCAGGCAGGT GACGGAGCGG
GACGCCAGCC GCTACGAGCA GGCGATCCAG CTCCAGGAGT TCTTCACCAA GGACGGCGGG
TTCACCTACA GCCTGGCCAC CCAGGGGCAC AACGGGCCCG CGCTGACGGA GTTCCTGCTC
CGCGCCCGGA CCGGGTACTG CGAGCAGTTC GCCGCCTCGA TGGCGGTGCT GGCGCGGATC
CTGGGCATCC CCGCCCGCGT CGCCATCGGC TACACCGGCG GCAGCAACGC CTCCGGCGAG
TGGCAGGTGC GCACGCACGA CTCGCACGCC TGGCCCGAGC TCTACTTCGA GGGCACGGGA
TGGCTCCGCT TCGAGCCCAC CCCGACCGGC GGCGCGGGGC AGGGCAGCGC GACGGTGCCG
CTCTACACCC GGCCGGAGAT CGTGACGACC GGCGGCGGCG AGCCGGCGGC GACCCCGACC
TCGAACTCCA CCGACGCGGA CCTCCCCAAC CAGTCGGCCG GCAACCGCCG CGACCTGATG
CCCGACCGCG ACTTCGGCGG GGTGCCGGTC GCGGTGGACG AGGGCGTGCC GGTCGCGCTG
CAGGTCGGCC TCGGGGCCGT GGTCGTGCTG CTGATCGCGC TGATCCCCGG GCTGCTCCGC
TGGGAGCTGC GCGCCCGCCG GCGGCGCTCC TACTCCCGCG CGACGGAGGA CCCGCCCCCG
CACGAGGAGG CGGTCACCGC GGAGCCCACC GGGATCGTGG TGGTCAGGAC AGACCGGCGG
GAGGCCGCGG TGGTCGCGGC CTGGGCGGAG CTCGACGACG CGCTGTACGA CTACGGCATG
TCCCGGCAGC CGAGCGAGAG CCCCCGGGCG CTGGCCCGGC GGCTCACCGA GCAGTACGAG
TTCGACGCGG AGACCTCGGC CTCGATGGCC AGGATCGCCA CGGCGGTGGA GCGGATGCTC
TTCGCCCGGA CCCCCGGAGA CCTCGGCTCG CTGCCCGGGG ATCTCCGCCG GGTACGGCGG
GCGCTGGCGG CCACCGTGAC GCGGGGCAGG CGGATCCGCG CGACGCTGCT GCCGCCTTCG
ACGCTCCTGC GGGCGCGACG CCTGGGGACG CGGATGCTGG ACGGCTTCGA CCGGCTGGAG
AACCTCAGGC TCCGCCGGCC TGCCCGGAGG GACGCCTGA
 
Protein sequence
MRLPLASGAA TGAVAIALYP LFQGGAWFWT CLGAILVVTG IGMLGSRYTL PNWLVPPAQL 
VAVWIYLTAV FTGDKAWIGL LPTKESVIGL AKLLGTGFAD IQRFAAPVPS GTGITLLTCG
GVGLIAILVD LLASRLRRAA LTGLPLLALF TVPAAVASEP IMWPMFIIAA LGYLGLLAAD
GRERITHWGR AVLVRRAPSF VAPKPASDAG TLRLSGKRIG FAAIALAILL PALLPTLEPN
PLFGFGVGNG KGRGGNSISI PNPIVNLRGQ LSLPNNSTVL TYTGSDSFPR YLRLYALDIF
DGEQWTMKGP QGHPEDRISE GPMPPAPGQS PSMPVTQATL KIKVSDQFRE PLKFLPLPYP
AGRVRIEGDW RPDRDTLMVF STLDTAGGLS YEVVTNEPQP TAETLKAAPP APPEINERYL
QLPENLPPEI RDLARQVTER DASRYEQAIQ LQEFFTKDGG FTYSLATQGH NGPALTEFLL
RARTGYCEQF AASMAVLARI LGIPARVAIG YTGGSNASGE WQVRTHDSHA WPELYFEGTG
WLRFEPTPTG GAGQGSATVP LYTRPEIVTT GGGEPAATPT SNSTDADLPN QSAGNRRDLM
PDRDFGGVPV AVDEGVPVAL QVGLGAVVVL LIALIPGLLR WELRARRRRS YSRATEDPPP
HEEAVTAEPT GIVVVRTDRR EAAVVAAWAE LDDALYDYGM SRQPSESPRA LARRLTEQYE
FDAETSASMA RIATAVERML FARTPGDLGS LPGDLRRVRR ALAATVTRGR RIRATLLPPS
TLLRARRLGT RMLDGFDRLE NLRLRRPARR DA