Gene Sros_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1210 
Symbol 
ID8664485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1239520 
End bp1242360 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content72% 
IMG OID 
ProductPutative glycosyl/glycerophosphate transferase involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC- like protein 
Protein accessionYP_003336951 
Protein GI271962755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.255767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.840388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAATC CACCGGACTG CAGCGTGGTC GTCATCACCT ACAACGACGC CGTACGGCTG 
CCCAGGGCCG TGCGGTCCGT GCTCGGCCAG TCGCTGCGGA ACATCGAGGT GATCATCTCC
GACGACGCGA GCACGGACGA CACCGAGCGG GTCGTGCGGG AGCTCCAGCG GGAGGACCCC
CGGATCCGCT ACCTGCGCGC CGACGTCAAC AGCGGCGGCT GCGGCGGGCC GCGCAACAGC
GGGGTGGCGG CGGCCCGCGC GCCGTACGTC ATGTTCCTCG ACAGCGACGA CCGGCTGACC
AGGCACGCCT GCAAGAGCAT GGTGCTGGAG ATCGAGCGGA CCGGCGCCGA CTTCGTCACC
GGGCAGATCT CCCGGCTGTA CGAATGGAGC GGCAGGACCC AGCGCTACTA CCCCGACCTG
TACGCCAGGC GCAGGACCGT CGAGGGGATC GCCCAGGAGC CCGAGCTGTT CCTCGACTCC
TTCAGCACCA ACAAGCTCTA CCCGGCCGAG CTGCTGCGGC GGCTGCCCTT CCGGGAGGAC
CTGTACTACG AGGACCACGT CTTCACCACC GAACTCCTCT GCGCGACGCG GCGGTTCGCC
GTGGTCCCGT GGGTCGTCTA CCTCTGGCAC CGGGCCCTGG AGAAGAACGA GGCGCGGCTG
TCGATCTCGC TGAGCATCAA GGAGATGGAC AACGTCCGGC ACCGGATCAG GGCGGCTCGG
CTGAGTGACG ACATCCTGCG CGGCAACGGC CTCGCCCACC TGGTGCCCGA GCGCCAGCGC
CGCTTCCTCC GCCAGGACCT GCGGGTCTAC CTGAACCCGC TGCCCTCGCG TGATCCGGCG
TGGGCCGAGG AGTTCTCCTC GCTGGTCCGG CCCTACCTGG CGGAGATCGA CCCCGAGGCC
TTCAGGCTGG CCGAGCCGGT CACCCGGGTC TGCTGCGGCC TGATCCTCGA CGGCCGGAGC
GACGAGCTCA TGGTCGCCGC CCGGTCCTTG AACGGGCCGT ATGCCGCGCC GCGGCGCGCC
GTGCGGCAGG ACGGCCGGAC CTACTGGGGC AGCGCCCCCG CCGAGAAGAT GGACATCACC
CCGCTCCGGC TGGCCGAGCT GCCCTTCACC TCGGCCCGGC TCCGCCACGA GGCCGCCTTC
ACCGGCGAGG GCACCAGGGT CACGCTGACG ATCAGGACCT ACGACCCGTT CGGGGTGCTC
GCCGCCAACC CGGGCTGGAC CGCGTTCCTG CGGGTCAAGG ACCACCAGGT GCCGCTGACG
CCGCACGAGC AGGACGACGG CAGCCATCTG AGCGAGGCGA CCGTCGACCT CGCCGCGTTC
AGGCACGGCC CCCTGGGCTT CGGCGGCCAC CACGACCCGC TGGTCGGGAT CGTCCGCGCC
GACGGCCGGG TCACCACCGA CCAGCTGCTC GTCGAACCGT CCTGCGAGCC GTTCACCGCC
GCCGTCCCCG GGCACCGGGT CACGGTCCGG GCGGAGGGGC CGGCCGCGTT CCTCCGGGTG
CGGTGGCAGC GCGCGGGCCT GCTCCGGCCG GTGCCCCGGC TGGGCCGGAT CGCCCGCAAG
GTGATCCGGC GGGCGGGCCG GGCGGACGTG AAGCTCCGCG TCTACAAGGG CCTGATCAAG
GTCGTGCCGC CCCGCCGGGA CCTGGCGCTG TTCGAGTCCG ACGTCGGCGC CGGATACACC
GGCAACCCGC GCTACGTCTA CGAGGAGCTC AGGCGGCGGC AGGCGCCCCT CGACATGGTC
TGGTCCGTCG CCAGGGCCAG GCGGAACTTC CCCGGGGACG CCAGGCTGGT GCGGCGGATG
AGCTGGCGCC ACCTGTGGAC CATGGCCAGA GCGGGCTACT GGGTGGACAG CCACGGCATG
CCGCTGGCCT ACCCCAAACC GGCCCGGACC CGCTACCTGC AGACCTGGCA CGGCCAGGGC
ATCAAGTCGG TCGGCTTCAA CTCGCCCGAC CTGCGGGCCG ACTTCCCCGG GCCGCGCGAG
CAGTGGCGGG CGGCGGTGGC GCGCTGGGAC GCGCTGGTCT CGCCGAGCGC CGAGTTCGAG
CGGGTCTTCC TGCCCTCCCA CGGGTACGAG GGGCCGGTGC TGCGCTACGG CTCGCCCCGG
TGCGACGTGC TCGTCCACGG CGACGACGAG GCGGTACGGC GGGCCAGGGA CAGCCTGGAG
ATCCCGCCGG GACGCAGGGT GCTGCTGTAC GCCCCCACCT ACCGGGACCA GGCCAGGTTC
TCGGGGCGGT CGATCCGCGC CGACCTGGCC ATGATGGCCG AGGCTCTGGC CGGCGAGTGG
GTGGTGATCC TGCGCCCCCA TCCGGTCGAG CGCTACCAGG TCCCCCAGCA CCTGCGCCAC
TTCGTCCGCC CGGCCGGGAG CTACCCCGAG GTCAACGACC TGATGCTGGC CTCGGACGCG
CTGCTGACCG ACTACTCCTC GCTGATGTGC GACTACGCGA TCACCGGCCG GCCGATGCTG
TTCCTGATCG ACGACTGGGA GGAGTACCGG CATGTCGAGC GGGGCGTCTA CCACGACCTG
CCGTCGATCG CCCCCGGCCC CTGCCTGGCC ACCACGGAGG AGCTGATCGA CGCGCTGCGC
GACCTCGACG GCGTGGCCGC GTCGTTCGCC GCCAGATACA TCGAGTTCCG CAGGATGTGG
TGCGCCGACG AGAAGGGCCA TGCCTCGGCG AAGGTGGTGG ACGCCTTCTT CGGCCCGGCG
CGGCCCGGGG CCGGGGCGCC GGCCACCCCG CCCTTCACCG CGCGGCGGCC GATCCGGCTG
CCCCGGCCGG ATCGGTCACC GGTCCGGCTC TCCCTGCCGG GATGGCTGAT GCGGCTGTCC
CTGCCGGAAC GGCCGAGGTG A
 
Protein sequence
MINPPDCSVV VITYNDAVRL PRAVRSVLGQ SLRNIEVIIS DDASTDDTER VVRELQREDP 
RIRYLRADVN SGGCGGPRNS GVAAARAPYV MFLDSDDRLT RHACKSMVLE IERTGADFVT
GQISRLYEWS GRTQRYYPDL YARRRTVEGI AQEPELFLDS FSTNKLYPAE LLRRLPFRED
LYYEDHVFTT ELLCATRRFA VVPWVVYLWH RALEKNEARL SISLSIKEMD NVRHRIRAAR
LSDDILRGNG LAHLVPERQR RFLRQDLRVY LNPLPSRDPA WAEEFSSLVR PYLAEIDPEA
FRLAEPVTRV CCGLILDGRS DELMVAARSL NGPYAAPRRA VRQDGRTYWG SAPAEKMDIT
PLRLAELPFT SARLRHEAAF TGEGTRVTLT IRTYDPFGVL AANPGWTAFL RVKDHQVPLT
PHEQDDGSHL SEATVDLAAF RHGPLGFGGH HDPLVGIVRA DGRVTTDQLL VEPSCEPFTA
AVPGHRVTVR AEGPAAFLRV RWQRAGLLRP VPRLGRIARK VIRRAGRADV KLRVYKGLIK
VVPPRRDLAL FESDVGAGYT GNPRYVYEEL RRRQAPLDMV WSVARARRNF PGDARLVRRM
SWRHLWTMAR AGYWVDSHGM PLAYPKPART RYLQTWHGQG IKSVGFNSPD LRADFPGPRE
QWRAAVARWD ALVSPSAEFE RVFLPSHGYE GPVLRYGSPR CDVLVHGDDE AVRRARDSLE
IPPGRRVLLY APTYRDQARF SGRSIRADLA MMAEALAGEW VVILRPHPVE RYQVPQHLRH
FVRPAGSYPE VNDLMLASDA LLTDYSSLMC DYAITGRPML FLIDDWEEYR HVERGVYHDL
PSIAPGPCLA TTEELIDALR DLDGVAASFA ARYIEFRRMW CADEKGHASA KVVDAFFGPA
RPGAGAPATP PFTARRPIRL PRPDRSPVRL SLPGWLMRLS LPERPR