Gene Sros_1430 details

Gene Information

Locus tag: Sros_1430
Symbol:
ID: 8664705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1492152
End bp: 1493834
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_003337167
Protein GI: 271962971
COG category:
COG ID:
TIGRFAM ID:
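
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 1492152
end_bp = 1493834

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # complete codons in the CDS
protein_length = codons - 1           # assumes the final codon is a stop

print(gene_length, protein_length)    # prints: 1683 560
```

Both values agree with the Gene Length and Protein Length fields of the record.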


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.709224
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.150421
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGCGC TCACGGGGGT GGGCGTGAGT CCGGGCGTCG GCTTCGGGCC GGGTTACGTC 
CTCGCGTACG AGACACCGGT GCCGGCGCCC GACGCCCGGC ACGGAGGGGA CGCCGGGGCG
GAGCGGGACA GGGCGGCGCG GGCGCTGGAG CGGGTCGCGG CCGACCTGGA GGAGCGGGGG
GTCCGGGCCG GCGGAGAGGC TCAGGACATC CTGAGCGCCC AGGCGCTGAT GGCCCGTGAC
CCCGGCCTCG CCGTCGGGGT GGCCGGCCTG ATCGACGCGG GCAAGAGCGC GGCCCGCGCG
GTCTACGAGG CGTTCGGCGG CTACCGCGAG ATGCTCGTCG ACGCCGGGGG ATACCTGGGC
GAGCGCGCCG CCGACCTGGA CGACGTCAGG GACCGCGCCA TCGCGGTGAT CGAGGGTCTG
CCGGTGCCCG GCCTGCCCTC CCGGGCGGCG CAGCCGTACG TGCTCATCGC CCGTGACCTG
GCGCCCGCCG ACACCGCGCT GCTCTCCCGG GACGTGGTGG CCGCCTTCGT CACCGAGCAG
GGCGGGCCGA CCAGCCACAC CGCGATCCTC GCCCGCGCGA TGGGGGTGCC GGCCGTCGTC
GCCTGTCCGG GGGCCACCTC GATCGCGCCG GGCACGCCGG TGCTGGTGGA CGGGGTCTCC
GGTCTCGTAC GGCCCGCGCC CTCGGAGGAG GAGGTGGCCA CGGCGAGGAA CGCCGCCTCC
GCCAGGGACG CCGTGCTCGC CGCCACCACC GGGCCGGGGA TGACCGCCGA CGGCCACGCC
GTGCCGCTGC TGGCCAACAT CGGCGGGCCG CGTGATGTGG ACGGCGCCCT GGAGCACGGG
GCGGAGGGCG TCGGGCTCTA CCGGACGGAG TTCCTCTTCC TGGACCGGAC GACGGCGCCG
TCGGGGGAGG AGCAGGAGGC CGCCTACCTG GAGGTGCTGG AGGCCTTCCC GGGCGGCCGG
GTGGTCGTGC GGACGCTCGA CGCGGGGGCG GACAAGCCGC TCGCCTTCCT GCCCCCGCAG
GGCGAGGAGC CGAACCCGGC GCTGGGGCAG CGCGGGCTGC GGCTTCTCAG GGCGCATCCG
GAGATCCTGA ACACGCAGCT GGCCGCCCTG GCCAGAGCCG CCGCCCGCTC CTCGGCCAAG
CTCCAGGTGA TGGCCCCGAT GGTGGCCACC GCTGAGGAGA CCGCCTGGTA CGTGGCGACC
TGCAAGGAGG CGGGGCTGCC GTCCGCGGGC GTGATGATCG AGATCCCGGC GGCCGCCCTG
CGCGCCGCCG ACCTGGCCGA GGAGGCGGAC TTCTTCTCGC TGGGCACCAA CGACCTGACC
CAGTACGCCT TCGCCGCCGA CCGACAGGTG GGCGCGCTGA CCGCGCTGCA GGACGCCTGG
CAGCCCGCGC TGCTCGACCT GGTCGCCCTG GCCGTGGCCG GGGCCGCCGA GCACGGCAGG
CCGTGCGGGG TGTGCGGGGA GGCCGCCGGC GACCCGGTCC TGGCCTGCGT GCTGGCCGGG
CTCGGCGTCA CCTCGCTGTC GATGGCCCCT CCGGCGCTGC CCGCCGTACG GGCCGCGCTG
TCGCGGCACA CCCGCGAGCA GTGCCGTCTC GCCGCGCAGG CGGCCCTGGC CGGGACCTCC
CCGCAGGAGG CGCGTGCCGC GGCCCGCTCC CACCTGCCCG GACTGGCCGG ACTGGCTCTG
TGA
 
Protein sequence
MTALTGVGVS PGVGFGPGYV LAYETPVPAP DARHGGDAGA ERDRAARALE RVAADLEERG 
VRAGGEAQDI LSAQALMARD PGLAVGVAGL IDAGKSAARA VYEAFGGYRE MLVDAGGYLG
ERAADLDDVR DRAIAVIEGL PVPGLPSRAA QPYVLIARDL APADTALLSR DVVAAFVTEQ
GGPTSHTAIL ARAMGVPAVV ACPGATSIAP GTPVLVDGVS GLVRPAPSEE EVATARNAAS
ARDAVLAATT GPGMTADGHA VPLLANIGGP RDVDGALEHG AEGVGLYRTE FLFLDRTTAP
SGEEQEAAYL EVLEAFPGGR VVVRTLDAGA DKPLAFLPPQ GEEPNPALGQ RGLRLLRAHP
EILNTQLAAL ARAAARSSAK LQVMAPMVAT AEETAWYVAT CKEAGLPSAG VMIEIPAAAL
RAADLAEEAD FFSLGTNDLT QYAFAADRQV GALTALQDAW QPALLDLVAL AVAGAAEHGR
PCGVCGEAAG DPVLACVLAG LGVTSLSMAP PALPAVRAAL SRHTREQCRL AAQAALAGTS
PQEARAAARS HLPGLAGLAL
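
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration only: it builds the codon table from the usual compact amino-acid string, which translation table 11 shares with the standard code for amino-acid assignments (the two differ in permitted start codons). The helper name `translate` and the constants are hypothetical, not part of the record.

```python
from itertools import product

# Codon table in T, C, A, G order. The amino-acid assignments below are
# the ones used by translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace between blocks of bases."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCGCGC TCACGGGGGT GGGCGTGAGT"
print(translate(first_blocks))  # prints: MTALTGVGVS
```

The output matches the first ten residues of the protein sequence. Translating the final codon, `TGA`, gives `*`, the stop that is not counted in the 560 aa protein length.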