Gene Sros_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1661 
Symbol 
ID8664938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1776668 
End bp1777933 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content71% 
IMG OID 
ProductGlycine hydroxymethyltransferase 
Protein accessionYP_003337395 
Protein GI271963199 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAG AGCCGTACTA CGGGCCCGAC TTCGGGCTCC TGCACAGGCA GGACCCAGAG 
GTCGCGCAGG TCCTCCTCGA CGAGCTCGAT CGGCTCCGCG GTGGCCTCCA GCTCATCGCC
AGCGAGAACT TCGCCTCGCC GGCGGTGCTC GCCGCGCTCG GATCGACGCT CACCAACAAG
TACGCCGAGG GGTATCCCGG CAGGCGCTAC TACGCCGGCT GCGAGGTCGT CGACCGGGCC
GAGCGGCTCG CGATCGACCG GGCCAGGCGG CTGTTCGGCG CCGACCACGT CAACGTCCAG
CCGCACTCGG GCGCCTCGGC GAACCTGGCG GCCTACGCCG CGCTGCTCCA GCCGGGCGAC
ACGGTGCTGG CCATGGAGCT GTCACACGGC GGCCACCTCA CCCACGGTTC CAAGGTCAAC
TTCTCCGGCC GGTGGTTCGA CGTCGTCGCG TACGGCGTGC GCAGGGACAC CGAGCTGATC
GACTACGACG AGGTCAGGGA GCTGGCGCTG CGGCACCAGC CCAAGATGAT CATTTGTGGT
GCCACGGCCT ACCCGCGTGA GATCGACTTC GCCGCCTTCC GGGGGATCGC GGACGAGGTC
GGCGCCTGGC TGCTGGCCGA CGTCGCCCAC ACCGTCGGCC TGATGGCCGG AGGCGCCCTG
CCGTCCGCCG TGCCGTACGC CGACGTGGTC ACCTTCACCA CGCACAAGGC GCTGCGCGGT
CCGAGAGGCG GCGGGATCAT GTGCACGCGG GAGCTGGCGG CCCGGATCGA CCGGGCGGTC
TTCCCGTTCG TCCAGGGAGG CCCGCTCATG CACGCGGTGG CGGCCAAGGC GGTGGCGTTC
GGTGAGGCGC TCCGGCCGGA GTTCGCCGAC TACGCGCGCC AGGTGGTGGC CAACGCCCAG
GTGCTGGCCG ACGCGCTGGC CGCCGAGGGG ATGCGGCCCG TCTCCGGGGG CACCGACAGC
CATCTGGCCC TGATCGACCT GCGCGACGTC GGGGTCACCG GTGCGGTGGC CGAGCAGCGG
TGCACCGCCG CAGGGATCAC ACTGAACCGC AACACCATCC CCTACGACCC CGAGCCGCCC
ACGGTGACGT CCGGGATACG GGTGGGAACC CCCTGTGTCA CGACGCAGGG GATGGGGGCC
GAGCAGATGA AAGAGGTGGC CTCGCTGGTG GCACAGGTCA TCCGTAACCC TGACGCAGTG
GGAGAGACCA GGGCGCGGGT GGCGGCCCTC ACGGAGATCC ATCAGATATA TCCCAGCGAA
CTATGA
 
Protein sequence
MGAEPYYGPD FGLLHRQDPE VAQVLLDELD RLRGGLQLIA SENFASPAVL AALGSTLTNK 
YAEGYPGRRY YAGCEVVDRA ERLAIDRARR LFGADHVNVQ PHSGASANLA AYAALLQPGD
TVLAMELSHG GHLTHGSKVN FSGRWFDVVA YGVRRDTELI DYDEVRELAL RHQPKMIICG
ATAYPREIDF AAFRGIADEV GAWLLADVAH TVGLMAGGAL PSAVPYADVV TFTTHKALRG
PRGGGIMCTR ELAARIDRAV FPFVQGGPLM HAVAAKAVAF GEALRPEFAD YARQVVANAQ
VLADALAAEG MRPVSGGTDS HLALIDLRDV GVTGAVAEQR CTAAGITLNR NTIPYDPEPP
TVTSGIRVGT PCVTTQGMGA EQMKEVASLV AQVIRNPDAV GETRARVAAL TEIHQIYPSE
L