Gene Sros_1887 details

Gene Information

Locus tag: Sros_1887
Symbol:
ID: 8665165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2005219
End bp: 2006286
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: N-acetylglucosamine kinase-like protein
Protein accession: YP_003337618
Protein GI: 271963422
COG category:
COG ID:
TIGRFAM ID:
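
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2005219, 2006286)
    print(length)                     # gene length in bp
    print(protein_length_aa(length))  # protein length in aa
```

Under these assumptions the coordinates give 1068 bp and 355 aa, which agrees with the Gene Length and Protein Length fields above.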


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGTTC TCGGACTGGA CGTCGGCGGC ACCTCCTCGC GCGCGCTCCT GCTCGACGCC 
TCGGGGCGGC GGATCGGGTA CGGCAGGGCT CCCGGCGGCA ATCCGGCGGC CCACGGCACC
GGCACGGCCG CCGCCAACAT CCGGCAAGCG CTGGAGCCCG CCCTCCTCGG GGTGGACCCG
GCGGAGGTGG TGGGGGCGGT CGTCGGGATG GCCGGGGTCG GGGCGCTCGA CCGGGCCGTC
TTCGACCGGA TGTGGGCCTC CGCCGGGCTG CGCTGCGTCC CTGCCGTGAC GGGCGATCTC
GGCGTCGCCT TCGCCGCCGG TACGGCGGAG CCGCGCGGCA CCGTGCTCAT CGCGGGGACC
GGCGCCATCG CCGCCCGCAT CGAGGACGGC GAGCCGGTGG CGGTCTCCGA CGGGCTCGGC
TGGCTGCTCG GGGACCAGGG ATCGGGTTTC TGGCTGGGGC GGGAGGCGGC CCGCGCGGCC
GTCCGGGGTC TGAGCCGGGG CGAGAGCGAC GGCCTGCTGA CGCGCCTGGT CGCCGAGGAG
ATCCGCGACA GCGACGGCCG TGACGGCCCT GACGGCCGCG GTGTCCAGGA CAGTCGGGAT
GGCCACGACG TCCGCGACGG TCGGGACGTC CAGGACGGTC GGGATGGCCG CGATGGTCGC
GCGGCCGGAT GGCCGCCCGT GGACGGCAGG GCGGAGGCCA TCCGGCTCGT GGTCCACGCC
CAGGGACACT CCCCGCTGGA GCTGGCGAGG CTGGCGCCGC TGGTGAGCCG GGCCGCCGCC
GCGGGCGACC CCGACGCGCT GAAGATCGTG GCGACGGCGG CCGGGCTGCT CTGCGCGACG
GTGGCCGAGG TGCGCCAGGA GGGGGAGGAC ACCCCCATCG TGCTGGCCGG GAGCGTGCTG
ACCAGCGAGG GGCCGGTGTG CTCCGCCGTA CGGGACGGGC TCGGCGCGCC GACGGCCCTG
GCCGGCGACG GTGCCGCGGC GGCGGCCTGG CTGGCGGCGA AGGAGGCGTT CGGCCTGGAC
CGGGAGGCGG CGGCGCGGCT CCACCGGCGG ATCCTGCGGG AGGCGTGA
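
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage. The helper names `clean_sequence` and `gc_content` are hypothetical, and the result for the full gene is deliberately not asserted here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as displayed in the record.
    first_blocks = clean_sequence("ATGTACGTTC TCGGACTGGA")
    print(len(first_blocks))
    print(round(gc_content(first_blocks), 1))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.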
 
Protein sequence
MYVLGLDVGG TSSRALLLDA SGRRIGYGRA PGGNPAAHGT GTAAANIRQA LEPALLGVDP 
AEVVGAVVGM AGVGALDRAV FDRMWASAGL RCVPAVTGDL GVAFAAGTAE PRGTVLIAGT
GAIAARIEDG EPVAVSDGLG WLLGDQGSGF WLGREAARAA VRGLSRGESD GLLTRLVAEE
IRDSDGRDGP DGRGVQDSRD GHDVRDGRDV QDGRDGRDGR AAGWPPVDGR AEAIRLVVHA
QGHSPLELAR LAPLVSRAAA AGDPDALKIV ATAAGLLCAT VAEVRQEGED TPIVLAGSVL
TSEGPVCSAV RDGLGAPTAL AGDGAAAAAW LAAKEAFGLD REAAARLHRR ILREA
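
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the set of permitted start codons, so the sketch below builds the standard assignments by hand instead of relying on an external library. The `translate` function and its `to_stop` parameter are illustrative names, and mapping unrecognized codons to `X` is an assumption.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; whitespace in the input is ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGTACGTTC TCGGACTGGA CGTCGGCGGC"))  # MYVLGLDVGG
```

The first 30 bases of the gene translate to `MYVLGLDVGG`, matching the first block of the protein sequence above.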