Gene Sros_1165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1165 
Symbol 
ID8664440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1188555 
End bp1189688 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content73% 
IMG OID 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_003336906 
Protein GI271962710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.816168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA CACTCGCCGA CGCCCGCATC GTGACCCCCG AAGGTGTCCA CGGAGGCTGG 
CTCACCATAG AAGACGGCCG CATCACCCAC ATCGGCCAGG GGTCCGCGCC CGGACCCGGC
CACAGCCTCG CGGGCCGGTA CGTCGTGCCG GGATTCGTCG ACATCCACAA CCACGGCGGG
GCGGGCGGCT CCTTCCCCAC CGGCGATCCG GACCAGGCGA GCCGGATCGC CGCCCTGCAC
GCCCGGCACG GCACCACCAC CCTCATGGCC AGCCTGGTCA CCGCGGCCCT CGACGACCTG
GCCGGGGCGA CCTCCGCCCT GGCCGACCTG TGCGAGGACG GCCTGCTGGC CGGCATCCAC
TTCGAGGGCC CCTACATCTC CAAGGCCCGC TGCGGCGCGC ACAACCCGGC GCTGCTCCGC
GAGCCCTCCC CGCGGGAGTT CGGCGACCTG CTCAGGGCCG GGCGCGGCCA CGTGCGGATG
CTCACCATCG CCGCCGAGCT GCCCGGCGCG CTGGACACCA TCCGGGAGGC GGTCGCGAAC
AACGTGATCG CCGCGCTCGG GCACAGCGAC GCCACCTACG AGCAGACCAT CGCGGGCATC
GACGCGGGCG GCAGCGTCGC GACCCACCTC TACAACGCGA TGCCGCCGCT GCACCACCGC
GACCCCGGCC CGATCGCCGC CCTGCTGCAG GACGAGCGCG TCACGATCGA GCTGATCAAC
GACGGCGTGC ACCTGCACCC GGCGATGATG CGCCTGGCCT ACGACGTCGC GGGGCCCGGC
CGTACCGCGC TGATCACCGA CGCCATGGCG GCGGCCGGCA TGGGCGACGG CGTCTACGGG
CTCGGCCCGA TGAAGGTCGA CGTCGTGGAC GGCGTCGCCC GGCTGGCCGA GGGCGGCTCC
ATCGCGGGCA GCACCCTGAC CATGGACGTC GCGTTCCGGC GCAGCGTCCA GCAGGTCGGG
CTGTCGCTGC CGGAGGCGGC CGAGGTCGCC TCGCTCACCC CCGCCCGGGT GCTCGGCCTC
GCCGACCGCC TCGGCTCCGT CTCCGTCGGC AAGCAGGCCG ACCTGGTGGT GCTCACCGGC
GACCTGGAGG TCGCCGGTGT CATGAAGCAC GGAAACTGGA TCACAGAACC CTGA
 
Protein sequence
MSITLADARI VTPEGVHGGW LTIEDGRITH IGQGSAPGPG HSLAGRYVVP GFVDIHNHGG 
AGGSFPTGDP DQASRIAALH ARHGTTTLMA SLVTAALDDL AGATSALADL CEDGLLAGIH
FEGPYISKAR CGAHNPALLR EPSPREFGDL LRAGRGHVRM LTIAAELPGA LDTIREAVAN
NVIAALGHSD ATYEQTIAGI DAGGSVATHL YNAMPPLHHR DPGPIAALLQ DERVTIELIN
DGVHLHPAMM RLAYDVAGPG RTALITDAMA AAGMGDGVYG LGPMKVDVVD GVARLAEGGS
IAGSTLTMDV AFRRSVQQVG LSLPEAAEVA SLTPARVLGL ADRLGSVSVG KQADLVVLTG
DLEVAGVMKH GNWITEP