Gene Sros_1362 details

Gene Information

Locus tag: Sros_1362
Symbol:
ID: 8664637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1411485
End bp: 1413512
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Fucose 4-O-acetylase and related acetyltransferase-like protein
Protein accession: YP_003337100
Protein GI: 271962904
COG category:
COG ID:
TIGRFAM ID:
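
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1411485
END_BP = 1413512
GENE_LENGTH_BP = 2028
PROTEIN_LENGTH_AA = 675


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed here: the span is 2028 bp, which is 676 codons, one of which is the stop codon.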


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACA CCCGATCCCC CTTCCAGCTT CCGACCCCCC GGGACGACGG CGCCCCCGAC 
GCCGACACCG GCGGGTCCGC CGGCGGACAG GGCCGGACAC CCGCCCCGTG GCCGATGCCC
GAGCACGCGC AGTCGTCCCA GACCTCCTGG CCGGAGCCAC CGGACCGTGC TTCCACCGCC
TCCTCCTGGG GCCGCCGGGA CGCGCAGGGC TCGGAGGAGG ACCCCGGCGA GGTCACCGCC
TGGGGCTACC CGGCCTACCA GGAGCCGCAG TCCTCCGCCA AGGAGGACGC GTGGGCGAAC
TGGGACACCC AGACACCTCC CGGAGAGCAG GAGCAGGCAC CGCGGCGGGA GACCCGGAAG
TCCTCCGGGT CCCAGGGCCG GACGCCGTAC GGGGACACCC CGCCGCCCGC CGCCCCCGGG
GCACGGGCGG CGGGCGAGGA CGCCCCGTCG CCCGCCGCGC CGGAGTTCTG GGCGATGCAC
GAGGAGGCAC GACCGTCCGC CGCGCCGGAG GGGTGGCCCG CCGCCCAGCC GCCCTCGGCG
CCCGACCCCT GGGCGACGCG CGACGCCCAG CAGCCCGCGG TGCCGGGGTC CTGGGCGACG
CGCGGGGACG CACAGCCGTC AGCCGCGCGG GAGACGCGGG CGCCGTACGC GGACGTGCGG
CCCCCCGGCC CCCGGAGCCC GCTGGACATG ACGCACCGCG CCGCACCCGC CGGGAACCTG
CTGGACCCGC TGGACCCCGC CTGGGCCTCG GACCACGACG CCGCCCGGAC GCAGGCCGCT
CCCGCGGCGT GGGCGCCGGA GGCCCCCCGG CACAGGCCGC AGCCGCAGGA GGCTCCGGCC
GCCGAGCGGC CGTTCTCCTA CTGGGAGAAC TCCGCCCGTG ACCCCGAGCC CTCCCCCTGG
GGCCGGTTCC CGGAGCAGGA GCCGGAGCAG GCGCAGGAGC AGCGCCCGGA GGAGGCACCC
GCCGCTCCCC CGGCCAGGAA GAAGCGCGAG CCGTACCTCG ACAACGTCAA GTTCGTCCTG
ATCGCCCTGG TCGTGACCGG GCACTCGCTG GTGCCCACCC TGGCCGCGCA CTCGGCCAAG
TCGGCCTACC TGTTCATCTA CACCTTCCAC ATGCCGGCGT TCGTGCTGAT CAGCGGTTAT
CTCGGCCGGA ACTTCTGGAA CTCCAACGCC AAGATCAACA AGCTGGTCGA CACCATGCTG
GTGCCCTACG TGGTCGTGGA GATCGGCTAC GCGCTGCTCC GCTACGGGCT GGGCCAGAAA
TGGACCCTGA CGATCATCGA CCCGGCCTGG CTGAACTGGT ACCTGCTGGC CCTGGTGCTC
TGGCGGATCT CCACGCCCAT CTGGAACCGG ATGCGGCAGC CGCTGCTGGT TGCGGTGGTC
ATCTACATGG TGGCCGGCTT CTCGGAGATC TCCGGCGACT TCAGCATCGA CCGCTTCTTC
GGCCTGCTCC CCTTCTACGT GCTCGGCCTG GTGCTCAAGC CCGAGCACTT CGACCTGCTC
AAGCCGGTCT GGGTCAGGAT CGTGGCCGGG ATCACGGTCG CCGGGGGCAT CGCGGTGGCG
GTCTTCATCG CCCCGCACGT CAACCTCAAG CCGATCTACT TCCGCTACAG CATCAAGTCC
ATGGACACGA GCTGGCTGAT CGGGCTCGGC GTGCGCGGCG CCGTGCTGGT CGCGGCGCTG
GCCATGTCGG TCGCGCTGCT GGCCCTGGTG CCCCGGCGCG AGACCTGGTT CTCCGACCTC
GGCACCCGCA CGCTCTACGC CTACCTGCTC CACGGCGTCG TGGTGCTCAT CGCCAAGGAC
CAGGGGTGGC TGAGCTTCCC CTGGCTGTAC GGCCCGCTGG GCGTGCTGGC GATCATGTCC
AGCTCCCTGG CGCTGGCCAT CGTCCTGTGC CTGCCGGAGA CGCGCACGCT CTTCAAGTGG
CTGCTGGAAC CCCGCCTGGT CTGGCTCTAC CGCCGCCCGT CGGCGGACTC CCCCGGCAAG
CAGGCGGAAC CGGCCCGCAA GGAGAGTTCA GCGGCAGTTC CTCGGTAA
 
Protein sequence
MSDTRSPFQL PTPRDDGAPD ADTGGSAGGQ GRTPAPWPMP EHAQSSQTSW PEPPDRASTA 
SSWGRRDAQG SEEDPGEVTA WGYPAYQEPQ SSAKEDAWAN WDTQTPPGEQ EQAPRRETRK
SSGSQGRTPY GDTPPPAAPG ARAAGEDAPS PAAPEFWAMH EEARPSAAPE GWPAAQPPSA
PDPWATRDAQ QPAVPGSWAT RGDAQPSAAR ETRAPYADVR PPGPRSPLDM THRAAPAGNL
LDPLDPAWAS DHDAARTQAA PAAWAPEAPR HRPQPQEAPA AERPFSYWEN SARDPEPSPW
GRFPEQEPEQ AQEQRPEEAP AAPPARKKRE PYLDNVKFVL IALVVTGHSL VPTLAAHSAK
SAYLFIYTFH MPAFVLISGY LGRNFWNSNA KINKLVDTML VPYVVVEIGY ALLRYGLGQK
WTLTIIDPAW LNWYLLALVL WRISTPIWNR MRQPLLVAVV IYMVAGFSEI SGDFSIDRFF
GLLPFYVLGL VLKPEHFDLL KPVWVRIVAG ITVAGGIAVA VFIAPHVNLK PIYFRYSIKS
MDTSWLIGLG VRGAVLVAAL AMSVALLALV PRRETWFSDL GTRTLYAYLL HGVVVLIAKD
QGWLSFPWLY GPLGVLAIMS SSLALAIVLC LPETRTLFKW LLEPRLVWLY RRPSADSPGK
QAEPARKESS AAVPR
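
The gene and protein sequences above are related through translation table 11, under which the GTG at the start of the gene is read as an initiating methionine. The following is a minimal sketch of how the GC content and the translation could be recomputed from the sequence blocks as printed (space-separated groups of ten). It uses only the Python standard library; the helper names are hypothetical, and the way the first codon is handled is a simplifying assumption rather than the method used by the source database.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11; at position 1 they are read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's sequence layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in the record.
    print(translate("GTGAGCGACA CCCGATCCCC CTTCCAGCTT"))
```

Passing the full gene sequence block to `translate` and comparing the result with `clean` applied to the protein sequence block would check the two against each other, and `gc_fraction` on the same input could be compared with the GC content reported in the record.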