Gene Sros_5044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5044 
Symbol 
ID8668338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5566305 
End bp5567726 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content74% 
IMG OID 
Productamidase 
Protein accessionYP_003340578 
Protein GI271966382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00761067 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0205889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAC TGCACCACAT ATCGGCGGCG GACGCGCTGC GCGCCTTCCG CTCCCGGCAG 
CTGTCGCCGG TCGAGCTGAC CGAGGCGGTG ATCGCCCGCG CCGAACGGAC CGAGCCGGTC
GTCAACGCGC TGTGCCACCG CTTCTTCGAT GAGGCGCTGC GGCAGGCCAA GCGGGCCGAA
CGCCGCTACG CAGGCCAGGA CGGGCCGCCG CGGCCGCTGG AGGGGCTGCC CACCGTCGTC
AAGGAGGACG AACCGGTTAC GGGGCACCCC TGGACCCAGG GGTCGCTACG GTATCGCGAC
GTCGTCGCCG GGCACACCTC GCTCTTCGTC CGGCGCCTCC TCGACGCCGG CGTGATCGTG
CACGCCCGGA GCACGGCGTC GGAGTTCGCT TCCGCCGCGT TCACGCACTC GGCGCTCTGG
GGCGTCACCC GCAACCCGTG GAACCCGGAG TTCTCCCCCG GCGGCTCGTC GGGCGGCTCG
GGCGCGGCGC TCGCCGCCGG GTCGACGGTG CTGGCGACGG GATCGGACAC CGCGGGCTCG
ATCCGGGTGC CCGCCTCGTT CAGCGGTGTG GTCGGGTTCA AGCCGCCGCA CGGCCGGGTA
CCGGTGGACC CGCCCTATCA CCTCGACACC TACGTGCACT CCGGTGTGCT GGCCCGCACC
GTCGCCGACG TCGCACTGAT GCAGAACGTC GTGGCCGGGC CCCACCCGGG GGACGTCGGC
TCGCTGCGGC CCCGTCACGT CCTGCCGGAT CCCGCCGAGC TTGGCCGCGA CCTGCGCGGG
ATGCGGATCG CCTTGTCCGA GGACCTCGGT GACTGGGCGG TCGACCCGGA GGTCCGCCGC
AACACCCGGG AGTTCGGCGA GCGGCTGCGA GCGGCCGGGG CCCGCGTCGA GGAGGTCGCG
CTCCCGGTGC CGCGGGCGCA GGTGCTGCGC GCGGCGGCCA TCCACTTCCA CCACGGATTC
GGCGCCGCCG TCGCGGCCGA CGGGCGCAAG CCCGGCGCCC CTCTCACCCC GTATGCGCAG
GCGTTCGCGC GGTGGGCGGC CGAGGGCGCC GCCGGCGCCG GCGTGCTCGA CGGATTCGCG
ATCGAGTCCG ACCTTTACCG GCCCGTCGGC GAGCTGCTCG AGCGGTTTGA CGCGCTCGTC
TGCCCGACCG CGGCCACCCG TGGACTGGTG GCGGGCGAGG ACTACCTCGA CCACGGCCCG
GAGGTCGACG GCGAACGGCT CGGGCACTAC CTGGAGTCGC TGCTCGCGCT CCCGTTCAAC
ATCATGAACC GCTGCCCCGT GCTGGCCGTG CCGTCCGGCG TCGCCGACAA CGGGGTGCCC
ACCGGGGTGC AGATCGTCGG GCGGCCGTTC GACGACACCA CGCCGTTCCG TGTCGGGGCG
GCGGTCGAGC AGCGGCCGCA CTGGCCGGAG GTCGGGACGT GA
 
Protein sequence
MDELHHISAA DALRAFRSRQ LSPVELTEAV IARAERTEPV VNALCHRFFD EALRQAKRAE 
RRYAGQDGPP RPLEGLPTVV KEDEPVTGHP WTQGSLRYRD VVAGHTSLFV RRLLDAGVIV
HARSTASEFA SAAFTHSALW GVTRNPWNPE FSPGGSSGGS GAALAAGSTV LATGSDTAGS
IRVPASFSGV VGFKPPHGRV PVDPPYHLDT YVHSGVLART VADVALMQNV VAGPHPGDVG
SLRPRHVLPD PAELGRDLRG MRIALSEDLG DWAVDPEVRR NTREFGERLR AAGARVEEVA
LPVPRAQVLR AAAIHFHHGF GAAVAADGRK PGAPLTPYAQ AFARWAAEGA AGAGVLDGFA
IESDLYRPVG ELLERFDALV CPTAATRGLV AGEDYLDHGP EVDGERLGHY LESLLALPFN
IMNRCPVLAV PSGVADNGVP TGVQIVGRPF DDTTPFRVGA AVEQRPHWPE VGT