Gene Sros_4414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4414 
Symbol 
ID8667708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4926114 
End bp4927631 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340031 
Protein GI271965835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.694835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0446961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGA CATTTGAGCC TACCTTCAGA TGCTTGGTCC CAGTGCTCGC CTTGATCGCC 
CTGTCCGCAG CCGTGGCGCC CCCGGCCAGA GCCGACGGCG GCCCGCCCCC ACCGCTCGAC
TGGAAGCCCT GCGCCCAGGG CCCCGACGAC GCCTCCGGCA GGGAGCTCGA TCGGGCAGGC
GCACGGTGCG CGGAGCTCAC CGTCCCGCTC GACCACTCCA GGCCTGGCGG CCGCACCATC
AAACTCGCGC TGTCCCGCCT CCCCGCCACC GACCGCGCCC ATCGCATCGG CACCATGGTC
CTCAACAGCG GCGGCCCCGG CGAGAGCACG CTGGGCATGC CCCTGCAGAC GCGCGCGGCC
ATGAAGGACG TCGCCGCCCG TTACGACCTC GTCGGCCTGG ACCCGCGCTT CGTCGGGCGC
AGCACGCCGC TCGACTGCGG CTGGCCCATC GGCCTGTGGC TTCGCTCGGC CGGTCCGACC
CGCGCCCGGT TCGATCACCA GGTGGCGGTG CAGCGCGACC TGGCCGAGCG CTGCGCCCGG
CGCCACGCCG ACGTGCTTCC GTACGCCAAC ACGCGCGACA CGGCCCGCGA CATCGACCTC
GTGCGCCGCG TCCTCGGGGA GCGCCGGATC TCCTTCCTCG GCTACTCCTA CGGCAGCTAC
CTAGGCGCGG TCTACGCCCA GATGTTCCCC GGCCGCACCG ACCGCGTGGT CCTGGACAGC
GCGGGCGACC CGAACAAGTG GGGCCCCAGG GCGACGCAGG GCACCGAAGA TGAGGCGGAA
CGCGCACTGC GCGGCTGGGC GGCCTGGGCC GCCAAACGTC ACGGCACCTA TGGCCTCGGC
GCCACCCCCG CCCGCGTGCT CGCCACCGTG AACGCGATCG TCGCCGCCGC GCAGGACCGC
CCGCTGCGCG TCGGACCGTA CGAGGTGGAC GACCAGGCGG TCCCGTACAT CCTCTCCGTC
GGCTCCGGCG ACGACCGCCC CGCGGCCCGC GCGGAGTTCA CCTCCACCGT TCGGACCCTG
AACGAGGCCG CGCACGGCCG CCCGGCCGAC CCCGGCTCCG AACTCGACGG ATTCCTCACG
TTCGTCCTGA CCAGCGCCGG CTCCCCGCTC GCCAGCCCGG CCGCCGCGAT CACCTGCGGC
GACCGCGCCG CGCCGCGCGA CCCCGACGCC TACTGGAACG ACGTCCAGCG CAGCCGTGCG
CGCCACCCCC TCTTCGGCCC TCTCAAGAAC AACATCTGGC CCTGCGCCTT CTGGCCGAAC
CACCCGCGCG AGCGCCCCAC GCACGTCGCC AACCCCACCC CCGCGCTGAT CGTGTCCGCC
ACCGGCGACA CCGCCACCAC CTACGAGGGC AGCAAGGCCA TGCACCGGGC GCTGACCGGC
TCCCGCCTGC TCACCCTGCG TGGCAGCACC GCCCACGGGA TCTACGGCGA ATACGGCAAC
GCCTGCGTGG ACGCCAAGGT CAACGCCTAC CTGGCCACCG GCACCCTCCC CGCCACCGAC
CAGGTCTGCC GTCCTTGA
 
Protein sequence
MKLTFEPTFR CLVPVLALIA LSAAVAPPAR ADGGPPPPLD WKPCAQGPDD ASGRELDRAG 
ARCAELTVPL DHSRPGGRTI KLALSRLPAT DRAHRIGTMV LNSGGPGEST LGMPLQTRAA
MKDVAARYDL VGLDPRFVGR STPLDCGWPI GLWLRSAGPT RARFDHQVAV QRDLAERCAR
RHADVLPYAN TRDTARDIDL VRRVLGERRI SFLGYSYGSY LGAVYAQMFP GRTDRVVLDS
AGDPNKWGPR ATQGTEDEAE RALRGWAAWA AKRHGTYGLG ATPARVLATV NAIVAAAQDR
PLRVGPYEVD DQAVPYILSV GSGDDRPAAR AEFTSTVRTL NEAAHGRPAD PGSELDGFLT
FVLTSAGSPL ASPAAAITCG DRAAPRDPDA YWNDVQRSRA RHPLFGPLKN NIWPCAFWPN
HPRERPTHVA NPTPALIVSA TGDTATTYEG SKAMHRALTG SRLLTLRGST AHGIYGEYGN
ACVDAKVNAY LATGTLPATD QVCRP