Gene Information |
Locus tag | Sros_3801 |
Symbol | |
ID | 8667091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4239794 |
End bp | 4241641 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003339464 |
Protein GI | 271965268 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
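The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record itself does not state either convention. The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4239794
END_BP = 4241641
GENE_LENGTH_BP = 1848
PROTEIN_LENGTH_AA = 615


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span is 4241641 - 4239794 + 1 = 1848 bp, and 1848 / 3 = 616 codons, which leaves 615 residues once the stop codon is excluded.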
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CCACGACTCA GGGATGGCGC GGCGTCGCCG CCGAGAACAA GGACGACCTG TCGGAGCAGG TGTCGTTCCT GCTCCGCGCC CGGTCACGAC GGCTGCTCGG CGAGCTGCTG CGGCCGTACC GCAGGGAGAT CGCGCTGCTC GTCGCGGTCA TCGTGATCTG CAACGCCGCC GCTCTCGCCA TCCCCTACCT GATCAAGGTG GGCATCGACG CCGGCATCCC GCCGATGGTC GCGGGCGAGG GGCCGGCCAC GCTGGTGACG GTCGTCGTGG CGGTCCTGGC GGCGGCGATC ACCCAGGCGG CCACCCGGCA GGTGTTCCTC AGGATGGCGG GCCGCATCGG CCAGAGCATC CTGCTGGAGC TACGGCGGCG GGTCTTCGCC CACTTCCAGA GGCTGTCGCT GTCGTTCCAC GACGACTACA CCTCGGGCCG GGTCGTCGCC CGGCTCACCT CCGACATCGA GGCCATCTCC GAGATGCTGC AGTCGGGCTT CGACGGCCTG GTCACCGCCG TGCTGACGCT GACCGGCACC GCGGTCCTGC TGCTCGTCCT CGACGTGCCG CTCGCCGTGG TCGCGCTGCT GCCGCTCCCG GTCCTGCTGC TGTTCACCCG ATGGTTCCGG CGGCAGTCGA GCATCACCTA CCGCAGGACC CGGGAGACCG TGGCGCTGGT GATCGTCCAC TTCGTGGAGT CGATGACCGG CATCCGGGCG GTCCAGGCGT TCCGCAGGGA GCCGCGCAAC CAGGAGATCT TCGCCCAGCT CAACGCCGAC TACGGGCACG CCAACGTGCA GAGCATGCGC CTCATCGCGC TCTTCATGCC GGGCGTCAAG CTCATCGGGA ACGTCACGAT CGCCGCCGTC CTGTTCTACG GCGGCCTGCT GGCCATCGAC GGCGACGTCA CGGTGGGCGT GCTCGCCGCG TTCCTGCTCT ACCTGCGCCA GTTCTACGAG CCGATGCAGG AGATCAGCCA GTTCTACAAC ACCTTCCAGT CGGCGGGGGC GGCCCTGGAG AAACTCTCCG GCGTGCTGGA GGAGAGGCCC GCGGTGGCCG AGCCCCGCAC TCCCGTGGCG CTGGAACGGC CACGCGGGGA GATCCGGTTC GAGGAGGTGG AGTTCTCCTA CCTGGACGGC ACCCCGGTAC TGTCCCGGAT GGATCTGGCG ATCCCGGCGG GGCAGACCGT GGCGCTGGTC GGCACCACCG GGGCGGGGAA GACCACGCTG GCGAAACTGG TCTCCCGGTT CTACGACCCC GTCGCCGGCC GTGTGCTGCT CGACGGGGTC GACCTGCGTG ATCTCGGCGA GGACTCGCTG CGCGGCGCGG TGGTCATGGT GACCCAGGAG AACTTCCTGT TCACCGGATC GGTCGCCGAC AACATCAGGT TCGGCCGGCC CGGCTCGACC ATGGCCGAGG TCGTCGAGGC CGCCCGGTCC ATCGGCGCCC ACGAGTTCAT CTCGGCGCTT CCGGAGGGCT ACGACACCCA GGTCGCCAAA CACGGCGGCA GGCTGTCGGC CGGGCAGCGG CAGCTCGTGG CGTTCGCCCG GGCCTTCCTC GCCGACCCCG CGGTGCTGAT CCTCGACGAG GCGACCTCCA GCCTGGACGT CCCCGGCGAA CGGCTGGTGC AGCGGGCGAT GCGGACGATC CTGGCGGAAC GGACCGCTCT GATCATCGCC CACCGGCTGT CGACCGTCGA GATCGCCGAC CGGGTGCTCG TGATGGACGG CGGCGGCATC GTCGAGGACG GTCCTCCCGA CCAGCTCATC GCACGGGCGG GCCGCTTCGC CGGCCTGCAC 
CAGGCGTGGC TGGACAGCAT CTCGGATCTG CCGGCCCCAC CCGGTTAG
|
Protein sequence | MSQTTTQGWR GVAAENKDDL SEQVSFLLRA RSRRLLGELL RPYRREIALL VAVIVICNAA ALAIPYLIKV GIDAGIPPMV AGEGPATLVT VVVAVLAAAI TQAATRQVFL RMAGRIGQSI LLELRRRVFA HFQRLSLSFH DDYTSGRVVA RLTSDIEAIS EMLQSGFDGL VTAVLTLTGT AVLLLVLDVP LAVVALLPLP VLLLFTRWFR RQSSITYRRT RETVALVIVH FVESMTGIRA VQAFRREPRN QEIFAQLNAD YGHANVQSMR LIALFMPGVK LIGNVTIAAV LFYGGLLAID GDVTVGVLAA FLLYLRQFYE PMQEISQFYN TFQSAGAALE KLSGVLEERP AVAEPRTPVA LERPRGEIRF EEVEFSYLDG TPVLSRMDLA IPAGQTVALV GTTGAGKTTL AKLVSRFYDP VAGRVLLDGV DLRDLGEDSL RGAVVMVTQE NFLFTGSVAD NIRFGRPGST MAEVVEAARS IGAHEFISAL PEGYDTQVAK HGGRLSAGQR QLVAFARAFL ADPAVLILDE ATSSLDVPGE RLVQRAMRTI LAERTALIIA HRLSTVEIAD RVLVMDGGGI VEDGPPDQLI ARAGRFAGLH QAWLDSISDL PAPPG
|
| |
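The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes the gene sequence is given in coding orientation (the record lists the gene on the minus strand, and the listed sequence begins with ATG), and it uses the codon assignments of translation table 11, which match the standard genetic code for elongation codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; table 11 shares these assignments
# with the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAGCCAGA CCACGACTCA GGGATGGCGC"))  # MSQTTTQGWR
```

The output matches the first ten residues of the protein sequence, and the final bases of the gene sequence (CCC GGT TAG) translate to P, G, and a stop, consistent with the protein ending in PG.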