Gene Information |
Locus tag | Sros_1729 |
Symbol | |
ID | 8665006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 1844544 |
End bp | 1846400 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003337463 |
Protein GI | 271963267 |
COG category | |
COG ID | |
TIGRFAM ID | |
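The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final three bases are a stop codon that does not contribute an amino acid; both assumptions are consistent with the listed values but are not stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1844544
end_bp = 1846400

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
stop_codon_bp = 3
protein_length_aa = (gene_length_bp - stop_codon_bp) // 3

print(gene_length_bp)     # 1857
print(protein_length_aa)  # 618
```

Both results agree with the Gene Length (1857 bp) and Protein Length (618 aa) fields.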
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.50707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGCCA ACAATCCCCC CGACCAGTCG CCTACGCCAG GCCAGCAGCC TGACAGGACG ATCGCCTACC GTTGGAACGA GGGTGCGCAG CAGAACAGTC AACCGCACGC CCAGGGCCAC CCCCAGCAGG GTGGTTACCC GCAGCAGGGA CAGCCCGGCT ACCCGCAGCA GCCCGGATAC CCACAGCAGA ACTACGGCCA GCAGGGCCAG CCCGGCTACC AGCAGCAGCC GCCGAACTAC GGCCAGCAGC AGGGCCAGCA GGGCTACCCC GGCTACCAGC AGCAGGGCCA GCAGGGTTAC CAGCAGGCCC AGCCCGGCTA CCAGCAGCAG CAGGGCTACC AGCAGCAGAA CTACGGCCAG CAGCCCGGCT GGCAGCAGCA GGGCCCCGAT TTCCTCGGCA CGGGACAGCC GACCCCGCCC GCCCGCAAGG GCGGCAAGGG CTGGCTGATC GCGGTGATCG CCGCCCTGGT CGTCGTCCTC GTGGGCGGCG GCGGCGCCTT CGCGGTCAAC CTGCTCAGCG GCGGTGGCAC CCAGCCGCAC GACGTGCTGC CCGGCAACGC CATCGGCTAC GCGCGCCTCG ACTTCGACCC GGCGGCCAAC CAGAAGCTGG CGCTGTTCAG CATCGCCCGG AAGTTCACCG TCACCAAGGA CTCCTTCACC GGCGACGACC CGCGCAAGGC CTTCTTCGAC CAGGCCAAGA AGAGCGGCTT CGACAAGGTG GACTACGCCG CCGATGTCCA GCCGTGGCTC GGCGACCGCA TCGGCATGGC CGCGCTCACC CCGGCCAAGC GCGGTGCCGA GCCCGGCTTC GTGGTCGCCG TCCAGGTGAC CGACGAGGCC AAGGCGAAGG CGGGAATCGC CAAGCTGATG GACGGGGAGA AGTACGGCAT CGCGTTCCGC GAGGACTACG CGCTGCTCAC CGCCACCCAG GCGGAGGCCG ACCAGGCCGC CAAGGCGGCG CCCCTGTCCG ACAACGCCAA CTTCTCCGAC GACCTGAGCG CCCTGGGTGA GACCGGCGTG CTCTCCTTCT GGATGGACGC GGGCAAGCTC GCGGACCTCG CCTCCGAGAT CGCCCCCCAG GACCCCGCCA CCCTCGCGCA GATCAAGAAC GTCCGCGTGG CCGGCGCGCT CCGCTTCGAC GGCCAGTACG TCGAACTGGC CGGCATCAGC CGCGGGGCGA AGGCCCTGGA GGGCATGGGC GAGCCCGAGC CCTCCAGGAT CGGCCAGCTC CCGGTCTCCA CCGCCGGCGC GATCTCGATC TCCGGTCTCG GCGACGTGAT CGGCAAGCAG TGGGCCCAGA TCATGAAGTC GGCCGACCAG GCCGGCGGCG GCGGGAGCTT CCAGCAGTTC GCCGACCAGG CCCAGCAGAA GTACGGGCTG GCGCTCCCCG CCGACCTGGC GACGATGCTC GGCAAGAACC TCACCCTGGC GGTGGACGCC AACGGCCTCG ACGGCGACCA GCCCAAGTTC GGGGCCCGGA TCACCACCGA CCCGGCCAAG GCGCAGGAGG TCGTCGGCAA GATCGAGAAG TTCCTCGCCG ACTCGGGCAC CGCGGTCCCG CAGCTCGCCA AGGTCCCCGG TGACGGCACC TTCGTCCTGG CCAGCTCGCA GGAGTACGCC GCCGAACTCG CCAAGGACGG CAGCCTGGCC GACGACGAGA CGTTCAACCT CGCGATCCCC GACGCCGGCG CGGCGACCTT CGCCGCCTAC GTCGACCTCA ACAAGGTCGA GAAGTTCTAC CTGGAGAGCC TGCAGGGTGA CGACAAGGCC AACCTCCAGC AGCTGCGCGC CGTAGGGATC 
AGCGGAACGC AGTCCGGTAC GGACGCCTCC TTCTCCCTGC GAGTGCTGTT CGACTGA
|
Protein sequence | MPANNPPDQS PTPGQQPDRT IAYRWNEGAQ QNSQPHAQGH PQQGGYPQQG QPGYPQQPGY PQQNYGQQGQ PGYQQQPPNY GQQQGQQGYP GYQQQGQQGY QQAQPGYQQQ QGYQQQNYGQ QPGWQQQGPD FLGTGQPTPP ARKGGKGWLI AVIAALVVVL VGGGGAFAVN LLSGGGTQPH DVLPGNAIGY ARLDFDPAAN QKLALFSIAR KFTVTKDSFT GDDPRKAFFD QAKKSGFDKV DYAADVQPWL GDRIGMAALT PAKRGAEPGF VVAVQVTDEA KAKAGIAKLM DGEKYGIAFR EDYALLTATQ AEADQAAKAA PLSDNANFSD DLSALGETGV LSFWMDAGKL ADLASEIAPQ DPATLAQIKN VRVAGALRFD GQYVELAGIS RGAKALEGMG EPEPSRIGQL PVSTAGAISI SGLGDVIGKQ WAQIMKSADQ AGGGGSFQQF ADQAQQKYGL ALPADLATML GKNLTLAVDA NGLDGDQPKF GARITTDPAK AQEVVGKIEK FLADSGTAVP QLAKVPGDGT FVLASSQEYA AELAKDGSLA DDETFNLAIP DAGAATFAAY VDLNKVEKFY LESLQGDDKA NLQQLRAVGI SGTQSGTDAS FSLRVLFD
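The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand; the sketch relies on that observation. The codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code for elongation (the two differ only in permitted start codons). The function name `translate` and the short test fragments are illustrative choices, not part of the record.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G at each codon position.
BASES = "TCAG"
# Amino acids for translation table 11, in the same codon order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTGCCA ACAATCCCCC CGACCAGTCG"))  # MPANNPPDQS
# Last 15 bases of the gene sequence above, in frame.
print(translate("GTGCTGTT CGACTGA"))  # VLFD
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the complete gene sequence to the same function would allow a full comparison.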