Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6793 |
Symbol | |
ID | 8670102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 7483681 |
End bp | 7485930 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Carbonate dehydratase |
Protein accession | YP_003342245 |
Protein GI | 271968049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.286615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGACGG ACACGAAGAG TTACCGGCCG CTCGCGAAGA GTGGCATCAG ATCTGTGCTG CGATACGACC TGCCCGCCTC CCTGGTGGTC TTCCTGGTGG CGGTGCCCCT CTCGCTGGGC ATCGCGGTGG CCTCCGGAGC ACCGCTGGCG GCCGGGCTCA TCGCAGCCGT GGTAGGCGGT CTGGTGGCGG GCTCACTGGG CGGCTCGGTC GTCCAGGTGA GCGGTCCCGC CGCCGGGCTC TCACTGGTGG TCGCCGAGCT GGTGACAACG TACGGCTGGC GCGCCACCTG CATGATCACC TTGATGGCGG GCGCCCTGCA GCTCCTCCTG GGGCTGTTCC GGGTCGCCCG CGCGGCGCTG GCGGTCTCTC CCGCCGTGGT GCACGGCATG CTCGCCGGGG TCGGCGTGGT GATCGCCCTC TCCCAGTTGC ACGTGGTGCT CGGCGGCAGC CCGCAGCGAT CGGCGATCGG AAACCTGATC GAGCTGCCCC AGCAGATCAT CCACAACCAC AGCCACGCGG TCTTCGCCGG CCTGCTCACC ATCGCGGTCC TGGTCGGCTG GACCCGGCTG CCGCAGCGGC TGCGGATCGT CCCGGCGCCG CTGGCGGCCC TGTTCGTCGC CTCGGCGACC GCCGGGGCCC TCGGCTGGGA CGTCACCCGG GTCGACCTCT CGCAGAGCCT GAGCGAGTGG GCCACGCCGA TCTGGCCCAG GGGCGACTGG CACGGCATCG TCGGGGCGGT GCTGCTGGTG GCGCTGCTGG CCGGGGTGGA GTCGCTGCTC TCCTCGGTGG CCACCGACAA GCTGCACGAC GGCAGGCGTT CCGACCTGGA CAGGGAGCTC ACCGCCCAGG GCGTGGCCAA CATGGTGACC GGCGCGCTGG GCGGTCTGGC CATCGCCGGG GTCATCGTGC GCAGCACCAC CAACGTCCGC GCCGGGGCGC GCAGCCGCTG GTCGACGGTC ATGCACGGCC TGTGGATCCT GGTGTTCGCG GTCTGCCTGG GGTGGACGAT CACGCTGATC CCCATGGAGG CGCTGGCCGC CCTGCTGGTC TTCATCGGCG TCCAGATGGT CAACCTGGGG CACCTGCGCA ACCTCCGCGG CCACGGCGAG ATCCCGATAT ACGTCGTCAC CATGGCCGGA GTGGTGCTGG TCGGCCTCGC CGAAGGCGTG CTGCTCGGCC TCGGCCTCGC CATGCTGTCG GCGCTGCGCC GTCTCACCTG GATCACCGTG CGGGCCAGGC CCGAGCCCGA CGGCCGCTGG CACGTGCTGA TCGGCGGTTC GCTGACCTTC CTCGGCGTGC CGAGGCTCAC CTCGGAGCTG CGCGCCGTAC CGGCCGGGGC CGCGGTGGAG CTGGACCTGA ACATCGACTT CATGGACAAC GCCGCCTTCG AGGCCATCCA CACCTGGCGG CAGGACCACG AACGCGGCGG CGGCACCGTC GACATCGACG AGATCCACGA CGAGTGGTAC GCCATGGCCG CCAGCGGGGC CCGGATGTTC CCGGCCAAGA CGCCGCCCCG TGCGCCGGAG CGCTGGTGGC TGCCCTGGGC GCATCGGCGG CGGGGACGGC CCGTGCCCGC CAGCTCCCTG GTCCCCGCCC AGCATCCGCC GGCGGGGAGC GGCCCCGCCG TGCCCGACCT GCTCGCCGGG GCACGGGAGT TCCACCGTCG CACGGCTCCG CTGGTCCGCC CGTTCCTGCT GGCGATGGCC CGCAAGCAGG AGCCCTCCCA TCTCTTCATC ACCTGCGCGG ACTCCCGGGT CGTGCCGAAC CTGATCACCG CCAGCGGCCC CGGCGACCTG TTCACCGTCC GCAACATCGG CAACCTGGTG CCGCGCGTGG GGGCGGCGCC TCCGGACGAC TCGGTGGCCG CGGCGATCGA GTACGCCACC GACGCGCTCA ACATCCGGAC CATCACCGTC TGCGGCCACT CCGGCTGCGG TGCCATGGCC GCGCTGCTGA GCGGTCACGA GAAGGCGCCG GGACTGCCCG CGCTCAGCCG CTGGCTGCAC CACGGCGACC ACAGCCTGGC CCGGTTCGTC GCCACCGAGG GCGACGGCGT CGACGACGGC CCGCTGGACC GCCTCTGCAG GGTCAACGTG ATCCAGCAGT TGGAGAACCT GCGGACCTAT CCCCAGGTGG ACCGGCTCGT CCGGGCCGGA CGCCTACAAC TGGTGGGCCT CTACTTCGAC ATCGGCACGG CCCGGGTCCA CGTCCTCGAA CAGCCGCCGC TCACGGCCAG TCCGCTCTGA
|
Protein sequence | MGTDTKSYRP LAKSGIRSVL RYDLPASLVV FLVAVPLSLG IAVASGAPLA AGLIAAVVGG LVAGSLGGSV VQVSGPAAGL SLVVAELVTT YGWRATCMIT LMAGALQLLL GLFRVARAAL AVSPAVVHGM LAGVGVVIAL SQLHVVLGGS PQRSAIGNLI ELPQQIIHNH SHAVFAGLLT IAVLVGWTRL PQRLRIVPAP LAALFVASAT AGALGWDVTR VDLSQSLSEW ATPIWPRGDW HGIVGAVLLV ALLAGVESLL SSVATDKLHD GRRSDLDREL TAQGVANMVT GALGGLAIAG VIVRSTTNVR AGARSRWSTV MHGLWILVFA VCLGWTITLI PMEALAALLV FIGVQMVNLG HLRNLRGHGE IPIYVVTMAG VVLVGLAEGV LLGLGLAMLS ALRRLTWITV RARPEPDGRW HVLIGGSLTF LGVPRLTSEL RAVPAGAAVE LDLNIDFMDN AAFEAIHTWR QDHERGGGTV DIDEIHDEWY AMAASGARMF PAKTPPRAPE RWWLPWAHRR RGRPVPASSL VPAQHPPAGS GPAVPDLLAG AREFHRRTAP LVRPFLLAMA RKQEPSHLFI TCADSRVVPN LITASGPGDL FTVRNIGNLV PRVGAAPPDD SVAAAIEYAT DALNIRTITV CGHSGCGAMA ALLSGHEKAP GLPALSRWLH HGDHSLARFV ATEGDGVDDG PLDRLCRVNV IQQLENLRTY PQVDRLVRAG RLQLVGLYFD IGTARVHVLE QPPLTASPL
|
| |