Gene Sros_6793 details

Gene Information

Locus tag: Sros_6793
Symbol:
ID: 8670102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7483681
End bp: 7485930
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Carbonate dehydratase
Protein accession: YP_003342245
Protein GI: 271968049
COG category:
COG ID:
TIGRFAM ID:
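
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True when the coordinates and the two length fields agree."""
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    span = end_bp - start_bp + 1
    # A complete CDS is a whole number of codons; the final stop codon
    # contributes no amino acid to the protein.
    codons, remainder = divmod(span, 3)
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


# Values taken from the Gene Information fields of this record.
print(check_cds_lengths(7483681, 7485930, 2250, 749))  # prints True
```

For this record the span is 2250 bp, which is 750 codons, or 749 amino acids plus one stop codon.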


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.286615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGACGG ACACGAAGAG TTACCGGCCG CTCGCGAAGA GTGGCATCAG ATCTGTGCTG 
CGATACGACC TGCCCGCCTC CCTGGTGGTC TTCCTGGTGG CGGTGCCCCT CTCGCTGGGC
ATCGCGGTGG CCTCCGGAGC ACCGCTGGCG GCCGGGCTCA TCGCAGCCGT GGTAGGCGGT
CTGGTGGCGG GCTCACTGGG CGGCTCGGTC GTCCAGGTGA GCGGTCCCGC CGCCGGGCTC
TCACTGGTGG TCGCCGAGCT GGTGACAACG TACGGCTGGC GCGCCACCTG CATGATCACC
TTGATGGCGG GCGCCCTGCA GCTCCTCCTG GGGCTGTTCC GGGTCGCCCG CGCGGCGCTG
GCGGTCTCTC CCGCCGTGGT GCACGGCATG CTCGCCGGGG TCGGCGTGGT GATCGCCCTC
TCCCAGTTGC ACGTGGTGCT CGGCGGCAGC CCGCAGCGAT CGGCGATCGG AAACCTGATC
GAGCTGCCCC AGCAGATCAT CCACAACCAC AGCCACGCGG TCTTCGCCGG CCTGCTCACC
ATCGCGGTCC TGGTCGGCTG GACCCGGCTG CCGCAGCGGC TGCGGATCGT CCCGGCGCCG
CTGGCGGCCC TGTTCGTCGC CTCGGCGACC GCCGGGGCCC TCGGCTGGGA CGTCACCCGG
GTCGACCTCT CGCAGAGCCT GAGCGAGTGG GCCACGCCGA TCTGGCCCAG GGGCGACTGG
CACGGCATCG TCGGGGCGGT GCTGCTGGTG GCGCTGCTGG CCGGGGTGGA GTCGCTGCTC
TCCTCGGTGG CCACCGACAA GCTGCACGAC GGCAGGCGTT CCGACCTGGA CAGGGAGCTC
ACCGCCCAGG GCGTGGCCAA CATGGTGACC GGCGCGCTGG GCGGTCTGGC CATCGCCGGG
GTCATCGTGC GCAGCACCAC CAACGTCCGC GCCGGGGCGC GCAGCCGCTG GTCGACGGTC
ATGCACGGCC TGTGGATCCT GGTGTTCGCG GTCTGCCTGG GGTGGACGAT CACGCTGATC
CCCATGGAGG CGCTGGCCGC CCTGCTGGTC TTCATCGGCG TCCAGATGGT CAACCTGGGG
CACCTGCGCA ACCTCCGCGG CCACGGCGAG ATCCCGATAT ACGTCGTCAC CATGGCCGGA
GTGGTGCTGG TCGGCCTCGC CGAAGGCGTG CTGCTCGGCC TCGGCCTCGC CATGCTGTCG
GCGCTGCGCC GTCTCACCTG GATCACCGTG CGGGCCAGGC CCGAGCCCGA CGGCCGCTGG
CACGTGCTGA TCGGCGGTTC GCTGACCTTC CTCGGCGTGC CGAGGCTCAC CTCGGAGCTG
CGCGCCGTAC CGGCCGGGGC CGCGGTGGAG CTGGACCTGA ACATCGACTT CATGGACAAC
GCCGCCTTCG AGGCCATCCA CACCTGGCGG CAGGACCACG AACGCGGCGG CGGCACCGTC
GACATCGACG AGATCCACGA CGAGTGGTAC GCCATGGCCG CCAGCGGGGC CCGGATGTTC
CCGGCCAAGA CGCCGCCCCG TGCGCCGGAG CGCTGGTGGC TGCCCTGGGC GCATCGGCGG
CGGGGACGGC CCGTGCCCGC CAGCTCCCTG GTCCCCGCCC AGCATCCGCC GGCGGGGAGC
GGCCCCGCCG TGCCCGACCT GCTCGCCGGG GCACGGGAGT TCCACCGTCG CACGGCTCCG
CTGGTCCGCC CGTTCCTGCT GGCGATGGCC CGCAAGCAGG AGCCCTCCCA TCTCTTCATC
ACCTGCGCGG ACTCCCGGGT CGTGCCGAAC CTGATCACCG CCAGCGGCCC CGGCGACCTG
TTCACCGTCC GCAACATCGG CAACCTGGTG CCGCGCGTGG GGGCGGCGCC TCCGGACGAC
TCGGTGGCCG CGGCGATCGA GTACGCCACC GACGCGCTCA ACATCCGGAC CATCACCGTC
TGCGGCCACT CCGGCTGCGG TGCCATGGCC GCGCTGCTGA GCGGTCACGA GAAGGCGCCG
GGACTGCCCG CGCTCAGCCG CTGGCTGCAC CACGGCGACC ACAGCCTGGC CCGGTTCGTC
GCCACCGAGG GCGACGGCGT CGACGACGGC CCGCTGGACC GCCTCTGCAG GGTCAACGTG
ATCCAGCAGT TGGAGAACCT GCGGACCTAT CCCCAGGTGG ACCGGCTCGT CCGGGCCGGA
CGCCTACAAC TGGTGGGCCT CTACTTCGAC ATCGGCACGG CCCGGGTCCA CGTCCTCGAA
CAGCCGCCGC TCACGGCCAG TCCGCTCTGA
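
The GC content field can, in principle, be recomputed from the gene sequence as printed. The sketch below assumes the sequence is given in whitespace-separated blocks of bases, as above; `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used as sample input.

```python
def gc_percent(sequence_text):
    """Return the percentage of G and C bases in a whitespace-formatted sequence."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full block to
# compare the result with the GC content field of the record.
first_line = "ATGGGGACGG ACACGAAGAG TTACCGGCCG CTCGCGAAGA GTGGCATCAG ATCTGTGCTG"
print(round(gc_percent(first_line), 1))
```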
 
Protein sequence
MGTDTKSYRP LAKSGIRSVL RYDLPASLVV FLVAVPLSLG IAVASGAPLA AGLIAAVVGG 
LVAGSLGGSV VQVSGPAAGL SLVVAELVTT YGWRATCMIT LMAGALQLLL GLFRVARAAL
AVSPAVVHGM LAGVGVVIAL SQLHVVLGGS PQRSAIGNLI ELPQQIIHNH SHAVFAGLLT
IAVLVGWTRL PQRLRIVPAP LAALFVASAT AGALGWDVTR VDLSQSLSEW ATPIWPRGDW
HGIVGAVLLV ALLAGVESLL SSVATDKLHD GRRSDLDREL TAQGVANMVT GALGGLAIAG
VIVRSTTNVR AGARSRWSTV MHGLWILVFA VCLGWTITLI PMEALAALLV FIGVQMVNLG
HLRNLRGHGE IPIYVVTMAG VVLVGLAEGV LLGLGLAMLS ALRRLTWITV RARPEPDGRW
HVLIGGSLTF LGVPRLTSEL RAVPAGAAVE LDLNIDFMDN AAFEAIHTWR QDHERGGGTV
DIDEIHDEWY AMAASGARMF PAKTPPRAPE RWWLPWAHRR RGRPVPASSL VPAQHPPAGS
GPAVPDLLAG AREFHRRTAP LVRPFLLAMA RKQEPSHLFI TCADSRVVPN LITASGPGDL
FTVRNIGNLV PRVGAAPPDD SVAAAIEYAT DALNIRTITV CGHSGCGAMA ALLSGHEKAP
GLPALSRWLH HGDHSLARFV ATEGDGVDDG PLDRLCRVNV IQQLENLRTY PQVDRLVRAG
RLQLVGLYFD IGTARVHVLE QPPLTASPL
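
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the compact amino-acid string that NCBI uses for this table (base order T, C, A, G); `translate_cds` is a hypothetical helper, not a library function. It assumes the sequence is in frame from its first base and does not apply the rule that alternative start codons are read as methionine, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG in table 11 ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence_text):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    dna = "".join(sequence_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence; the result matches the first 20
# residues of the protein sequence above.
first_line = "ATGGGGACGG ACACGAAGAG TTACCGGCCG CTCGCGAAGA GTGGCATCAG ATCTGTGCTG"
print(translate_cds(first_line))  # prints MGTDTKSYRPLAKSGIRSVL
```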