Gene Sros_2808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2808 
Symbol 
ID8666094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3048160 
End bp3051453 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content71% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003338509 
Protein GI271964313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTAAGC GCACGGACAT CCAGTCGGTC ATGGTGATCG GCTCCGGGCC CATCGTGATC 
GGGCAGGCCT GCGAGTTCGA CTACTCGGGC ACCCAGGCCT GCCGCGTGCT GCGCGCCGAG
GGCTTCCGCG TGATCCTCGT CAACAGCAAC CCGGCGACGA TCATGACGGA CCCCGAGTTC
GCCGACGCCA CCTACGTCGA GCCGATCACC CCGGACATGG TCGAGAAGAT CATCGCCAAG
GAGCGTCCCG ACGCGCTGCT GCCCACCCTC GGCGGCCAGA CCGCGCTGAA CACCGCGATC
GCCCTGCACG AGGCCGGCGT CCTGGCCAAA TACGACGTCG AGCTGATCGG CGCCGACGTG
GACGCCATCC AGGCGGGCGA GAACCGCGAG CTGTTCAAGG GCATCGTGGC CAAGGTCGCG
CGGGAGCGCG GCCTCAACGC CGACTCGGCC CGCTCGTTCG TCTGCCACAC CCTGGACGAG
TGCCTGACGG CCGCCGGCGA GCTGGGCTAC CCGCTGGTCG TGCGCCCCTC CTTCACCATG
GGCGGCGTCG GCTCCGGCTT CGCCCACGAC GATGAGGGCC TGCGCCGCAT CGCCGGAGCG
GGCCTCGACG CCTCGCCGAC CACCGAGGTG CTCCTGGAGG AGTCCATCCT CGGCTGGAAG
GAGTACGAGC TGGAGGTCAT GCGCGACAAG GCCGACAACG TCGTCATCGT CTGCTCCATC
GAGAACATCG ACCCGATGGG CGTGCACACC GGCGACAGCG TCACCGTGGC CCCCGCCCTG
ACGCTCACCG ACCGCGAGTA CCAGAACATG CGCGACGTCG CCATCGCGGT CATCCGCGAG
GTCGGCGTGG ACACCGGCGG CTGCAACATC CAGTTCGCCG TCGACCCGGC CACCGGCCGC
ATGGTCGTCA TCGAGATGAA CCCGCGCGTC TCCCGCTCCA GCGCGCTCGC CTCGAAGGCC
ACCGGCTTCC CGATCGCGAA GATCGCCGCC AAGCTGGCCA TCGGCTACAC CCTGGACGAG
ATCCCCAACG ACATCACCAA GGAGACCCCG GCGTCGTTCG AGCCCTCGCT CGACTACATC
GTGGTCAAGG TGCCCCGCTT CGCCTTCGAC AAGTTCCCCG GGGCCGACCA GACCCTGACG
ACCCACATGA AGTCGGTCGG CGAGGCCATG GCCATCGGCC GGTCCTTCCC CGAGGCCCTG
CAGAAGGCGC TGCGCTCGCT GGAGAAGAAG GGCGCCGTCT TCACCTGGGC GGGCGAGCCG
GGCGACAAGA ACGACCTCCT GAAGGCCTGC CGCACGCCGC ACGACGGCCG GCTGTTCACC
ATGCAGCAGG CCATCCGCGC CGGGGCCACG CCCGCCGAGC TGTTCGAGGC CACCGCGGTG
GACCCGTGGT TCCTGGACCA GCTCCAGGCG ATCGACGAGG TCGCCGCCGA GCTGCGCCAG
GTCCCCCAGC TCACCGCCGA GGTGCTGACG AAGGTCAAGC GGTACGGCTT CAGCGACCTG
CAGATCGCCG AGCTCCGCGA CATGACCGAG CCCCGGGTCC GCGACCTGCG CCACGCGCTC
GGCGTCCGGC CGGTCTACAA CACCGTCGAC ACCTGCGCCG CCGAGTTCGC CGCGCGCACG
CCCTACCTCT ACTCCACCTA CGACGAGGAG ACCGAGGTCC CGTACGGCGA GCGCCCCAAG
GTCCTCATCC TCGGCTCGGG GCCGAACCGG ATCGGCCAGG GCATCGAGTT CGACTACGCC
TGCGTGCACG CCTCCTTCGA GCTGTCGGCG GCCGGCTACG AGACCGTCAT GGTCAACTGC
AACCCCGAGA CGGTCTCCAC CGACTACGAC ACCTCCGACC GGCTCTACTT CGAGCCGCTC
ACCCTGGAGG ACGTCCTGGA GGTCGTCCAC GCCGAGCAGC AGACCGGCCC GGTCGCGGGC
GTCATCGTCC AGCTCGGCGG CCAGACCCCG CTCGGCCTGG CCCAGGCGCT CAAGGACGCC
GGGGTCCCGA TCGTCGGCAC CTCGCCGGAG TCGATCCACC TCGCCGAGGA GCGCGGCGCG
TTCGGCCGCG TCCTGGCCGA GGCCGGGCTG CCCGCGCCCA AGCACGGCAC CGCGGTCACC
GTGGACGAGG CCCTGGCGAT CGCCGGGGAG ATCGGCTACC CGGTCCTGGT CCGGCCCTCC
TACGTCCTCG GCGGCGCCGG CATGGCCATC GTCTACGACG ACGAGACGCT GACGACCTAC
ATGTCCAAGG CCGGCGCGGC CAGCGACCAC CCGGTGCTGG TGGACAAGTT CCTCGACGAG
GCCGTCGAGA TCGACGTGGA CGCGCTGTTC GACGGCGAGG AGCTCTACCT CGGCGGCGTG
ATGGAGCACA TCGAGGAGGC CGGCATCCAC TCCGGCGACT CGGCGTGCTC GCTGCCCCCG
ATGACGCTCG GCAGCCACGA CATCAAGCGC ATCCGCGCCG CCACCGAGGA GATCGCCCGC
GGCGTGGGCG TGCGCGGCCT GCTCAACGTG CAGTACGCCA TGTCGGCCAA CATCCTCTAC
GTGCTGGAGG CCAACCCGCG TGCCAGCCGT ACGGTGCCGT TCGTCTCCAA GGCCACGGCG
GTGCCGCTGG CCAAGGCGGC GGCCCGCGTG ATGATGGGCG CCACGGTGGC CGAGCTGCGC
GCCGAGGGCA TGCTCCCGGC CGAGGGCGAC GGCGGCACCA TGCCGCTGGA CGCGCCCATC
GCGGTCAAGG AGGCGGTGCT GCCGTTCAAC CGCTTCCGCG GCGTGGACAC CGTGCTGGGC
CCGGAGATGC GCTCCACCGG CGAGGTCATG GGCATCGACC GGTTCTTCGG CACGGCCTAC
GCCAAGTCGC AGGCCGCCGC CTACGGCTCG CTGCCGACCG GCGGGCGGGC GTTCGTCTCG
GTGGCGAACC GGGACAAGCG TGCGATGATC TTCCCGGTCA AGGCGCTCGC GGACCTCGGT
TTCGAGATTC TGGCCACCGA GGGAACGGCC GAGGTGCTGC GCCGTAACGG CGTCCATGCC
AAGATCGTGC GAAAGCAGAG CGACGGAACC GGTCCCGAGG GTGAGCCGAC CATCGGCCGG
CGCATCCTGG ATGGTGAGGT GGATCTCATC GTGAACACGC CCTTCGGCAG CCCCGGCCAG
TCCGGGCCGC GGCTGGACGG CTACGAGATC CGCACCGCCG CCGTGCTGCG GGGCATCCCC
TGCATCACGA CCGTCCAGGG GCTCGCCGCG GCCGTCCAGG GCATCCAGGC CATCGTCCGC
GGCGACATCG GCGTACGGTC GCTCCAGGAG CACGCGGAGC AGCTGAGGGG ATGA
 
Protein sequence
MPKRTDIQSV MVIGSGPIVI GQACEFDYSG TQACRVLRAE GFRVILVNSN PATIMTDPEF 
ADATYVEPIT PDMVEKIIAK ERPDALLPTL GGQTALNTAI ALHEAGVLAK YDVELIGADV
DAIQAGENRE LFKGIVAKVA RERGLNADSA RSFVCHTLDE CLTAAGELGY PLVVRPSFTM
GGVGSGFAHD DEGLRRIAGA GLDASPTTEV LLEESILGWK EYELEVMRDK ADNVVIVCSI
ENIDPMGVHT GDSVTVAPAL TLTDREYQNM RDVAIAVIRE VGVDTGGCNI QFAVDPATGR
MVVIEMNPRV SRSSALASKA TGFPIAKIAA KLAIGYTLDE IPNDITKETP ASFEPSLDYI
VVKVPRFAFD KFPGADQTLT THMKSVGEAM AIGRSFPEAL QKALRSLEKK GAVFTWAGEP
GDKNDLLKAC RTPHDGRLFT MQQAIRAGAT PAELFEATAV DPWFLDQLQA IDEVAAELRQ
VPQLTAEVLT KVKRYGFSDL QIAELRDMTE PRVRDLRHAL GVRPVYNTVD TCAAEFAART
PYLYSTYDEE TEVPYGERPK VLILGSGPNR IGQGIEFDYA CVHASFELSA AGYETVMVNC
NPETVSTDYD TSDRLYFEPL TLEDVLEVVH AEQQTGPVAG VIVQLGGQTP LGLAQALKDA
GVPIVGTSPE SIHLAEERGA FGRVLAEAGL PAPKHGTAVT VDEALAIAGE IGYPVLVRPS
YVLGGAGMAI VYDDETLTTY MSKAGAASDH PVLVDKFLDE AVEIDVDALF DGEELYLGGV
MEHIEEAGIH SGDSACSLPP MTLGSHDIKR IRAATEEIAR GVGVRGLLNV QYAMSANILY
VLEANPRASR TVPFVSKATA VPLAKAAARV MMGATVAELR AEGMLPAEGD GGTMPLDAPI
AVKEAVLPFN RFRGVDTVLG PEMRSTGEVM GIDRFFGTAY AKSQAAAYGS LPTGGRAFVS
VANRDKRAMI FPVKALADLG FEILATEGTA EVLRRNGVHA KIVRKQSDGT GPEGEPTIGR
RILDGEVDLI VNTPFGSPGQ SGPRLDGYEI RTAAVLRGIP CITTVQGLAA AVQGIQAIVR
GDIGVRSLQE HAEQLRG