Gene Sare_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1849 
Symbol 
ID5704784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2132517 
End bp2135864 
Gene Length3348 bp 
Protein Length1115 aa 
Translation table11 
GC content69% 
IMG OID641271350 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001536725 
Protein GI159037472 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAGC GGACCGACCT CAGGCACGTC CTGGTGATCG GCTCCGGGCC GATCGTGATC 
GGGCAGGCCT GCGAGTTCGA CTACTCGGGA ACCCAGGCCT GCCGGGTGCT GCGCAACGAG
GGGATCCGGG TCAGCCTGGT CAACTCGAAC CCTGCGACGA TCATGACCGA TCCGGAGTTC
GCCGACGCCA CGTACGTCGA GCCGATCACC CCGGAGTTCG TCGAACTGGT CATCGCCAAG
GAGCGGCCGG ACGCGGTGCT GCCCACTCTG GGTGGCCAGA CCGCGCTGAA CACGGCTGTC
GCGCTGCATG AGGCGGGTGT GCTGGAGAAG TACGGCGTCG AGTTGATCGG CGCGAACATC
GACGCGATCC GTCGGGGCGA GGACCGGCAG CTGTTCAAGG ACATCGTCGC CAAGGCCGGC
GTCCGGATCG GGGTCGAGGA CCCGGCGGCG CTGGTGCCGC GTTCCCGGGT CTGCCACTCG
ATGGACGAGG TCCGCGACAC CATCGCCGAG CTGGGCCTGC CAGCGGTGAT TCGTCCCTCG
TTCACGATGG GTGGCCTCGG TTCCGGCATG GCCCACACCG ATGCGGACCT GGAGCGCATC
GCCGGCGCCG GGCTGGCCGC CAGCCCGGTG CACGAGGTGC TCATCGAGGA GAGCGTGCTC
GGCTGGAAGG AGTACGAGCT CGAGCTGATG CGCGACCGGC ACGACAACGT CGTGGTGGTC
TGCTCGATCG AGAACGTCGA CCCGATGGGC GTGCACACCG GCGACAGCGT CACCGTGGCC
CCGGCGATGA CGCTCACCGA CCGGGAGTAC CAGCGCATGC GTGACCTCGG CATCGCGGTG
TTGCGCGAGG TCGGGGTGGA TACCGGTGGC TGTAACATCC AGTTCGCGGT GAATCCGGTC
GACGGCCGGA TCGTCGTGAT CGAGATGAAC CCCCGGGTCT CCCGCTCGTC AGCGCTCGCC
TCGAAGGCGA CCGGCTTCCC GATCGCGAAG ATCGCCGCGA GGCTGGCCAT CGGCTACACC
CTCGACGAGA TCCCCAACGA CATCACCCGG AAGACGCCGG CCGCCTTCGA GCCGACGTTG
GACTACGTGG TGGTCAAGAT TCCCCGGTTC ACGTTCGAGA AGTTCCCCGG CGCCGACCCG
GAGTTGACCA CCACCATGAA GTCGGTTGGT GAGGCGATGA GCCTGGGCCG CAACTTCACC
GAGGCGCTGA ACAAGGCCAT GCGCTCGATG GAGACGAAGG CGGCCGGCTT CTGGACCGTG
CCCGACCCGG CCGACGCCAC CCGGGAGGGC ACCCTCGCCG CGCTGCGGAT GCCGCACGAC
GGGCGGCTCT ACACGGTGGA GCGGGCGCTG CGACTCGGCG CGTCGATCGC CGAGGTGGCC
GCCGCGTCCG GCGGGATGGA CCCGTGGTTC CTCGACCAGA TCGCCGGCCT CGTCGAGCTG
CGGGCCGAGA TCGTCGACGC TCCGGTGCTC GACGTCGACC TGCTGCGCCG GGCCAAGCGG
GCGGGCCTGT CCGACCGGCA ACTCGCCGCG TTGCGCCCGG AGCTGGCCGC CGAGGACGGT
GTCCGGACGC TGCGGCACCG GTTGGATGTC CGGCCGGTCT ACAAGACCGT GGACACCTGC
GCGGCCGAGT TCGGGGCGAC GACCCCCTAC CACTACTCCA CGTACGACGT GGAGACCGAG
GTGGTCGGGT CGGACCGGCC CAAGGTACTG ATCCTCGGCT CGGGGCCGAA CCGGATCGGG
CAGGGCATCG AGTTCGACTA CTCCTGCGTA CACGCCGTCA TGGCGCTGCG TGACGTCGGT
TACGAGACCG TCATGGTCAA CTGCAATCCG GAGACCGTCT CCACCGACTA CGACACCGCG
AACCGGCTCT ACTTCGAGCC GCTGACCTTC GAGGACGTCC TCGAGGTATG GCACGCTGAG
GACACCTCCG GCCGGGCGGC CGGTGGACCG GGCGTGGTCG GTGTGGTCGT CCAGCTCGGT
GGGCAGACGC CGTTGGGTCT GGCGCAGCGG CTCAAGGACG CCGGGCTGCC GGTCGTCGGC
ACCTCCCCGG AGTCGATCCA CCTGGCCGAG GAGCGGGGGG CGTTCGGCGC ACTACTGGCC
CGCGCTGGCC TGCGTGCTCC GGCACACGGC ACCGCCACCT CCTACGACGA GGCAAGAGCG
ATCGCCGACG AGATCGGCTA CCCGGTGCTG GTCCGCCCGT CGTATGTGCT CGGTGGACGG
GGGATGGAGA TCGTCTACGA CGACGCCACC CTGCGGGACT ACATTGGTCG GGCCACCGAC
ATCTCCCCGG ACCATCCGGT GTTGGTGGAC CGTTTCCTCG ACGACGCCAT CGAGATCGAC
GTGGACGCGC TGTGCGACGC CACCGGCGAG GTATACCTCG GTGGGGTGAT GGAGCACATC
GAGGAGGCTG GCATCCACTC CGGTGATTCG TCCTGCGCGT TGCCGCCGAT CACCCTGGCC
GGTTCGCACG TCGCCGAGGT GCGTCGGTAC ACCGAGGCGA TCGCCCGGGG CATCGGTGTG
CGGGGCCTGC TCAACGTGCA GTACGCCCTC AAGGACGACC AGCTCTGGGT GCTCGAGGCC
AACCCCCGCG CGTCCCGGAC CGTGCCGTTT GTCTCCAAGG CCACCGCGGT GCCGCTGGCC
AAGGCAGCGG CCCGGATCGC GCTCGGCGCC ACCATCGCCG AGCTGCGAAC GGAGGGCCTG
CTCCCGGCCG ACGGCGACGG CGGCAGCCTC CCGTCGGACG CGCCGATCGC GGTCAAGGAG
GCGGTGCTGC CGTTCAAGCG GTTCCGCACC CCGACCGGGA AGGGGATCGA TTCGCTGCTC
GGACCCGAGA TGAAGTCCAC CGGCGAGGTG ATGGGCATCG ACACTGCCTT CGGGCACGCT
TTCGCCAAGT CGCAGTCGGC CACGTACGGC TCGCTGCCGA CCAGCGGGAA GATCTTCGTG
TCGGTCGCCA ACCGGGACAA GCGCGGCATG ATCTTTCCGA TCAAGCGCCT GGCCGACCTG
GGTTTCGAGA TCATCGCTAC CACCGGCACC GCCGAGGTGC TGCGCCGACA CGGCATCGCC
TGTGAGCAGG TCCGCAAGCA CTACGAGTCG GGGGAGAGCG AGGACGCCGT CTCGCTGATC
CGTTCCGGTG AGGTGGCGTT CGTGGTGAAC ACCCCGCAAG GTTCCGGTGC CAGCGCCCGC
TCTGATGGTT ACGAGATCCG CAGTGCCGCC GTGACGGCGG ACATCCCGTG CATCACGACC
GTCCCCGGTG CCGCGGCGGC GGTGATGGCG GTCGAGGCCC GGATCCGCGG TGACATGCAG
GTCCGTCCGC TGCAGGCCCT CCACGCCTCG CTGCGGGCCC CGCGATGA
 
Protein sequence
MPKRTDLRHV LVIGSGPIVI GQACEFDYSG TQACRVLRNE GIRVSLVNSN PATIMTDPEF 
ADATYVEPIT PEFVELVIAK ERPDAVLPTL GGQTALNTAV ALHEAGVLEK YGVELIGANI
DAIRRGEDRQ LFKDIVAKAG VRIGVEDPAA LVPRSRVCHS MDEVRDTIAE LGLPAVIRPS
FTMGGLGSGM AHTDADLERI AGAGLAASPV HEVLIEESVL GWKEYELELM RDRHDNVVVV
CSIENVDPMG VHTGDSVTVA PAMTLTDREY QRMRDLGIAV LREVGVDTGG CNIQFAVNPV
DGRIVVIEMN PRVSRSSALA SKATGFPIAK IAARLAIGYT LDEIPNDITR KTPAAFEPTL
DYVVVKIPRF TFEKFPGADP ELTTTMKSVG EAMSLGRNFT EALNKAMRSM ETKAAGFWTV
PDPADATREG TLAALRMPHD GRLYTVERAL RLGASIAEVA AASGGMDPWF LDQIAGLVEL
RAEIVDAPVL DVDLLRRAKR AGLSDRQLAA LRPELAAEDG VRTLRHRLDV RPVYKTVDTC
AAEFGATTPY HYSTYDVETE VVGSDRPKVL ILGSGPNRIG QGIEFDYSCV HAVMALRDVG
YETVMVNCNP ETVSTDYDTA NRLYFEPLTF EDVLEVWHAE DTSGRAAGGP GVVGVVVQLG
GQTPLGLAQR LKDAGLPVVG TSPESIHLAE ERGAFGALLA RAGLRAPAHG TATSYDEARA
IADEIGYPVL VRPSYVLGGR GMEIVYDDAT LRDYIGRATD ISPDHPVLVD RFLDDAIEID
VDALCDATGE VYLGGVMEHI EEAGIHSGDS SCALPPITLA GSHVAEVRRY TEAIARGIGV
RGLLNVQYAL KDDQLWVLEA NPRASRTVPF VSKATAVPLA KAAARIALGA TIAELRTEGL
LPADGDGGSL PSDAPIAVKE AVLPFKRFRT PTGKGIDSLL GPEMKSTGEV MGIDTAFGHA
FAKSQSATYG SLPTSGKIFV SVANRDKRGM IFPIKRLADL GFEIIATTGT AEVLRRHGIA
CEQVRKHYES GESEDAVSLI RSGEVAFVVN TPQGSGASAR SDGYEIRSAA VTADIPCITT
VPGAAAAVMA VEARIRGDMQ VRPLQALHAS LRAPR