Gene Sare_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1848 
Symbol 
ID5704711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2131364 
End bp2132524 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID641271349 
Productcarbamoyl-phosphate synthase, small subunit 
Protein accessionYP_001536724 
Protein GI159037471 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.293096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGGG GATCGGCGAT CCTCGTCCTG GAGGACGGGC GCACATTCCA CGGCGAGGCG 
TACGGCAGCG TCGGCGAGAC GTTCGGCGAA GCAGTTTTCA ACACCGGTAT GACCGGCTAC
CAGGAGACGC TCACCGACCC GTCCTACCAC CGGCAGGTGG TGGTGCAGAC CGCGCCGCAC
ATCGGCAACA CGGGGGTCAA CGGTGAGGAC GACGAGTCCG GCCGGATCTG GGTTGCCGGG
TACGTGGTCC GTGACCCCGC CCGGATCGGT TCCAACTGGC GGGCCACCGG TGGCCTGGAG
GACCGGCTCG CCGCCGAGGG GGTGGTCGGC ATCAGCGGCG TGGACACCCG CGCGCTCACC
CGGCACCTCC GGGAGCGCGG CGCGATGCGA GTCGGGGTGT CCAGCGTCGA GACCGATCCG
GTGGCGCTGC TGGCCCGGGT CCGGCAGGCC CCGCCGATGG TCGGCGCGGA TCTCTCCGCC
GAGGTGACCA CGCCGAAGCC GTACGTGGTC GAGGCCAAGG GGGAGCACCG GTTCACCGTC
GCCGCCCTGG ACCTGGGCAT CAAACGCAAC GTGCCTCGGC GGCTCGCGGC GCGGGGTGTC
ACCACCCATG TGTTGCCGGC TCATTCGGGC ATCGACGACC TGCAGGCCAC TGGCGCGGAC
GCGATTTTCC TGTCGCCCGG ACCGGGTGAC CCGGCGACCG CCGACGGGCC GGTGGCCCTG
GCCCGTGCGG TGCTCACCAG TGGGGTGCCG CTGTTCGGTA TCTGCTTCGG TAGCCAGATC
CTCGGTCGGG CACTGGGCTT CGGCACGTAC AAGCTCGGTT ACGGCCACCG CGGCATCAAC
CAGCCGGTGC TCGACCGGGC CACCGGCAAG GTCGAGGTGA CCAGCCACAA CCACGGCTTC
GCCGTCGCGT TCCCCGGTGC CGAGCCGGGC GCCGTCGTGC CGAACCAGGT GGTCGACACC
GAATTCGGTG GTATCGAGGT CTCGCACGTC TGCCTCAATG ACAACGTGGT CGAGGGGCTG
CGGGCACGCG AGGTGCCCGC CTTCACCGTT CAGTACCACC CGGAGGCGGC GGCCGGTCCG
CACGATGCGG ACTACCTGTT CGACCGCTTC ACTGAGCTCA TCGAGGGCCG TAACCACAGC
AGGGGGCGCA CGAATGCCTA A
 
Protein sequence
MRRGSAILVL EDGRTFHGEA YGSVGETFGE AVFNTGMTGY QETLTDPSYH RQVVVQTAPH 
IGNTGVNGED DESGRIWVAG YVVRDPARIG SNWRATGGLE DRLAAEGVVG ISGVDTRALT
RHLRERGAMR VGVSSVETDP VALLARVRQA PPMVGADLSA EVTTPKPYVV EAKGEHRFTV
AALDLGIKRN VPRRLAARGV TTHVLPAHSG IDDLQATGAD AIFLSPGPGD PATADGPVAL
ARAVLTSGVP LFGICFGSQI LGRALGFGTY KLGYGHRGIN QPVLDRATGK VEVTSHNHGF
AVAFPGAEPG AVVPNQVVDT EFGGIEVSHV CLNDNVVEGL RAREVPAFTV QYHPEAAAGP
HDADYLFDRF TELIEGRNHS RGRTNA