Gene Smed_4238 details


Gene Information

Locus tag: Smed_4238
Symbol:
ID: 5318282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 721133
End bp: 723145
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 63%
IMG OID: 640776043
Product: carbamoyl-phosphate synthase L chain ATP-binding
Protein accession: YP_001312976
Protein GI: 150376380
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
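
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 721133
end_bp = 723145
gene_length_bp = 2013
protein_length_aa = 670

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_aa = codons - 1

print(span == gene_length_bp)            # coordinates agree with Gene Length
print(remainder == 0)                    # length is a multiple of 3
print(expected_aa == protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions all three checks print True for this record.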


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACA TGTTCAAGAA AATCCTCATT GCCAATCGTG GTGAAATCGC CTGCCGCGTC 
ATCCGCACAG CGAAGTCTCT CGACATCCCG ACCGTCGCCG TCTACTCGGA TGCGGACCGC
GACGCGATGC ATGTGCGCAT GGCGGACGAG GCCGTTCCTA TCGGCCCCTC ACCCTCAAAC
CAGTCCTATA TCGTCATAGA CAGGATTCTT GAGGCAATCC GCAAGACCGG CGCCGATGCG
GTGCATCCTG GCTACGGCTT CCTTTCTGAA AATTCCGCCT TCGCCGAGGC TCTGGAGAAA
GAGGGCGTAA CCTTCATCGG TCCGCCAGTC AGGGCCATCG AAGCGATGGG CGACAAGATC
ACCTCGAAGA AGCTTGCCGC CGAAGCAGGC GTTTCCACCG TTCCCGGCCA TATGGGGCTG
ATCGGGGATG CGGACGAGGC CGCACGCATC GCCGCTTCGA TCGGCTTTCC GGTGATGATA
AAGGCGTCTG CCGGCGGCGG CGGCAAGGGA ATGCGGATCG CCTGGACCGA GGACGAGGCA
CGCGAGGGCT TTCAGTCATC GAAGAACGAG GCGAAGAGTT CCTTCGGCGA CGACCGCATC
TTCATCGAGA AATTCGTGAC CGAGCCCCGC CACATCGAGA TCCAGGTGCT CGGCGACAAG
CATGGCAACA TCGTCTATCT GGGCGAGCGC GAATGCTCGA TCCAGCGGCG GAACCAGAAG
GTTATCGAGG AGGCGCCCTC GCCCTTCCTC GACGAGAAGA CGCGCCGCGC CATGGGCGAG
CAGGCGGTGG CGCTGGCAAA GGCCGTCGCC TATTTTTCGG CCGGCACGGT CGAGTTCATC
GTCGACGCCC GCGGAAACTT CTATTTTCTC GAGATGAATA CCCGGCTGCA GGTCGAGCAT
CCGGTGACGG AACTCGTCAC CGGCCTCGAT CTCGTCGAGC AGATGATCCG CGTCGCGGCG
GGGGAGAAAC TCGCCTTTGC ACAGAAGGAT GTGAAGCTCG ACGGATGGGC GATCGAAAGC
CGGCTCTATG CGGAAGACCC CTACCGCAAT TTTCTTCCTT CGATCGGCCG GCTGAGCCGC
TACCGCCCGC CGGAGGAAGG CCCGCGGGCG GACGGTACCG TCATTCGCAA CGATACCGGC
GTCTTGGAAG GCGGCGAGAT CTCGATGTAT TACGATCCGA TGATCGCCAA GCTTTGCACC
TGGGGTCCGG ACCGGGCGAC CGCCGTCCGG GCGATGGCGG ATGCGCTCGA CGCGTTCGAG
ATCGAAGGCG TCGGCCACAA CCTGCCCTTC CTTGCAGCCG TCATGCAGCA GGATCGTTTC
CGCGAGGGAC GGCTGACCAC GGCCTATATC GCCGAGGAGT TTGCCGGCGG TTTTCAGGGC
GTGGTGCCGG ACGACACAGC GGCGCGCAAA CTCGCCGCCA TCGCGGTGAG CGTCAATCAG
ACGCTGCAGG AGCGCGCCAG CCGCATCTCG GGCACCATCG GCAACCATCG CCGGGTCATC
GGTCACGAAT GGGTGGCAAG CCTCGACGGG CACGAAATTC AGGTCACATG CGAAGCTTCC
GCCGACGGCA CCTATGTACG CTTTGCCGAC GGGACATCTG TCTCCGTCGC AACTGACTGG
ACCCCCGGTC GCACCCGTGC CGCTTTCAAC ATAGAAAATC AGCCGATGAG CGTGAAGGTC
GAGCTTGCCG GCACCGGAAT AAGGCTGCGC TGGCGCGGGA TCGACGTCGT TGCACGGGTC
AGAAGCCCAC ACATTGCCGA ACTCGCCCGG CTGATGCCGA AGAAGCTGCC GCCGGACACG
TCGAAGATGC TGCTCTGCCC GATGCCGGGG GTAGTGACGT CGATCACGGT GAAGGCCGGG
GAGACAGTGG AGGCCGGACA GGCGATCGCC GTCGTCGAGG CAATGAAGAT GGAGAATATA
TTGAGGGCGG AAAGACGCTC GATCGTGAAG CGCGTGGCGA TCGAAGCCGG GGCGAGCCTG
GCCGTGGACG AGTTGATCAT GGAGTTCGAG TGA
 
Protein sequence
MENMFKKILI ANRGEIACRV IRTAKSLDIP TVAVYSDADR DAMHVRMADE AVPIGPSPSN 
QSYIVIDRIL EAIRKTGADA VHPGYGFLSE NSAFAEALEK EGVTFIGPPV RAIEAMGDKI
TSKKLAAEAG VSTVPGHMGL IGDADEAARI AASIGFPVMI KASAGGGGKG MRIAWTEDEA
REGFQSSKNE AKSSFGDDRI FIEKFVTEPR HIEIQVLGDK HGNIVYLGER ECSIQRRNQK
VIEEAPSPFL DEKTRRAMGE QAVALAKAVA YFSAGTVEFI VDARGNFYFL EMNTRLQVEH
PVTELVTGLD LVEQMIRVAA GEKLAFAQKD VKLDGWAIES RLYAEDPYRN FLPSIGRLSR
YRPPEEGPRA DGTVIRNDTG VLEGGEISMY YDPMIAKLCT WGPDRATAVR AMADALDAFE
IEGVGHNLPF LAAVMQQDRF REGRLTTAYI AEEFAGGFQG VVPDDTAARK LAAIAVSVNQ
TLQERASRIS GTIGNHRRVI GHEWVASLDG HEIQVTCEAS ADGTYVRFAD GTSVSVATDW
TPGRTRAAFN IENQPMSVKV ELAGTGIRLR WRGIDVVARV RSPHIAELAR LMPKKLPPDT
SKMLLCPMPG VVTSITVKAG ETVEAGQAIA VVEAMKMENI LRAERRSIVK RVAIEAGASL
AVDELIMEFE
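
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the original record. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGAAAACA TGTTCAAGAA AATCCTCATT GCCAATCGTG GTGAAATCGC CTGCCGCGTC"

seq = join_blocks(first_line)
print(len(seq))          # 60 bases on a full line
print(gc_fraction(seq))  # GC fraction of this line only
```

Applying `join_blocks` to the full gene sequence and comparing its length and `gc_fraction` with the Gene Length and GC content fields is one way to confirm the sequence was copied without loss.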