Gene Sare_2687 details


Gene Information

Locus tag: Sare_2687
Symbol:
ID: 5706770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3060992
End bp: 3062368
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 70%
IMG OID: 641272145
Product: carbamoyl-phosphate synthase L chain ATP-binding
Protein accession: YP_001537515
Protein GI: 159038262
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
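
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The variable names are illustrative and not part of the database record.

```python
# Values copied from the record above.
start_bp = 3060992
end_bp = 3062368

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1377
print(gene_length % 3)  # 0, the length is a whole number of codons
print(protein_length)   # 458
```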


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.125716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000597874
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCAGCA AAGTGCTCAT CGCCAACCGG GGCGAGATCG CGCTGCGAGT GCTGCGGGCC 
TGCCGCGAGC TGGGAATTCG AACCGCGGTG GTGTACTCCA CGGCGGACGC TGACTCGGCC
GCGGTGCGGC TCGCCGACGA GTCGGTCCTG ATCGGACCGG CGGCGAGTCG CCGCAGCTAC
CTGAACCCGG CGGCCATCGT CGAGGCTGCC CGCCTGGTCG GTGCGCAGGC GGTGCACCCC
GGCTACGGCT TCCTCTCCGA AGACGCCGAC TTCGCCGAGA TCTGTGCCGA CAACGGCCTC
GTCTTCGTCG GTCCGCCACC CCAGGTGATG GCCGCGTTGG CTGACAAGTC GTCCGCGCGG
GCGCTGATGA GCCGCGCCGG CCTGCCGCTG CCACCGGGCA GCGTCGCACC GGTGCCGACC
GCTGCCGACG CCGCCCGGGT TGCCGCCGAG GTGGGCTACC CGGTGATCGT GAAGGCCGCC
GCTGGGGGCG GTGGCCGGGG GATGACCGTG GTGTCCACAC CGACCGAACT GCCCCGGGCG
TACGCCCGGA CCCGAGCCGC CGCCCAGATC GCGTTTGGCG ACGACCGGGT ATACGTCGAG
CGGTACCTCG GCAGCGCGCG GCACGTCGAG GTGCAGGTGC TCTGCGATGC CCACGGCAAC
GGTGTGCACC TGGGCACCCG GGATTGTTCG GTGCAGCGCC GACACCAGAA GCTCGTCGAG
GAGGCGCCCG CTCCGGCACT GTCGGCGGCC ACCCTGGACG CTATCGCCAG TGGCGCCCTC
CGCGGCGCAC TGGACGTCGG GTTCACCGGT GCCGGAACAG TGGAATTCCT GGTTGATTCA
GCGGAACAGT TCCACTTCCT GGAGATCAAC TGTCGGATTC AGGTGGAGCA TCCGGTCACC
GAGATGATCA CCGGTATCGA CCTCGTGCAC GAGCAACTAC GCCTGGCGGC CGGCCGGACG
CTGCGGTGGC GGCAGGAGGA GATCGTGACC AACGGCGTGG CGGTCGAGTG CCGGGTCAAC
GTCGAGGACC CCGATCGGGG CTTCGCGCCG ACGCCCGGCC GCCTGGAGCG GTTCGTCCCT
CCGGGTGGCC CGTTCACCCG GGTCGACACG CACGGGTACC CCGGCTACGT CGTCGGCCCC
CACTACGACT CCCTGTTGGC CAAGGTGGCG GTGTGGGCTC CAGACCGGGA ACTGGCCCTC
AACCGACTGG AACGCGCCCT CGACGAGTTC GAGGTCGCTG GCCCGGGCGT GCACACCACC
ATCCCGTTCG TCCGGCGGGT GCTCGACGAC GCCGGGTTCC GCAAGGGCCG TCACTGCACC
GGTCTGGTCG AGACGTTGCT CGCCGACCTG CCCAAGCAAT CGAGGAGGAA CGCATGA
 
Protein sequence
MFSKVLIANR GEIALRVLRA CRELGIRTAV VYSTADADSA AVRLADESVL IGPAASRRSY 
LNPAAIVEAA RLVGAQAVHP GYGFLSEDAD FAEICADNGL VFVGPPPQVM AALADKSSAR
ALMSRAGLPL PPGSVAPVPT AADAARVAAE VGYPVIVKAA AGGGGRGMTV VSTPTELPRA
YARTRAAAQI AFGDDRVYVE RYLGSARHVE VQVLCDAHGN GVHLGTRDCS VQRRHQKLVE
EAPAPALSAA TLDAIASGAL RGALDVGFTG AGTVEFLVDS AEQFHFLEIN CRIQVEHPVT
EMITGIDLVH EQLRLAAGRT LRWRQEEIVT NGVAVECRVN VEDPDRGFAP TPGRLERFVP
PGGPFTRVDT HGYPGYVVGP HYDSLLAKVA VWAPDRELAL NRLERALDEF EVAGPGVHTT
IPFVRRVLDD AGFRKGRHCT GLVETLLADL PKQSRRNA
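
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is an illustration, not the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard set of amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs from the standard code only in which codons may act as starts).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "ATGTTCAGCA AAGTGCTCAT CGCCAACCGG"
print(translate(head))           # MFSKVLIANR
print(gc_content("ATGTTCAGCA"))  # 0.4
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field listed in the record.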