Gene Sare_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1043 
Symbol 
ID5706542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1167406 
End bp1168920 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content69% 
IMG OID641270559 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001535943 
Protein GI159036690 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.683174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.268504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGGG AAACCATCGG TCGGCTGGTC AGCACCGAGC CGGCCACCGG GCGCATCACC 
TTCGCCGAGT TGTGGGGCGC CCGTACGCTC GACCTCGCCG ACCTGTACCG GCGTTCCGCC
CGAGTGGCTC GCTGGCTGCT CGGGCGCGGG GTCCGGCCCG GCGACCGCAT CGGCATCCAC
GCGGCGAACG GGCTGGAATG GGTGCTGCTC GACCTGGCGG CACTGCGGCT GAAGGTGGAG
ACGGCCGGGC TCGAGCCGGG CAAGTTCACA CCCGACAGCG ACCTGCTGGC CCGCTACGAC
CTGACGCTCC TGTGCACCGA TCGGCACGCC GAAGGGCCCG GCATCGTACC CGTCGGTGAG
GTGGCGGAGG CCGCCGGACG TCGCGACCTG GACGAACCCG CGCTGCCGCC GGTGACGTGG
CAACCCGAGG ACGTCACCAC GATCAAGTTC ACCTCGGGCA GTACCGGCGA GCCCAAGGGC
CTGGGTGCCA CCGCGGGCAG CATCGACAGC TCGCTGCGGG CCGTCCAGGA GATCTTCGAA
CACGGGCCGG GTGACGACCT GTTCGTCTTC CTGCCGCTCT CCCTGCTCCA GCAGCGCTAC
TGGATCTACT CGGCATTGCT GCACGGCCAC GACGTCACGG TCAGCACCTA CGAGGCGGCC
TTCGCGGCGC TGCGCCAGGT CCGACCCACC GTGGTGATGG GGGTGCCCGC CTTCTACGAG
ACCGCCAAGC GCCAGATCGA GGCCAGGCAG CGTCGCGGTT CGTCGGTGAC GGAGGCCGCA
CAGGCGGTGT TCGGCGACCG CATCCGATAT CTGTGGACCG GTTCCGCGCC CGCGGCCCCG
AGCACCCTGC GCTTCTTCGT CGACGCCGGT CTTCCCATCT ACGAGGGGTA CGGGCTCAAC
GAAACCTGCA TCGTCACCAA GAACCACCCG AAGGCCCATC GCGAGGGCAG TGTGGGCCAA
GTGCTGCGGG GCAAGGAGGT CCTGGTCGAC GCGGACGGTG TCGTCCACGT CCGCAGTGAC
CACCCTGTCA ACACCCGCTA CATCTATGCC GAACCCGGCA GCTCGGAGCA GATCTTCGCA
CCCGACGGCA CGGTGCGCAC CGGTGACCTC GGGCACCTTG ACGAGGACGG TTTCCTCTTC
ATCCGGGGGC GGGCCGACGA CGTGATCGTC CTGGACAACG GCAGGAAGGT CATTGTCCGG
CCGATCGAGG AACAGTTGAG GTCAGACCCG GCGATCGCCG AGTGCGTCTT GTTCTGTCCT
GGTCAGACCG AGTTGGTCGC CGTGGTCTCG CCGGCCCACG TACCCGCCGA CCGGGCGGCG
ATCGCCGCCC GTCTCGCTTC GACCAACGCC GCGCTCACCA GTGACGAGCG GATCAGCCGG
ATGATCCTCG CCGACGAGCC GTTCAGCATC GACAACGGCC TGCTCACCTC GCAGTACAAG
CCCAGGAGGC GGCAGATCCT CGCTGCCCAC CACGCCGCAG TGCACGACCC CAAGGAGGGA
ATCCATGCTC CCTGA
 
Protein sequence
MTRETIGRLV STEPATGRIT FAELWGARTL DLADLYRRSA RVARWLLGRG VRPGDRIGIH 
AANGLEWVLL DLAALRLKVE TAGLEPGKFT PDSDLLARYD LTLLCTDRHA EGPGIVPVGE
VAEAAGRRDL DEPALPPVTW QPEDVTTIKF TSGSTGEPKG LGATAGSIDS SLRAVQEIFE
HGPGDDLFVF LPLSLLQQRY WIYSALLHGH DVTVSTYEAA FAALRQVRPT VVMGVPAFYE
TAKRQIEARQ RRGSSVTEAA QAVFGDRIRY LWTGSAPAAP STLRFFVDAG LPIYEGYGLN
ETCIVTKNHP KAHREGSVGQ VLRGKEVLVD ADGVVHVRSD HPVNTRYIYA EPGSSEQIFA
PDGTVRTGDL GHLDEDGFLF IRGRADDVIV LDNGRKVIVR PIEEQLRSDP AIAECVLFCP
GQTELVAVVS PAHVPADRAA IAARLASTNA ALTSDERISR MILADEPFSI DNGLLTSQYK
PRRRQILAAH HAAVHDPKEG IHAP