Gene Sare_3613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3613 
SymbolaceE 
ID5706639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4169297 
End bp4172035 
Gene Length2739 bp 
Protein Length912 aa 
Translation table11 
GC content67% 
IMG OID641273038 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_001538402 
Protein GI159039149 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000348525 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCACGG AACGCAAGCG CCCGGTGATC ACCGCCGGTC TGCCGAGCCA GCTTCCGGAT 
ATCGACCCCG AAGAAACCGG TGAGTGGGTC GAGTCGCTTG ACGGTGTTAT CGACGATCGC
GGAACCAAAC GCGCCCGCTA CGTCATGCTG CGCCTGCTGG AGCGGGCCCG CGAGCGTCAG
GTCGGGGTGC CGTCCCTGAC CACCACGGAC TACATCAACA CCATCACTCC GGAACGGGAG
CCCTGGTTCC CCGGCGACGA GCACGTCGAG CGGCGTATCC GGGCCTACAT CCGGTGGAAC
GCCGCGATGC TGGTGCACCG GGCACAGCGG CCGGAGATCG GTGTCGGCGG GCACATCTCG
ACGTTCGCCA GTTCGGCCTC GCTCTACGAG GTCGGCTTCA ACCACTTCTT CCGGGGCAAG
GACCACCCGG GTGGCGGCGA CCACATCTTC TACCAGGGGC ACGCCTCCCC GGGCATGTAC
GCGCGGGCGT TTCTTGAGGG GCGGCTCAGC GAACACCAGC TCGACGGGTT CCGCCAGGAG
CTGTCGCACC CCGGCGGCGG CCTGCCGTCC TACCCTCACC CCCGCCTGAT GCCGGACTTC
TGGGAGTTCC CCACCGTCTC GATGGGTCTC GGCGGTGTCA ACGCCATCTA CCAGGCGCGG
TTCAACCGTT ACCTGCACCA CCGCGGCATC AAGGACACCT CCGACCAGCA CGTGTGGGCG
TTCCTCGGCG ACGGCGAGAT GGACGAGCCG GAGTCGCTTG GCGCGATCGG AACGGCCGCC
CGGGAGGAAC TGGACAACCT CACCTTCGTC ATCAACTGCA ACCTGCAACG CCTGGACGGG
CCGGTCCGGG GCAACGGCAA GGTCATGCAG GAGTTGGAGG CATTCTTCCG AGGTGCCGGC
TGGAACGTCA TCAAGGTCGT CTGGGGCCGC GAGTGGGATC CGCTGCTCGC CCGGGACACC
GACGGTGCGC TGGTCAACCT CATGAACACC ACGCCCGACG GTGACTACCA GACCTACAAG
GCAGAATCCG GGGCGTACAT CCGGGAGCAC TTCTTCGGCC GCGATCCGCG GACCCGCAAG
ATGGTCGAGC ACCTCAGCGA CGACGAGATC TGGAACCTGA AGCGGGGTGG CCACGACTAC
CGCAAGCTCT ACGCGGCGTA CAAGGCCGCG ATGGAGCACA CCGGACAGCC CACGGTGATC
CTGGCCAAGA CCATCAAGGG TTGGACGCTC GGCTCGCACT TCGAGGGGCG CAACGCCACC
CACCAGATGA AGAAGCTGAC GTTGGAGGAC CTGAAGACCT TCCGCGACCG GCTCTACCTG
GATATCCCGG ACAAGGCACT GGAGGAGAAC CCCTACCTGC CGCCGTACTA CCGTCCGGAG
GCCAAGTCCG ACGAGCTCGA GTACCTACAC GAGCGTCGCC GGCAGCTCGG CGGCTACCTG
CCGTCCCGAC GGCCCGGCAC CAAGCGGCTC ACCATTCCCG GCCCGGAGCG CTTCGCCGAC
GTCAAGCGCG GTTCGGGCAA GCAGAAGGTG GCCACCACGA TGGCCTTCGT CCGCCTGCTC
AAGGACCTGA TGAAGGACCG GGAGTTCGGC CGACGCTGGG TGCCGATCGT CCCGGACGAG
GCCCGCACCT TCGGCATGGA CTCACTGTTC CCGACGCAGA AGATCTACTC GCCGCATGGC
CAGCGGTACA CGTCGGTCGA CCGGGAGCTG TTCCTGTCGT ACAAGGAGGC GACCGGCGGG
CAGATCCTGC ACGAGGGCAT CAACGAGGTC GGCTCGGTCG CCTCCTTCAC CGCGGCCGGT
TCCTCGTATG CCACGCACGA CGAGCCGATG ATCCCGATGT ACATCTTCTA CTCGATGTTC
GGGTTCCAGC GGACCGCGGA CGGGCTCTGG GCAGCGGCCG ACCAGATGAC CCGTGGCTTC
CTGCTCGGCG CGACCGCCGG ACGGACCACG CTGAACGGCG AGGGTCTCCA GCATGAGGAT
GGTCATTCGC TGTTGATCGC CGCCACCAAC CCGGCGGTGG TCGCCTACGA TCCGGCGTTC
GCCTACGAGA TCGCCCACAT CGTGGAGAAC GGCCTGCACC GCATGTACGG CGCGGCGCAG
GAGAACGTCT TCTACTACCT GACGGTCTAC AACGAGCCGA TGGTGCAGCC GGCGGAGCCG
ACGGACGTCG ACGTCGAGGG TGTGCTGAAG GGAATCTATC GGTACGCGCC GGCGCCCCAG
GTGGACGGTC CGAAGGCACA GCTACTCGCC TCCGGTACCG GCATGCAGTG GGCGCTCAAG
GCACAGGAGC TACTCGCCCA GGACTGGGGG GTTGCGGCCA GCGTCTGGTC AGTCACCTCC
TGGACGGAGC TACGCCGGGA CGCGGTCGAC GCGGAGGAGC ACAATCTGCT CAACCCGACG
GGTGAGCAGC GGGTGCCGTA CGTGACGACA AAGCTGGCCG ACGCCGATGG TCCGAAGGTC
GCGGTCAGTG ACTGGATGCG CGCGGTGCCG GATCTGATCG CCCGTTGGGT ACCCGGCGAC
TACACCTCGC TCGGCACCTG CGGGTTCGGC AAGTCCGACA CACGGCACGC ACTGCGCCGC
TACTTCCACG TGGACGCCGA GTCGATCGTG GTCGCCACGC TGCGGCAGCT CGCCCTCCGC
GGCGCGGTAC CGGCGGGAGT TCCCGCCGAG GCCGCCAAGA AGTACGCCAT TGACGACATC
GGGGCCGCCC CGGTCGGTGA GACCGGCGGC GACAGCTGA
 
Protein sequence
MATERKRPVI TAGLPSQLPD IDPEETGEWV ESLDGVIDDR GTKRARYVML RLLERARERQ 
VGVPSLTTTD YINTITPERE PWFPGDEHVE RRIRAYIRWN AAMLVHRAQR PEIGVGGHIS
TFASSASLYE VGFNHFFRGK DHPGGGDHIF YQGHASPGMY ARAFLEGRLS EHQLDGFRQE
LSHPGGGLPS YPHPRLMPDF WEFPTVSMGL GGVNAIYQAR FNRYLHHRGI KDTSDQHVWA
FLGDGEMDEP ESLGAIGTAA REELDNLTFV INCNLQRLDG PVRGNGKVMQ ELEAFFRGAG
WNVIKVVWGR EWDPLLARDT DGALVNLMNT TPDGDYQTYK AESGAYIREH FFGRDPRTRK
MVEHLSDDEI WNLKRGGHDY RKLYAAYKAA MEHTGQPTVI LAKTIKGWTL GSHFEGRNAT
HQMKKLTLED LKTFRDRLYL DIPDKALEEN PYLPPYYRPE AKSDELEYLH ERRRQLGGYL
PSRRPGTKRL TIPGPERFAD VKRGSGKQKV ATTMAFVRLL KDLMKDREFG RRWVPIVPDE
ARTFGMDSLF PTQKIYSPHG QRYTSVDREL FLSYKEATGG QILHEGINEV GSVASFTAAG
SSYATHDEPM IPMYIFYSMF GFQRTADGLW AAADQMTRGF LLGATAGRTT LNGEGLQHED
GHSLLIAATN PAVVAYDPAF AYEIAHIVEN GLHRMYGAAQ ENVFYYLTVY NEPMVQPAEP
TDVDVEGVLK GIYRYAPAPQ VDGPKAQLLA SGTGMQWALK AQELLAQDWG VAASVWSVTS
WTELRRDAVD AEEHNLLNPT GEQRVPYVTT KLADADGPKV AVSDWMRAVP DLIARWVPGD
YTSLGTCGFG KSDTRHALRR YFHVDAESIV VATLRQLALR GAVPAGVPAE AAKKYAIDDI
GAAPVGETGG DS