Gene Emin_1519 details

Gene Information

Locus tag: Emin_1519
Symbol:
ID: 6263600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1609945
End bp: 1611483
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 45%
IMG OID: 642612006
Product: ATP synthase F1, alpha subunit
Protein accession: YP_001876403
Protein GI: 187251921
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
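
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1609945
END_BP = 1611483


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len: int, stop_codons: int = 1) -> int:
    """Amino-acid count for a CDS, assuming `stop_codons` terminal stop codons."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - stop_codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1539, matching "Gene Length"
    print(protein_length(length))  # 512, matching "Protein Length"
```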


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.283334
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000106657
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCTTAA AAGCAGAGGA AATTACAAGC ATTATAAAGA GTAAGATAGC AAACTTTACT 
CCGCAGGCTG ATATTAACGA AACTGGCACC GTTTTACAAG TTGGCGACGG TATTGCCCGC
ATTTATGGTT TAAAAAACGC CGTGGCGGGT GAGCTTTTGG AATTCCCCAA CAACGTAAAA
GGCCTTGCCC TTAACTTAGA AACGGACAAT ATCGGCTGCG TGCTTATGGG GGAGGATTCC
TCCATACAAG AAGGTGACCC TGTTAAAAGA ACCGGCCAAG TTATCAACGT TCCCGTTGGG
GACGCGCTTT TAGGCCGCGT GGTTGACCCT TTAGGCAAGC CTTTAGACGG CAAAGGCCCT
ATTAAAACAA ACTCTTCAAG ACCTTTGGAA ATTGTAGCTC CCGGCGTTAT TGAACGCCAG
CCCGTTAAAC AACCTCTGCA AACAGGGTTA AAAGCTATTG ACTCACTTGT TCCTATAGGC
AAAGGACAGC GTGAACTTAT TATCGGCGAC AGGCAGACAG GTAAAACTGC CATCGCCATT
GACGCTATTT TAAATCAAAA AAACCAGCCC GCAGACCAAA GAACGCTTTG CGTTTACGTA
GCCATCGGGC AAAAACAAAG CACGGTAGCC CAGGTTGTGC AAACCTTAAC GGAATTCGGC
GCGATGGAAT ATACTGTAAT CGTATCTGCC AGCGCGGCTG ACCCGGCTTC CCTTTTATAT
ATAGCTCCTT ACGCGGGCTC GTCAATAGCT GAGGAGTTTA TGTGGAATAA ACGCGACGTT
CTTATTATTT ATGACGATTT ATCAAAACAC GCCCAGGCTT ATAGACAAAT GTCGCTCCTT
TTACGCAGAC CTCCGGGCCG CGAAGCTTAT CCCGGCGACG TTTTTTACTT GCATTCAAGA
TTGTTAGAAC GCGCGTGCAA ACTTTCTGAC AAAAACGGCG GCGGCTCTAT TACGGCGCTG
CCTATTATTG AAACACAGGC TAACGACATG TCTGCCTATA TTCCAACAAA CGTAATTTCA
ATTACTGACG GGCAAATTTA CTTAGAAAGC GGTCTTTTCC ACAGCGGTAT GAAACCGGCG
GTTAACGTAG GTCTTTCCGT ATCGCGCGTG GGCGGTTCGG CGCAGAAAAA GATTATGAGA
AGCGTTTCCG GCACACTGCG TTTGGATATG TCCCAATATA AAGAATTGGA AGCTTTTTCC
CAATTCGGCA GCGATTTGGA CAAAGAATCA CAGCAACAGC TTACAAGAGG CAAAAGAATA
AACGAACTTT TTAAACAAGA CCAATATACT CCTATGCCGG TTGAGGAGCA GGTTTTGGTA
TTCTTTGCCG GCACAAACGG ATTTTTAGAC AATATTGAAG TAAATTTGGT TAAAGAGTAT
GAAAAACAGC TTCTTACTTA CTTTAAAGCG GAAAAGAAAG ATTTGTTTGA AGAACTTAAG
AACGCTCCCG AAATGAGTGA AAACCTTACA AATAAATTAA AAGAGGCTTT AACAGCATTC
GGTGAAGTTT TTAAAAACTC GCACAGTACG GCGCAGTAG
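
The gene sequence is displayed in space-separated blocks of 10 bases. The sketch below shows one way to strip that layout and compute a GC fraction, which can then be compared against the GC content listed in the record. The helper names are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed row of the gene sequence, copied from the record.
FIRST_ROW = "ATGTCCTTAA AAGCAGAGGA AATTACAAGC ATTATAAAGA GTAAGATAGC AAACTTTACT"

if __name__ == "__main__":
    print(len(clean_sequence(FIRST_ROW)))  # number of bases in the row
    print(f"{gc_fraction(FIRST_ROW):.2%}")  # GC percentage of this row only
```

Passing the full sequence text to `gc_fraction` instead of `FIRST_ROW` gives the whole-gene figure.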
 
Protein sequence
MSLKAEEITS IIKSKIANFT PQADINETGT VLQVGDGIAR IYGLKNAVAG ELLEFPNNVK 
GLALNLETDN IGCVLMGEDS SIQEGDPVKR TGQVINVPVG DALLGRVVDP LGKPLDGKGP
IKTNSSRPLE IVAPGVIERQ PVKQPLQTGL KAIDSLVPIG KGQRELIIGD RQTGKTAIAI
DAILNQKNQP ADQRTLCVYV AIGQKQSTVA QVVQTLTEFG AMEYTVIVSA SAADPASLLY
IAPYAGSSIA EEFMWNKRDV LIIYDDLSKH AQAYRQMSLL LRRPPGREAY PGDVFYLHSR
LLERACKLSD KNGGGSITAL PIIETQANDM SAYIPTNVIS ITDGQIYLES GLFHSGMKPA
VNVGLSVSRV GGSAQKKIMR SVSGTLRLDM SQYKELEAFS QFGSDLDKES QQQLTRGKRI
NELFKQDQYT PMPVEEQVLV FFAGTNGFLD NIEVNLVKEY EKQLLTYFKA EKKDLFEELK
NAPEMSENLT NKLKEALTAF GEVFKNSHST AQ
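
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). As a rough illustration, the sketch below builds a codon table from the NCBI-style amino-acid string for that table and translates the first and last stretches of the gene. The `translate` helper is hypothetical; it does not handle alternative start codons, which is not needed here because the gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record.
    print(translate("ATGTCCTTAA AAGCAGAGGA AATTACAAGC"))  # MSLKAEEITS
    # Last displayed row of the gene sequence, ending in the stop codon TAG.
    print(translate("GGTGAAGTTT TTAAAAACTC GCACAGTACG GCGCAGTAG"))  # GEVFKNSHSTAQ*
```

Both outputs agree with the start and end of the protein sequence shown above, with the trailing `*` marking the stop codon that the protein listing omits.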