Gene Dshi_2936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2936 
SymbolatpA1 
ID5710787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3092718 
End bp3094256 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content65% 
IMG OID641268862 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001534270 
Protein GI159045476 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.147396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0629902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATCC AAGCTGCCGA AATCTCTGCG ATCCTCAAGG AGCAGATCAA GAACTTTGGC 
CAGGAAGCCG AAGTCGCCGA GGTGGGCCGC GTGCTCAGCG TCGGCGACGG GATTGCACGG
GTCCACGGGC TGGACAACGT GCAGGCCGGT GAAATGGTCG AGTTCCCGGG CGGCATCCGC
GGGATGGCCC TGAACCTCGA AATCGACAAT GTGGGTGTCG TGATCTTCGG CTCGGACCGG
GACATCAAGG AAGGCGACAT CGTCAAGCGC ACCAAGTCCA TCGTGGACGT GCCCGTGGGC
GACGCGCTGC TGGGCCGGGT CGTGGATGGC CTGGGCAACC CGCTGGACGG CAAGGGCCCG
ATCGAGACCA CCGAGCGCAG CATCGCGGAC GTGAAGGCGC CGGGCATCAT CCCGCGCAAA
TCCGTGCATG AGCCGATGGC GACCGGCCTG AAATCTGTCG ACGCCATGAT CCCGATCGGG
CGCGGCCAGC GCGAGCTGAT CATCGGCGAC CGCCAGACCG GCAAGACCGC CGTGGCGCTC
GACACGATCC TGAACCAGAA GGCCTATAAC GACGCCGCCG GCGACGACGA GAGCAAGAAG
CTCTACTGCG TCTACGTGGC CGTGGGGCAG AAGCGCTCCA CCGTGGCGCA GCTGGTCAAG
AAGCTCGAAG AAACTGGTGC CATCGAATAC TCCATCGTCG TGGCCGCCAC CGCCTCCGAC
CCGGCGCCGA TGCAGTTCCT CGCACCCTAT GCCGCGACCT CCATGGCGGA ATTCTTCCGC
GACAATGGCC GCCATGCGCT GATCATCTAT GATGACCTCT CGAAGCAGGC CGTGTCTTAC
CGTCAGATGT CGCTGCTGCT GCGTCGCCCG CCGGGCCGCG AAGCCTATCC GGGCGACGTG
TTCTACCTGC ACTCCCGCCT GCTGGAGCGG TCGGCGAAGC TGGGCGACGA TCATGGCAAC
GGGTCGCTGA CCGCGCTGCC GATCATCGAA ACGCAAGGCG GCGACGTGTC GGCCTTTATC
CCGACCAACG TGATCTCGAT CACCGACGGC CAGATCTTCC TGGAAACCGA GCTGTTCTAC
CAGGGCATCC GCCCCGCCGT GAACACCGGT CTGTCGGTGT CGCGCGTGGG CTCCTCGGCC
CAGACCAACG CGATGAAATC CGTCGCTGGC CCGGTGAAGC TGGAACTGGC GCAGTACCGC
GAAATGGCGG CCTTCGCGCA GTTCGGCTCC GACCTCGACG CCGCCACACA GCAGCTGCTG
AACCGTGGTG CGCGCCTGAC CGAGCTGATG AAGCAGCCGC AATACTCGCC GCTGACCAAT
GCCGAGATCG TCTGCGTGAT CTTCGCCGGC ACCAAGGGCT ACCTCGACAA GATCCCCGTG
GGGGACGTGG GCCGTTACGA GAAGGGCCTG CTGGCGCACC TGCGCGGCAA GCACAAGGGC
CTGCTGGACT ACATCACCAA GGAAGATCCC AAGATCAAGG GTGAGGCCGA AGACAAGATC
CGCGCAGCGC TCGACGAATT CGCCGCGACC TTCGCGTAA
 
Protein sequence
MGIQAAEISA ILKEQIKNFG QEAEVAEVGR VLSVGDGIAR VHGLDNVQAG EMVEFPGGIR 
GMALNLEIDN VGVVIFGSDR DIKEGDIVKR TKSIVDVPVG DALLGRVVDG LGNPLDGKGP
IETTERSIAD VKAPGIIPRK SVHEPMATGL KSVDAMIPIG RGQRELIIGD RQTGKTAVAL
DTILNQKAYN DAAGDDESKK LYCVYVAVGQ KRSTVAQLVK KLEETGAIEY SIVVAATASD
PAPMQFLAPY AATSMAEFFR DNGRHALIIY DDLSKQAVSY RQMSLLLRRP PGREAYPGDV
FYLHSRLLER SAKLGDDHGN GSLTALPIIE TQGGDVSAFI PTNVISITDG QIFLETELFY
QGIRPAVNTG LSVSRVGSSA QTNAMKSVAG PVKLELAQYR EMAAFAQFGS DLDAATQQLL
NRGARLTELM KQPQYSPLTN AEIVCVIFAG TKGYLDKIPV GDVGRYEKGL LAHLRGKHKG
LLDYITKEDP KIKGEAEDKI RAALDEFAAT FA