Gene Dret_2214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2214 
Symbol 
ID8420071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2516420 
End bp2517934 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content58% 
IMG OID645038814 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_003199076 
Protein GI258406334 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.188551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATCA AAGCAGAGGA AATCAGCAAG GTCATTCAGG ACCAGATCCA GAATTATGAA 
GCTCGGATGG AAACCAGCGA AACCGGTACG GTCATCTATG TTGGTGACGG TATCGCCCGG
GTCCACGGCG TGCAAAACGC CATGGCCATG GAATTGCTGG AGTTCCCCGG CGGCGTAAAA
GGGATGGTCC TGAACCTGGA CGAAGACAAT GTCGGCGTGG CCTTGCTCGG TGAGGACTCC
CACATCAAAG AAGGGGACGA GGTCAAACGT ACGGGCGAGA TTTTTTCCGT GCCGGTTGGC
GAAGCGGTCA AGGGTCGCGT TATTTCACCC CTGGGTGAGC CGCTCGACGG CCTGGGGCCC
ATTGACGACA CCGAGAGCCG TCCGGTGGAA ATCAAAGCTC CGGGCATTGT TCAGCGTAAA
TCGGTCCATG AACCCATGTA CACTGGGCTG AAGGCCATTG ACGCCATGAC GCCCATCGGC
CGTGGTCAGC GCGAATTGAT CATCGGCGAC CGTCAGGTTG GAAAGACCGC GATCTGTCTG
GACTCGATTC TGGCTCAGAA AGATTCCGAC GTGCATTGCT TCTACGTGGC CATCGGGCAG
AAAAAATCCA CTGTGGCTTT GGTGGCCGAC GCCCTGCGCC AGCACGGCGC CATGGAATAC
ACGACCATCA TCTCGGCCAC CGCGTCTGAA GCGGCCTCGC TGCAATTTAT TGCCGCCTAC
GCCGGATGCA CCATGGGCGA ACATTACCGG GACAACGGCC AGCACGCGCT TATCTGCTAC
GATGACTTGT CCAAGCAGGC TGTGGCTTAT CGCCAGATGT CACTGCTGCT GCGGCGCCCC
CCCGGACGTG AAGCCTTCCC CGGCGACGTC TTCTACCTCC ACTCCCGTCT GCTCGAGCGC
GCGGCCAAGC TGAGTGATGA GGAAGGGGCC GGCTCCCTGA CCGCCCTGCC GATCATCGAA
ACCCAGGCAG GCGACGTGTC AGCCTACATC CCGACAAACG TTATTTCCAT TACCGACGGC
CAGGTCTATC TGGAACCGAA CTTGTTCAAT GCTGGGGTCC GTCCGGCGAT TAACGTCGGT
CTCTCGGTCT CCCGTGTCGG CGGTGCTGCT CAGATCAAGG CCATGAAACA GGTTGCTGGT
ACATTGCGCC TTGACCTGGC CCAATATCGC GAATTGGCCG CGTTCGCCCA GTTCGGTTCC
GACCTGGACA AGGCGACACA GCAAAAACTC GAGCGCGGTG CGCGCATGGT CGAATTGCTC
AAGCAGCCCC AATACAAGCC CATGAAGGCC GAGGAACAGG TGGCCGTGCT CTTTGCCGGC
ACCCGCGGCT ATATGGATGA TGTTCCTACT GATGCCGTGC GGAAGTTCGA GGACGAATAT
CTCGAGTTCA TGTACAATTC CAAGCAGGAC GTTTTGCAGG CCATCAAAGA GCAGGCAAAA
TTGGACGAGG CCCTGGAAGA AAAACTCAAG GCGGCTGTGG AGGAGTTCAA AAAGGGATTC
CGCGCTGAAG GCTAG
 
Protein sequence
MQIKAEEISK VIQDQIQNYE ARMETSETGT VIYVGDGIAR VHGVQNAMAM ELLEFPGGVK 
GMVLNLDEDN VGVALLGEDS HIKEGDEVKR TGEIFSVPVG EAVKGRVISP LGEPLDGLGP
IDDTESRPVE IKAPGIVQRK SVHEPMYTGL KAIDAMTPIG RGQRELIIGD RQVGKTAICL
DSILAQKDSD VHCFYVAIGQ KKSTVALVAD ALRQHGAMEY TTIISATASE AASLQFIAAY
AGCTMGEHYR DNGQHALICY DDLSKQAVAY RQMSLLLRRP PGREAFPGDV FYLHSRLLER
AAKLSDEEGA GSLTALPIIE TQAGDVSAYI PTNVISITDG QVYLEPNLFN AGVRPAINVG
LSVSRVGGAA QIKAMKQVAG TLRLDLAQYR ELAAFAQFGS DLDKATQQKL ERGARMVELL
KQPQYKPMKA EEQVAVLFAG TRGYMDDVPT DAVRKFEDEY LEFMYNSKQD VLQAIKEQAK
LDEALEEKLK AAVEEFKKGF RAEG