Gene Hhal_2027 details


Gene Information

Locus tag: Hhal_2027
Symbol:
ID: 4710376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2230020
End bp: 2232821
Gene Length: 2802 bp
Protein Length: 933 aa
Translation table: 11
GC content: 68%
IMG OID: 639856500
Product: preprotein translocase, SecA subunit
Protein accession: YP_001003593
Protein GI: 121998806
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase)
TIGRFAM ID: [TIGR00963] preprotein translocase, SecA subunit


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.942365
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTTTTCAG CCATCGCCAA ACGCGTCTTC GGAACCCGTA ACGATCGTGC GCTGAAGCGT 
CTGCGCAAGC GGATCGAGGC CATCAACGCG CACGAGCCGG AGCTGCAGAA GCTCTCCGAC
GAGCAGCTCC AGGCCAAGAC CGATGCGTTC AAGGCGCGGC TCGCGCAGGG CGAAACCCTC
GACGACCTGC TCGAGGAGGC GTTCGCGGTG GTGCGCGAGG CTTCACGGCG GGTGTTGGGA
CTGCGCCACT TCGATGTCCA GCTGCTCGGC GCGATGGTCC TGCACGACGG CAACATCTCC
GAGATGAAGA CCGGTGAGGG GAAGACTCTG GTGGCCACCC TGGCGGTCTA CCTCAATGCC
CTGACCGGTC GCGGTGTCCA TGTGGTGACC GTCAACGACT ACCTGGCCCG GCGCGACGCC
GAGTGGATGG GCCGGCTGTA CCGCTTCCTG GGCATGGAGG TGGGTGTGGT GGTGCCGCGC
CAGCCCCGCG AGGAGAAGGT CGCCGCCTAC CAGGCCGACA TCACCTACGG CACCAACAAC
GAGTTCGGCT TCGACTATCT GCGCGACAAC ATGGCCTTCC GCAAGGAGGA CAAGGTCCAG
CGGGACCTGT ACTACGCGCT GGTCGACGAG GTCGACTCCA TCCTCATCGA CGAGGCCCGC
ACCCCGCTGA TCATCTCGGG CCCGGCGGAG CAGGCCGGCG AGCTCTACGA GGCCATGTCC
CGCCTGGTGC CCCGCCTGCA GGCGCAGAAG CCCGAGGAGC GCCCCGAGGA GAACCCGGAA
CTGGGGCCGG GGGATTACTA CGTCGATGAG AAGGCTCGCC AGGTCTACCT CACCGAGGGC
GGGCACGACC GGGCCGAGGA GCTCCTGCGC GAAGAGGGGC TGATCGGGGA GAACGACTCC
CTCTACGACG CTCGCAACAT CAACGTGGTC CACCACCTGA ACGCGGCGCT GCGGGCCCAC
ACCCTCTACG AGCGCGATGT CCACTACCTG ATCCGCGACA ACCAGGTGGT CATCGTCGAC
GAGTTCACCG GCCGCGCCAT GCCCGGCCGG CGCTGGTCCG AGGGGTTGCA CCAGGCGGTG
GAGGCCAAGG AGGGGCTGCC CATCCAGGCC GAGAACCAGA CCCTGGCCTC GATCACCTTC
CAGAACTACT TCCGCCTCTA TGACAAGCTC GCCGGCATGA CCGGTACCGC GGATACTGAG
GCCTTCGAGT TCCAGCACAT CTACGGGCTG GAGGTGCTCT CCATCCCCAC CCACCGGCCG
ATGGTCCGCG ACGACGCCCA CGATCTGGTC TATCGGACGG CGGACGAGAA GTACGAGGCG
ATCATCGCCG ACATCCGCGA CTGCGTGCAG CGCGACCAGC CGGTGCTGGT GGGCACCACC
TCCATCGAGG CCTCGGAGCG CCTCTCCAAG GCACTGAAGG ACGCCGGCGT GGAGCACAAC
GTGCTCAACG CCAAGCACAA CGAGAGTGAG GCGCAGATCA TCGCCGATGC CGGCCGTCCG
GGTACCGTCA CCATCGCCAC CAACATGGCA GGCCGCGGGA CGGACATCGT TCTCGGCGGC
AACCTGGACC AGGAGCTGGC CGAGCTCGGC GAGGACCCGG ATCCCGCCGA GGTCGAGCGG
CGCAAGGCCG AGTGGCAGGA CCGCCACGAC CGGGTGGTCA ACGCCGGCGG TCTGCACGTT
ATCGGCACCG AGCGCCACGA GTCGCGGCGC ATCGACAACC AGCTGCGCGG GCGTTCCGGC
CGCCAGGGCG ACCCGGGCTC GTCGCGTTTC TACCTGTCGC TGGAGGACTC GCTGCTGCGC
ATCTTCGCCT CCGAGCGCAT GTCCGGGATG CTCGAGAAGC TCGGCATGCA GCACGGTGAG
GCCATCGAGT CGGGCATGGT TTCGCGGGTG ATCGAGAACG CCCAGCGCAA GGTCGAGGCG
CACAACTTCG ACATGCGCAA GCACCTGCTG GAGTTCGACG ACGTCGCCAA TGACCAGCGC
AAGGTCGTCT ACGAGCAGCG CAACGAGCTC CTCGAAGCCG ACGATGTCGC CGAGACCGTC
GATGCCATCC GCCAGGACGT GGTGGAGAAG GTCATCTCCG AGCACATCCC GCCCGGGTCC
ATCGACGAGC AGTGGGACGT CCCCGGCCTG GAGCGCACCC TTAAGGAAGA GTTCGGCCAG
GAGCTTCCCA TCCAGCGCTG GCTCGACGAT GAGGACGATC TCCACGAGGA GACCCTGCGC
GAGCGCATCC AGGGCGAGAT CGAAAAGGCC TACCGGGCCA AGGAGGCCGA GGCCGGTGCC
AGTGTGGTGC GTCACTTCGA GAAGGCGGTG ATGCTCCAGG TCCTGGACAA GCACTGGAAG
GAGCACCTGG CCGCCATGGA TTACCTGCGA CAGGGGGTGG GGCTGCGCGG CTACGCCCAG
CGCAACCCGA AGCAGGAGTT CAAGAAGGAC GCCTTCGCCA TGTTCCAGGA GATGCTCGAG
GGCCTGAAGC GGGACGCCGT CGGCGTGCTG CTGCGCGTCC AGGTTCGTGC CGAGGAGGAC
GTGGAGGCCG TCGAGGAGCA GCGTCGCCAG GAGGCGGAGC GCATGCAGAT GCGCCACGCC
GCACCGGCCT CCGCCGCGGC GGGGGCTGTC GCAGCGGGTA GTGGTGCCGC CGGGGCAGCC
GCAGCCGAGG GGGACAGCGC GCCGACCGGC GGGGCCCAGC AGCAGTCCGC CGGTGGCCGA
GGGCAGGAGA CGGTGGCCCG GGAAGGCCCG AAGGTCGGGC GCAACGAGTC GTGCCCCTGC
GGCTCCGGCA AGAAGTACAA GCACTGCTGC GGGAAGCTCT AA
 
Protein sequence
MFSAIAKRVF GTRNDRALKR LRKRIEAINA HEPELQKLSD EQLQAKTDAF KARLAQGETL 
DDLLEEAFAV VREASRRVLG LRHFDVQLLG AMVLHDGNIS EMKTGEGKTL VATLAVYLNA
LTGRGVHVVT VNDYLARRDA EWMGRLYRFL GMEVGVVVPR QPREEKVAAY QADITYGTNN
EFGFDYLRDN MAFRKEDKVQ RDLYYALVDE VDSILIDEAR TPLIISGPAE QAGELYEAMS
RLVPRLQAQK PEERPEENPE LGPGDYYVDE KARQVYLTEG GHDRAEELLR EEGLIGENDS
LYDARNINVV HHLNAALRAH TLYERDVHYL IRDNQVVIVD EFTGRAMPGR RWSEGLHQAV
EAKEGLPIQA ENQTLASITF QNYFRLYDKL AGMTGTADTE AFEFQHIYGL EVLSIPTHRP
MVRDDAHDLV YRTADEKYEA IIADIRDCVQ RDQPVLVGTT SIEASERLSK ALKDAGVEHN
VLNAKHNESE AQIIADAGRP GTVTIATNMA GRGTDIVLGG NLDQELAELG EDPDPAEVER
RKAEWQDRHD RVVNAGGLHV IGTERHESRR IDNQLRGRSG RQGDPGSSRF YLSLEDSLLR
IFASERMSGM LEKLGMQHGE AIESGMVSRV IENAQRKVEA HNFDMRKHLL EFDDVANDQR
KVVYEQRNEL LEADDVAETV DAIRQDVVEK VISEHIPPGS IDEQWDVPGL ERTLKEEFGQ
ELPIQRWLDD EDDLHEETLR ERIQGEIEKA YRAKEAEAGA SVVRHFEKAV MLQVLDKHWK
EHLAAMDYLR QGVGLRGYAQ RNPKQEFKKD AFAMFQEMLE GLKRDAVGVL LRVQVRAEED
VEAVEEQRRQ EAERMQMRHA APASAAAGAV AAGSGAAGAA AAEGDSAPTG GAQQQSAGGR
GQETVAREGP KVGRNESCPC GSGKKYKHCC GKL
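
The figures in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database entry. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon, neither of which the record states explicitly. The function names are hypothetical helpers introduced for this example.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for the 10-character display blocks."""
    return "".join(text.split())


def expected_gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon that is not translated."""
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information block of this record.
gene_length = expected_gene_length(2230020, 2232821)
protein_length = expected_protein_length(gene_length)

# First display line of each sequence, copied from the Sequence block.
first_dna_line = "ATGTTTTCAG CCATCGCCAA ACGCGTCTTC GGAACCCGTA ACGATCGTGC GCTGAAGCGT"
first_protein_line = "MFSAIAKRVF GTRNDRALKR LRKRIEAINA HEPELQKLSD EQLQAKTDAF KARLAQGETL"

print(gene_length, protein_length)
print(len(clean_sequence(first_dna_line)), len(clean_sequence(first_protein_line)))
```

Under these assumptions the computed lengths agree with the listed Gene Length (2802 bp) and Protein Length (933 aa), and each full display line holds 60 residues.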