Gene Haur_3205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3205 
Symbol 
ID5735073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4051874 
End bp4054972 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content51% 
IMG OID641280351 
Productpreprotein translocase, SecA subunit 
Protein accessionYP_001545970 
Protein GI159899723 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) 
TIGRFAM ID[TIGR00963] preprotein translocase, SecA subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAT GGCTTGGGAA ACTACTGGGC GACCCCAACG CTAAGGTTGT GAAGAAAATG 
CAGCCAACCT TAGATGAAAT TAACGCCCTT GAGCCGAAAA TGAAGGCTCT TAGCGATGAA
CAACTCCGTG AAAAAACGGC TGAATTACGC ACACGGTTCG CAGAGCTAAC CAAAGCGGAT
CGTGAAGCGC TCGACGACCG CTACGCCGAC GAAAATCGCC ATGATTCCAC CGTCGAAAAA
GACTATCAAA AAGAATTACG GGTGATCGAA GATGCAGCCC TTGATGAATT GTTGCCTGAA
GCCTTTGCTT TGGTGCGCGA AGCCTCAAGC CGCGTGATTG GTCAACGTCA CTATGATGTC
CAGATGATCG GCGGGATTGT GCTGCACGAA GGCCGCATTG CCGAAATGAA GACTGGCGAA
GGTAAGACCT TGGTGGCTTC CTTGCCCTTG TTTCTCAATG CAATTGCTGG CCGTGGCGCA
CACTTGATTA CCGTCAACGA CTACCTCGCT AAAGTTGGTG GTGGCTGGAT GGGGCCAATC
TTCCATAGCC TTGGCATGAG CACAGGCTAT ATCGCCCACG ATTATTCAGC AATTTACGAC
CCCAATTATA TCGACCCCAA CGCCAAACAA GATGATAGCC GTTTGGTGCA CTGGCGGCCT
TGCTCACGCC GCGAAGCCTA TATGGCCGAT ATGACCTACG GGACAAATAA TGAATATGGC
TTCGATTATC TGCGCGATAA CATGGTGCAG CACAAAGATC AATGTGTGCA GCGCGAATTG
CACTATGCGA TTGTTGACGA AGTAGATAAT ATTTTGATCG ACGAAGCCCG TACTCCCTTG
ATTATCTCTG GCCCAGCCCA AGAATCCAGT GATAACTATC GCCGTTTTTC TTCGTTGGTA
CGTGGCCTCA AGCGCTCCAG CATTTCGCCA GATGAAGTGC GCAAGGGCTT GAAAGACGAT
TTTGATGGCG ATTATTGGAT CGACGAAAAA TCGCGCTCGA TTACCTTGAC CGAATCGGGC
TTAGAGGTGA TGGAAAAGCG GCTCAATTTG CCTGATGGCG AAAATATGTA CGATGCCAAG
AACTTCGAAT TAACTCACTA TTTGGAAAAC GCCCTCAAGG CTGAATATGT TTTCCATCGT
GATGTTGATT ATGTGGTGCA AAATGGCGAA GTTGTGATCG TCGATGAATT TACAGGCCGA
ACAATGCCGG GTCGGCGTTG GTCTGATGGT TTGCACCAAG CGGTTGAGGC CAAAGAGGCG
GTCGAAGTGC GGCGCGAAAA CGTCACCTTG GCCACAATTA CCTTCCAAAA TTACTTCCGA
ATGTACAACA AACTTGGTGG TATGACTGGT ACGGCAATCA CCGAAGCTGA AGAATTTAGC
AAAATCTACA ATTTGGAAGT AGTGATTATC CCAACCAACC GCCAAGTCGT GCGCGAAGAT
TATCGCGACC ATATTTATGC CTCGCAAAAA GCTAAATACA ACGCTGTGTT GCGTGAAATC
AAAGAGATGC ACGAAGTTGG CCGCCCAGTC TTGGTCGGTA CAACCTCGGT CGAAAGCTCG
GAAATTGTCA GCAATTTGCT GAAGCAAGAG GGGCTTGAAC ACTATCTCTT GAATGCTAAG
CAACACGAAC GTGAAGCGTA TATCGTGGCC CAAGCTGGCC GTACTGGCGC AATCACGATT
GCAACCAACA TGGCAGGTCG GGGAACTGAC ATTTTGCTTG GTGGAAACCC CGATGGTTTA
ATTGAGGAAC ATCTCAAAGC CCTCGGCACA ACCATTACCG ATGCAACGCC TGAGCAACTA
GCCCAAGCCC AAGCCCAAGC GAAAGCCGAT GTTGAGGCCG AACGTAAAGC TGTGATGGAA
GCTGGTGGCT TACACATCAT TGGCACTGAG CGCCACGAAG CTCGTCGGAT CGATAATCAG
CTGCGTGGGC GGGCTGGCCG TCAAGGCGAC CCCGGTTCAT CGCGCTTCTT CATTTCGCTT
GAAGATGAGT TGATGACACG CTTCGGACGC ATCGATACGA TCAAGCGTTT GATGGAGCGT
ATGTCTGATG GCGATGAGGA ATTGCCACTC GAATCGGGCT TACTCGATAA AGCGATCGAA
AGTGCCCAAA CTCGGGTCGA AGGCTATAAC TTCGATGTGC GGAAGCATGT GGTCGAATAC
GACGACGTGG TAAACAAGCA GCGCGAAGTG ATTTACGCCG ATCGCCATGC CATTCTTGGC
GGCGAAGATA TGGGCGACCG CATTCTTGAG ATGGTCGTTG ATGAAATCGA CATCCATGTC
GAAGAATTTC TCGATAATCG CGAGCTAGAC AAGCCTGATC TCGAAGGCTT CTTGCGTCAG
TTATATTCGA TTGTGCCCCA ACTCAAGGCC CAAGAAACTG AATTGGCAGC CCGCTTCAAA
GGCAAACAAG CTGATGAAAT TGGCGAAATT GCCACCGAAG TGGTGGAAGA AGCCTACAAT
CGGCTTGGCG AAGAGTTGGC AACCCAATAC ACGACCTTGT TGCAACGCGG AGTCCAGCCA
ATTCCTGGGG TCAGCGGCCC AGAAGCCTTC TTTGCCCACT TCGAGCGCCA AGAAATGCTG
GGGGCAATCG ACCGTGAATG GATCGATTAT CTGACGGCAG TTGATGAATT GCGCCAAGGC
ATCGGCAACG TCGCAATTGC CCAACAAGAT CCATTGGTAG CCTTCAAGCG CGAAGCCTTT
AAGATGTTCG ACGAACTCAA AGGCAACATC CAAAACCGGA TTGTCTACAA CTTCTTTACT
GATGCGGCCA ACTGGCAAGT GCGTTTGCGC CAAGTTGAGC TTGAAATGGA AGCCCGTTTG
GCTCTGGCTC AAACTGCTGG CGGCTCAGAA AACGCCACCG AAGACGCACC CAAGCCAGCC
AAACGTGGCG TTGGTGGAGC GGCACGCCGG GTCAGCAATG CCGCAGGCCA AGCTGCACCA
GCCCGTCGAA TCGTGATCAA AATCGGACGC AATGATCCTT GCCCATGCGA TAGCGGCAAG
AAGTTCAAAG CATGCCACGG CTTGCCGGGC AAAGAAGCCG AACTTGAGGC GATTTTGGCG
GTCAAGCATA CCCACGCGCA AGCGGTTGGT AAAAAATAA
 
Protein sequence
MFKWLGKLLG DPNAKVVKKM QPTLDEINAL EPKMKALSDE QLREKTAELR TRFAELTKAD 
REALDDRYAD ENRHDSTVEK DYQKELRVIE DAALDELLPE AFALVREASS RVIGQRHYDV
QMIGGIVLHE GRIAEMKTGE GKTLVASLPL FLNAIAGRGA HLITVNDYLA KVGGGWMGPI
FHSLGMSTGY IAHDYSAIYD PNYIDPNAKQ DDSRLVHWRP CSRREAYMAD MTYGTNNEYG
FDYLRDNMVQ HKDQCVQREL HYAIVDEVDN ILIDEARTPL IISGPAQESS DNYRRFSSLV
RGLKRSSISP DEVRKGLKDD FDGDYWIDEK SRSITLTESG LEVMEKRLNL PDGENMYDAK
NFELTHYLEN ALKAEYVFHR DVDYVVQNGE VVIVDEFTGR TMPGRRWSDG LHQAVEAKEA
VEVRRENVTL ATITFQNYFR MYNKLGGMTG TAITEAEEFS KIYNLEVVII PTNRQVVRED
YRDHIYASQK AKYNAVLREI KEMHEVGRPV LVGTTSVESS EIVSNLLKQE GLEHYLLNAK
QHEREAYIVA QAGRTGAITI ATNMAGRGTD ILLGGNPDGL IEEHLKALGT TITDATPEQL
AQAQAQAKAD VEAERKAVME AGGLHIIGTE RHEARRIDNQ LRGRAGRQGD PGSSRFFISL
EDELMTRFGR IDTIKRLMER MSDGDEELPL ESGLLDKAIE SAQTRVEGYN FDVRKHVVEY
DDVVNKQREV IYADRHAILG GEDMGDRILE MVVDEIDIHV EEFLDNRELD KPDLEGFLRQ
LYSIVPQLKA QETELAARFK GKQADEIGEI ATEVVEEAYN RLGEELATQY TTLLQRGVQP
IPGVSGPEAF FAHFERQEML GAIDREWIDY LTAVDELRQG IGNVAIAQQD PLVAFKREAF
KMFDELKGNI QNRIVYNFFT DAANWQVRLR QVELEMEARL ALAQTAGGSE NATEDAPKPA
KRGVGGAARR VSNAAGQAAP ARRIVIKIGR NDPCPCDSGK KFKACHGLPG KEAELEAILA
VKHTHAQAVG KK