Gene Information |
Locus tag | Haur_3205 |
Symbol | |
ID | 5735073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4051874 |
End bp | 4054972 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280351 |
Product | preprotein translocase, SecA subunit |
Protein accession | YP_001545970 |
Protein GI | 159899723 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) |
TIGRFAM ID | [TIGR00963] preprotein translocase, SecA subunit |
|
|
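The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4051874
end_bp = 4054972
gene_length_bp = 3099
protein_length_aa = 1032

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # coordinates agree with Gene Length
print(span_bp % 3 == 0)                               # CDS is a whole number of codons
print(expected_protein_length == protein_length_aa)   # agrees with Protein Length
```

Under these assumptions all three checks print `True`: 4054972 - 4051874 + 1 = 3099 bp, which is 1033 codons, or 1032 amino acids plus one stop codon.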
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAT GGCTTGGGAA ACTACTGGGC GACCCCAACG CTAAGGTTGT GAAGAAAATG CAGCCAACCT TAGATGAAAT TAACGCCCTT GAGCCGAAAA TGAAGGCTCT TAGCGATGAA CAACTCCGTG AAAAAACGGC TGAATTACGC ACACGGTTCG CAGAGCTAAC CAAAGCGGAT CGTGAAGCGC TCGACGACCG CTACGCCGAC GAAAATCGCC ATGATTCCAC CGTCGAAAAA GACTATCAAA AAGAATTACG GGTGATCGAA GATGCAGCCC TTGATGAATT GTTGCCTGAA GCCTTTGCTT TGGTGCGCGA AGCCTCAAGC CGCGTGATTG GTCAACGTCA CTATGATGTC CAGATGATCG GCGGGATTGT GCTGCACGAA GGCCGCATTG CCGAAATGAA GACTGGCGAA GGTAAGACCT TGGTGGCTTC CTTGCCCTTG TTTCTCAATG CAATTGCTGG CCGTGGCGCA CACTTGATTA CCGTCAACGA CTACCTCGCT AAAGTTGGTG GTGGCTGGAT GGGGCCAATC TTCCATAGCC TTGGCATGAG CACAGGCTAT ATCGCCCACG ATTATTCAGC AATTTACGAC CCCAATTATA TCGACCCCAA CGCCAAACAA GATGATAGCC GTTTGGTGCA CTGGCGGCCT TGCTCACGCC GCGAAGCCTA TATGGCCGAT ATGACCTACG GGACAAATAA TGAATATGGC TTCGATTATC TGCGCGATAA CATGGTGCAG CACAAAGATC AATGTGTGCA GCGCGAATTG CACTATGCGA TTGTTGACGA AGTAGATAAT ATTTTGATCG ACGAAGCCCG TACTCCCTTG ATTATCTCTG GCCCAGCCCA AGAATCCAGT GATAACTATC GCCGTTTTTC TTCGTTGGTA CGTGGCCTCA AGCGCTCCAG CATTTCGCCA GATGAAGTGC GCAAGGGCTT GAAAGACGAT TTTGATGGCG ATTATTGGAT CGACGAAAAA TCGCGCTCGA TTACCTTGAC CGAATCGGGC TTAGAGGTGA TGGAAAAGCG GCTCAATTTG CCTGATGGCG AAAATATGTA CGATGCCAAG AACTTCGAAT TAACTCACTA TTTGGAAAAC GCCCTCAAGG CTGAATATGT TTTCCATCGT GATGTTGATT ATGTGGTGCA AAATGGCGAA GTTGTGATCG TCGATGAATT TACAGGCCGA ACAATGCCGG GTCGGCGTTG GTCTGATGGT TTGCACCAAG CGGTTGAGGC CAAAGAGGCG GTCGAAGTGC GGCGCGAAAA CGTCACCTTG GCCACAATTA CCTTCCAAAA TTACTTCCGA ATGTACAACA AACTTGGTGG TATGACTGGT ACGGCAATCA CCGAAGCTGA AGAATTTAGC AAAATCTACA ATTTGGAAGT AGTGATTATC CCAACCAACC GCCAAGTCGT GCGCGAAGAT TATCGCGACC ATATTTATGC CTCGCAAAAA GCTAAATACA ACGCTGTGTT GCGTGAAATC AAAGAGATGC ACGAAGTTGG CCGCCCAGTC TTGGTCGGTA CAACCTCGGT CGAAAGCTCG GAAATTGTCA GCAATTTGCT GAAGCAAGAG GGGCTTGAAC ACTATCTCTT GAATGCTAAG CAACACGAAC GTGAAGCGTA TATCGTGGCC CAAGCTGGCC GTACTGGCGC AATCACGATT GCAACCAACA TGGCAGGTCG GGGAACTGAC ATTTTGCTTG GTGGAAACCC CGATGGTTTA ATTGAGGAAC ATCTCAAAGC CCTCGGCACA ACCATTACCG ATGCAACGCC TGAGCAACTA 
GCCCAAGCCC AAGCCCAAGC GAAAGCCGAT GTTGAGGCCG AACGTAAAGC TGTGATGGAA GCTGGTGGCT TACACATCAT TGGCACTGAG CGCCACGAAG CTCGTCGGAT CGATAATCAG CTGCGTGGGC GGGCTGGCCG TCAAGGCGAC CCCGGTTCAT CGCGCTTCTT CATTTCGCTT GAAGATGAGT TGATGACACG CTTCGGACGC ATCGATACGA TCAAGCGTTT GATGGAGCGT ATGTCTGATG GCGATGAGGA ATTGCCACTC GAATCGGGCT TACTCGATAA AGCGATCGAA AGTGCCCAAA CTCGGGTCGA AGGCTATAAC TTCGATGTGC GGAAGCATGT GGTCGAATAC GACGACGTGG TAAACAAGCA GCGCGAAGTG ATTTACGCCG ATCGCCATGC CATTCTTGGC GGCGAAGATA TGGGCGACCG CATTCTTGAG ATGGTCGTTG ATGAAATCGA CATCCATGTC GAAGAATTTC TCGATAATCG CGAGCTAGAC AAGCCTGATC TCGAAGGCTT CTTGCGTCAG TTATATTCGA TTGTGCCCCA ACTCAAGGCC CAAGAAACTG AATTGGCAGC CCGCTTCAAA GGCAAACAAG CTGATGAAAT TGGCGAAATT GCCACCGAAG TGGTGGAAGA AGCCTACAAT CGGCTTGGCG AAGAGTTGGC AACCCAATAC ACGACCTTGT TGCAACGCGG AGTCCAGCCA ATTCCTGGGG TCAGCGGCCC AGAAGCCTTC TTTGCCCACT TCGAGCGCCA AGAAATGCTG GGGGCAATCG ACCGTGAATG GATCGATTAT CTGACGGCAG TTGATGAATT GCGCCAAGGC ATCGGCAACG TCGCAATTGC CCAACAAGAT CCATTGGTAG CCTTCAAGCG CGAAGCCTTT AAGATGTTCG ACGAACTCAA AGGCAACATC CAAAACCGGA TTGTCTACAA CTTCTTTACT GATGCGGCCA ACTGGCAAGT GCGTTTGCGC CAAGTTGAGC TTGAAATGGA AGCCCGTTTG GCTCTGGCTC AAACTGCTGG CGGCTCAGAA AACGCCACCG AAGACGCACC CAAGCCAGCC AAACGTGGCG TTGGTGGAGC GGCACGCCGG GTCAGCAATG CCGCAGGCCA AGCTGCACCA GCCCGTCGAA TCGTGATCAA AATCGGACGC AATGATCCTT GCCCATGCGA TAGCGGCAAG AAGTTCAAAG CATGCCACGG CTTGCCGGGC AAAGAAGCCG AACTTGAGGC GATTTTGGCG GTCAAGCATA CCCACGCGCA AGCGGTTGGT AAAAAATAA
|
Protein sequence | MFKWLGKLLG DPNAKVVKKM QPTLDEINAL EPKMKALSDE QLREKTAELR TRFAELTKAD REALDDRYAD ENRHDSTVEK DYQKELRVIE DAALDELLPE AFALVREASS RVIGQRHYDV QMIGGIVLHE GRIAEMKTGE GKTLVASLPL FLNAIAGRGA HLITVNDYLA KVGGGWMGPI FHSLGMSTGY IAHDYSAIYD PNYIDPNAKQ DDSRLVHWRP CSRREAYMAD MTYGTNNEYG FDYLRDNMVQ HKDQCVQREL HYAIVDEVDN ILIDEARTPL IISGPAQESS DNYRRFSSLV RGLKRSSISP DEVRKGLKDD FDGDYWIDEK SRSITLTESG LEVMEKRLNL PDGENMYDAK NFELTHYLEN ALKAEYVFHR DVDYVVQNGE VVIVDEFTGR TMPGRRWSDG LHQAVEAKEA VEVRRENVTL ATITFQNYFR MYNKLGGMTG TAITEAEEFS KIYNLEVVII PTNRQVVRED YRDHIYASQK AKYNAVLREI KEMHEVGRPV LVGTTSVESS EIVSNLLKQE GLEHYLLNAK QHEREAYIVA QAGRTGAITI ATNMAGRGTD ILLGGNPDGL IEEHLKALGT TITDATPEQL AQAQAQAKAD VEAERKAVME AGGLHIIGTE RHEARRIDNQ LRGRAGRQGD PGSSRFFISL EDELMTRFGR IDTIKRLMER MSDGDEELPL ESGLLDKAIE SAQTRVEGYN FDVRKHVVEY DDVVNKQREV IYADRHAILG GEDMGDRILE MVVDEIDIHV EEFLDNRELD KPDLEGFLRQ LYSIVPQLKA QETELAARFK GKQADEIGEI ATEVVEEAYN RLGEELATQY TTLLQRGVQP IPGVSGPEAF FAHFERQEML GAIDREWIDY LTAVDELRQG IGNVAIAQQD PLVAFKREAF KMFDELKGNI QNRIVYNFFT DAANWQVRLR QVELEMEARL ALAQTAGGSE NATEDAPKPA KRGVGGAARR VSNAAGQAAP ARRIVIKIGR NDPCPCDSGK KFKACHGLPG KEAELEAILA VKHTHAQAVG KK
|
| |
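The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons. This is a minimal sketch, not the database's own pipeline: the helper `translate` and the names `BASES`, `AMINO_ACIDS`, and `CODON_TABLE` are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 allows, since this gene starts with ATG.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering of each base.
# Assumption: elongation codons in translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGTTTAAAT GGCTTGGGAA ACTACTGGGC"))  # MFKWLGKLLG
```

The printed residues match the first group of the protein sequence in the record. Passing the full gene sequence to the same function should reproduce the full protein, with the terminal TAA read as the stop codon.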