Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2855 |
Symbol | |
ID | 4070374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3395745 |
End bp | 3398726 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984873 |
Product | protein translocase subunit secA |
Protein accession | YP_591930 |
Protein GI | 94969882 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) |
TIGRFAM ID | [TIGR00963] preprotein translocase, SecA subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTAACA AGGCTATTGC CAAGATTTTC GGCACGCAGA ACGAACGCGA GATCAAGCGC TTGATGCCCA TCGTGGCCCA GATCAACGCG CTTGAGCCGC AGGTGAAGCA GTTCTCCGAC GACCAGCTCC GTGCCAAGAC CGACGAATTC CGCGCCAAGA TCCAGGAGCG CCTCGCCAAG TACGAGGAAG CCGAGCACAA AAACCACGCC CTAAAGGAAG TCCTCGACGA GATCCTTCCT GAGGCCTTCG CCATTTGCCG CGAAGCAGGC TGGCGCGTCC TCAACATGCG CCACTTCGAC GTCCAGTTGA TCGGCGGCAT GGTGCTGCAC TCCGGGCGCA TCTCTGAAAT GAAAACCGGT GAAGGTAAGA CGCTCGTCGC CACCCTTCCC GTGTATCTCA ACGCCCTCTC CGGCCGCGGC GTGCACGTCG TCACCGTGAA TGACTACCTC GCCAAGCGCG ACTCGGAGTG GATGGGAAAG CTCTACAACT TCCTCGGCCT GTCGGTTGGG GTCATCGTGC ACGATCTCGA CGACGACCAG CGCCGCGAAG CCTACCGGGC CGACGTCACC TACGGCACCA ACAACGAGTT CGGCTTCGAC TACCTCCGCG ACAACATGAA GTTCGAGCTC AGCGACTGTG TGCAGCGTGA GTTCAACTTC GCCATCGTCG ACGAAGTCGA CTCCATCTTG ATCGACGAAG CGCGTACCCC GCTCATCATC AGCGGCGCCA GCGAAGAGTC CACCGACAAG TATCAGCGCG TCAACGTCAT CATCCCGCGC CTCGAAAAGG GCGAGGAGAT CGAAGGCAGG GAACCCGGCG ACAAGATTCT CACCGGCGAC TACGTTGTGG ACGAGAAGCA CAAGACCATC ACCGTAAGTG ACGATGGCTG GGAGAAAGTC GAAAAGCTGC TCGGCATCGG CAACATCGCC GACCCCGAAA ACTGGGACCT GAAGCACCAC GTGGAAGTCG CCATCAAAGC CCACGCCCTC TATCACGTCG ACGTGGAGTA CGTGGTGAAG GATGGCGAAG TCCTCATCGT GGATGAGTTC ACCGGACGCC TGATGCCCGG CCGCCGTTGG TCCGATGGCC TGCACCAGGC CGTGGAAGCC AAAGAAGGCG TGAAGGTCGA GCGCGAGAAC CAGACCCTCG CCACCATTAC CTTCCAGAAT TACTTCCGCC TCTACAAGAA GCTCGCCGGC ATGACCGGTA CGGCGGAAAC GGAAGCCGCC GAATTCGACA AGATCTACAA GCTCGAAGTG GTGGTCATTC CGACCAACCG CACTCTGCTG CGCAAAGAAA ATCCCGACGT CGTCTACCGC ACCGAGAAAG AAAAGTTCTT CGCCGTTGCC GACGAGATCG CCAAGCTCTC AGTGAGCCAG CAGCCAGTGC TGGTCGGCAC GGTTTCCATC GAAAAGTCGG AACGTCTCTC CGAGCTGCTG AAGCGCAAGA ACATCAAGCA CGTCGTGCTG AACGCGAAGT TCCACGAGCG CGAAGCCGAA TACGTAGCGC AGGCCGGACG CCTCGGCCAG GTCACGATTG CCACCAACAT GGCCGGCCGC GGTACCGACA TTCTGCTCGG CGGCAACCCC GAGTTCATGG CCAAGCAGGA GACCCTGAAG AAGGGCGTTG CCCAGCCCGT GCACGCTGCC GGCGGCGAAG TCGACGCGCG CCCCGACGAT CCCAACACGG TCTATTGGTA TTACGCCGGC AATGAATACG TCTGCCCGCG CGCGCAGTGG GAAGAAATCC TCGCACACTA CAAGACGCAG ACCGATTTCG AGCACGAGCA GGTGAAGCAG GCTGGTGGCC TCTTTATTCT CGGCACCGAG CGCCACGAAT CACGCCGCAT CGATAACCAG CTTCGTGGAC GCGCCGGCCG ACAGGGCGAT CCCGGTGCAT CGCGCTTCTA TCTCTCGCTC GAAGACGACC TCATGCGCAT CTTCGCCAAG GAGTGGGTCT CGACCCTTCT CCAGCGGCTC GGCATGGAAG AGGGCGTGCC CATCGAGTCG AAGATGATCT CGCGCCGTAT CGAGAAAGCG CAGGAGGCAG TCGAAGCGCA GAACTTCGAA GCCCGTAAAC ACCTCCTCGA ATACGACGAC GTGATGAACA AGCAGCGTAT GGCCGTTTAC GGCCTGCGCC GCCAACTCCT CGAAGGTCTC GACCAGAAAG AGCTCATCAT CGACGAGTAC GTCACCGAGA TCCTCGGCGA CCTGCTCGAC AAATTTGCTC CCACCGAGAA GCACCCCGAA GATTGGGACA TCGCCGGACT CAAGGGCGAG ATCTTCACCC GCTTTGGCGT GGACATCATT GCTGAGGGCG TCGAGCCCGA GAAGCTCAAT CGCATGCAGC TTGGCGATGG GATCTTCGAC AAGCTCAAAG AGCGCTACGA AGCAAAAGAG CAGCTCATCG GCAACGACCA GATGCGTCAC CACGAGCGCG TCATCATGCT CAGCGTGATC GACCAGCTCT GGAAAGACCA CCTGCTCAAC ATGGACCACC TGAAGGAAGG CATCGGCCTG CGTGGCTACG CCCAGCACGA TCCGCTCGTC GAGTACAAGC GCGAGTCCTT CGACATGTTC GAAGGCATGA TGGCCACCTT CAAGGAGCAG ACCGTCCGCT ACCTGTATCT CATGCAGATC ATCGACGCCG CGACCAATAT GCCCGTCGAA ATCCCGCGCC GCCGCGCTCC GGAGAACGTT CGCGAGTTGG GTCCGGTGTT GGAAGCTGAG AATGCTCCCG AGCCGCAGAT CTCCGGCGGC AACGGCCAGC AGCCTCCGCA ACGTCGCCAA CAGACTTCGC TCGACGATCT CGAAAAGCAA TTCGAGCGTA AGAAGAAGCG TGAGCTCGAA CAGGCTCGCA TGGCGGGCGG CGGCATGCCC GACGCCGTCC AGCAGGTAGT CCGCAGCGGC GACAAAATCG GCCGCAACGA CCCTTGCTTC TGTGGCAGCG GCAAGAAATA CAAGAAGTGC CACGGCGCAT AG
|
Protein sequence | MINKAIAKIF GTQNEREIKR LMPIVAQINA LEPQVKQFSD DQLRAKTDEF RAKIQERLAK YEEAEHKNHA LKEVLDEILP EAFAICREAG WRVLNMRHFD VQLIGGMVLH SGRISEMKTG EGKTLVATLP VYLNALSGRG VHVVTVNDYL AKRDSEWMGK LYNFLGLSVG VIVHDLDDDQ RREAYRADVT YGTNNEFGFD YLRDNMKFEL SDCVQREFNF AIVDEVDSIL IDEARTPLII SGASEESTDK YQRVNVIIPR LEKGEEIEGR EPGDKILTGD YVVDEKHKTI TVSDDGWEKV EKLLGIGNIA DPENWDLKHH VEVAIKAHAL YHVDVEYVVK DGEVLIVDEF TGRLMPGRRW SDGLHQAVEA KEGVKVEREN QTLATITFQN YFRLYKKLAG MTGTAETEAA EFDKIYKLEV VVIPTNRTLL RKENPDVVYR TEKEKFFAVA DEIAKLSVSQ QPVLVGTVSI EKSERLSELL KRKNIKHVVL NAKFHEREAE YVAQAGRLGQ VTIATNMAGR GTDILLGGNP EFMAKQETLK KGVAQPVHAA GGEVDARPDD PNTVYWYYAG NEYVCPRAQW EEILAHYKTQ TDFEHEQVKQ AGGLFILGTE RHESRRIDNQ LRGRAGRQGD PGASRFYLSL EDDLMRIFAK EWVSTLLQRL GMEEGVPIES KMISRRIEKA QEAVEAQNFE ARKHLLEYDD VMNKQRMAVY GLRRQLLEGL DQKELIIDEY VTEILGDLLD KFAPTEKHPE DWDIAGLKGE IFTRFGVDII AEGVEPEKLN RMQLGDGIFD KLKERYEAKE QLIGNDQMRH HERVIMLSVI DQLWKDHLLN MDHLKEGIGL RGYAQHDPLV EYKRESFDMF EGMMATFKEQ TVRYLYLMQI IDAATNMPVE IPRRRAPENV RELGPVLEAE NAPEPQISGG NGQQPPQRRQ QTSLDDLEKQ FERKKKRELE QARMAGGGMP DAVQQVVRSG DKIGRNDPCF CGSGKKYKKC HGA
|
| |