Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0767 |
Symbol | |
ID | 3905796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 891816 |
End bp | 894800 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637878100 |
Product | preprotein translocase subunit SecA |
Protein accession | YP_479880 |
Protein GI | 86739480 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) |
TIGRFAM ID | [TIGR00963] preprotein translocase, SecA subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.42226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTACTAG ACAAGATCTT GCGTGCCGGC GAGGGCCGGA TCCTGCGCAA GCTCAAGGCG ATCGCCGAGC AGGTGAACCT GATCGAGGAC GACTTCACCG GCCTGTCCGA CGGTGAACTG CGAGGCATGA CCGACGAGTT CCGCCAGCGG CTCGCGGACG GGAAGGAGAC CCTCGACGAC CTGCTGCCCG AGGCCTTCGC CGCCGTGCGC GAGGCGGCGC GGCGCACGCT GGGCCAGCGG CATTTCGATG TGCAGATCAT GGGGGGCGCG GCCCTTCATC TCGGCAACAT CGCCGAGATG AAGACCGGTG AGGGCAAGAC GCTGGTCTCG ACCCTGCCGA CCTACCTCAA CGCGCTGGCC GGTAAGGGTG TGCACGTCAT CACCGTCAAC GACTACCTCG CCCAGCGCGA CGCCGAGAAC ATGGGCCGGG TCCATCGTTT CCTCGGCCTC ACCGTGGGGG TGATCCATCC GCAGATGCCG CCGCCGGTCC GGCGGGCCCA GTACGCCTGC GACATCACCT ACGGCACCAA CAACGAGTTC GGGTTCGACT ACCTCCGTGA CAACATGGCC TGGAGTTCGG AGGAGCTCGT CCAGCGTGGC CACAACTTCG CGGTCGTCGA CGAGGTGGAC TCCATCCTCA TCGACGAGGC CCGCACGCCG TTGATCATCA GCGGTCCAGC GGATCATCCG ACCAGGTGGT ACACGGAGTT TGCCCGGATC GCCCCGCTGC TCGAACGCGA TGTCGATTAC GAGGTCGAAG AGGGCAAGCG GACGGTGGCC ATCACCGAGT CCGGGGTTGA GAAGGTCGAG GACCAGCTCG GCATCGAGAA CCTCTACGAA TCGGTGAATA CCCCGCTCGT GGGCTACCTG AACAATTCGC TGAAGGCCAA GGAGCTCTAC AAGCGGGACA AGGACTACAT CGTTACCGAC GGTGAGGTTC TCATCGTCGA CGAGTTCACC GGCCGCGTGC TCCACGGTCG TCGCTACAGC GAGGGAATGC ACCAGGCGAT CGAGGCCAAG GAAAAGGTCG AGATCAAGCA GGAGAACCAG ACCCTCGCGA CGATCACGCT GCAGAACTAC TTCCGGCTCT ACGACAAGCT CTCCGGCATG ACCGGTACCG CCATGACCGA GGCGGCCGAG TTCCACCAGA TCTACTCGCT CGGGGTCGTC CCCATCCCGA CGAACAAGCC GATGGTCCGG CTCGACCAGC CGGACGTCGT CTACAAGACC GAGATCGCGA AGTTCGACGC CGTGGTGGAG GACATCGCCG AGCGGCACGA GAAGGGCCAA CCGGTCCTGG TCGGCACCAC CAGCGTCGAG AAGTCCGAGT ACCTCTCGAA GCAGCTTCGC AAGCGTGGTG TGCCGCACGA GGTGCTCAAC GCCAAGCACC ACGAGCGGGA GGCGGCCATC ATCGCCGAGG CGGGCCGCAA GGGCGCCGTC ACGGTGGCGA CGAACATGGC CGGTCGTGGT ACGGACATCA TGCTCGGCGG TAACCCGGAG TTCATTGCCC AGGCCGAGCT GCGCCAGCGC GGCCTCTCGC CGATCGAGAC CCCCGAGGAC TATGAGGCGG CCTGGCAGGA GGCCCTGGAG AAGGCCAGGC AGTCGGTGAA GGCCGAGCAC GAGGAGGTCG TCGACGCCGG CGGCCTGTAC GTGCTCGGCA CCGAGCGGCA CGAGTCCCGG CGCATCGACA ACCAGCTGCG TGGCCGGGCC GGCCGGCAGG GCGACCGCGG TGAGTCGCGC TTCTACCTCT CCCTCGGTGA CGATCTCATG CGGTTGTTCA ACGCGGCCGC GGTCGAGGGC ATCATGGATC GGCTGAACAT CCCCGAGGAC GTCCCGATCG AATCGAAGAT CGTGACTCGG GCGATCCGGT CGGCCCAGAC CCAGGTCGAG GGGCAGAACT TCGAGATCCG CAAGAACGTC CTCAAGTACG ACGAGGTCAT GAACAAGCAG CGCACCGTGA TCTATGAGGA GCGCCGCAAG GTTCTCGGCG GTGCCGATCT CCACGAGCAG GTGCGTCACT TCGTTGACGA CACCGTCGAG GGATACGTGC GCGGCGCCAC CGCCGACGGG TACCCGGAGG AGTGGGATCT CGACACGCTC TGGACGGCGC TCGGGCAGCT CTACCCGGTC GGTGTGGTGG CACCCGATGT CGATGATCGA GACGGGCTCA CTGCCGATCA CCTGCTCGAG GACATCCAGG TCGACGCGCA GGAGGCGTAC GACCGGCGGG AACTCGACCT CGGCGACGGC CCCGACAGCG AACCGATCAT GCGGGAGCTG GAGCGACGGG TCGTCCTCGC GGTCTTGGAC CGCAAGTGGC GCGAGCACCT CTACGAGATG GACTACCTGC AGGAGGGCAT CGGGCTGCGG GCGATGGGAC AGCGGGACCC GCTGGTCGAG TACCAGCGTG AGGGTTTCGA CATGTTCCAG ACGATGATGG AGGGCATCAA GGAGGAGTCC GTCCGGCTGC TGTTCAACGT CGAGGTCCAG GTTGCGGGGC AGGAGGAGGC CGCCACGTCG GTGGGCGTCG AGCCGGCCGT GTCCGCTGCT CCCGCACCGC CGGCCGCAGC CGCGACCCTG CCCGCTCCGG CGGTGCCGAC GATTCCGGAC GGCGCCGGTC CCGTCGCGGA CGCGCAGCCC GTTCGCCCCG CGGCGGCCCG TCAGACTCCG CCACCCCCTT CACCGGTTCC GTCCGCACCG CTGCCGGTCT TCGTCAAGGG GCTCGAGCCG CGGCGGCCGA CCGGTGGCCT GCGCTACACC GCGCCGTCGG TCGACGGTGG ATCCGGGCCG GTCACGACGG TGGATGGCAG GTCGGGACTG GGCCGCCCGG CTGGAGACGG TGCGCTCAGC GCCGCCCGCG GCGAGGCCGG CACGGCGCAG CCCGGTGCGG GCACGCGTCC CGCTCGCAAT GCGCCCTGCC CGTGTGGGTC GGGCCGCAAG TACAAGCGCT GCCACGGCGA CCCGGCGCGC CGCAACACCG AGTGA
|
Protein sequence | MVLDKILRAG EGRILRKLKA IAEQVNLIED DFTGLSDGEL RGMTDEFRQR LADGKETLDD LLPEAFAAVR EAARRTLGQR HFDVQIMGGA ALHLGNIAEM KTGEGKTLVS TLPTYLNALA GKGVHVITVN DYLAQRDAEN MGRVHRFLGL TVGVIHPQMP PPVRRAQYAC DITYGTNNEF GFDYLRDNMA WSSEELVQRG HNFAVVDEVD SILIDEARTP LIISGPADHP TRWYTEFARI APLLERDVDY EVEEGKRTVA ITESGVEKVE DQLGIENLYE SVNTPLVGYL NNSLKAKELY KRDKDYIVTD GEVLIVDEFT GRVLHGRRYS EGMHQAIEAK EKVEIKQENQ TLATITLQNY FRLYDKLSGM TGTAMTEAAE FHQIYSLGVV PIPTNKPMVR LDQPDVVYKT EIAKFDAVVE DIAERHEKGQ PVLVGTTSVE KSEYLSKQLR KRGVPHEVLN AKHHEREAAI IAEAGRKGAV TVATNMAGRG TDIMLGGNPE FIAQAELRQR GLSPIETPED YEAAWQEALE KARQSVKAEH EEVVDAGGLY VLGTERHESR RIDNQLRGRA GRQGDRGESR FYLSLGDDLM RLFNAAAVEG IMDRLNIPED VPIESKIVTR AIRSAQTQVE GQNFEIRKNV LKYDEVMNKQ RTVIYEERRK VLGGADLHEQ VRHFVDDTVE GYVRGATADG YPEEWDLDTL WTALGQLYPV GVVAPDVDDR DGLTADHLLE DIQVDAQEAY DRRELDLGDG PDSEPIMREL ERRVVLAVLD RKWREHLYEM DYLQEGIGLR AMGQRDPLVE YQREGFDMFQ TMMEGIKEES VRLLFNVEVQ VAGQEEAATS VGVEPAVSAA PAPPAAAATL PAPAVPTIPD GAGPVADAQP VRPAAARQTP PPPSPVPSAP LPVFVKGLEP RRPTGGLRYT APSVDGGSGP VTTVDGRSGL GRPAGDGALS AARGEAGTAQ PGAGTRPARN APCPCGSGRK YKRCHGDPAR RNTE
|
| |