Gene Francci3_0767 details


Gene Information

Locus tag: Francci3_0767
Symbol:
ID: 3905796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 891816
End bp: 894800
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 68%
IMG OID: 637878100
Product: preprotein translocase subunit SecA
Protein accession: YP_479880
Protein GI: 86739480
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase)
TIGRFAM ID: [TIGR00963] preprotein translocase, SecA subunit
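
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 891816
end_bp = 894800

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2985
print(protein_length)  # 994
```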


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.42226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTACTAG ACAAGATCTT GCGTGCCGGC GAGGGCCGGA TCCTGCGCAA GCTCAAGGCG 
ATCGCCGAGC AGGTGAACCT GATCGAGGAC GACTTCACCG GCCTGTCCGA CGGTGAACTG
CGAGGCATGA CCGACGAGTT CCGCCAGCGG CTCGCGGACG GGAAGGAGAC CCTCGACGAC
CTGCTGCCCG AGGCCTTCGC CGCCGTGCGC GAGGCGGCGC GGCGCACGCT GGGCCAGCGG
CATTTCGATG TGCAGATCAT GGGGGGCGCG GCCCTTCATC TCGGCAACAT CGCCGAGATG
AAGACCGGTG AGGGCAAGAC GCTGGTCTCG ACCCTGCCGA CCTACCTCAA CGCGCTGGCC
GGTAAGGGTG TGCACGTCAT CACCGTCAAC GACTACCTCG CCCAGCGCGA CGCCGAGAAC
ATGGGCCGGG TCCATCGTTT CCTCGGCCTC ACCGTGGGGG TGATCCATCC GCAGATGCCG
CCGCCGGTCC GGCGGGCCCA GTACGCCTGC GACATCACCT ACGGCACCAA CAACGAGTTC
GGGTTCGACT ACCTCCGTGA CAACATGGCC TGGAGTTCGG AGGAGCTCGT CCAGCGTGGC
CACAACTTCG CGGTCGTCGA CGAGGTGGAC TCCATCCTCA TCGACGAGGC CCGCACGCCG
TTGATCATCA GCGGTCCAGC GGATCATCCG ACCAGGTGGT ACACGGAGTT TGCCCGGATC
GCCCCGCTGC TCGAACGCGA TGTCGATTAC GAGGTCGAAG AGGGCAAGCG GACGGTGGCC
ATCACCGAGT CCGGGGTTGA GAAGGTCGAG GACCAGCTCG GCATCGAGAA CCTCTACGAA
TCGGTGAATA CCCCGCTCGT GGGCTACCTG AACAATTCGC TGAAGGCCAA GGAGCTCTAC
AAGCGGGACA AGGACTACAT CGTTACCGAC GGTGAGGTTC TCATCGTCGA CGAGTTCACC
GGCCGCGTGC TCCACGGTCG TCGCTACAGC GAGGGAATGC ACCAGGCGAT CGAGGCCAAG
GAAAAGGTCG AGATCAAGCA GGAGAACCAG ACCCTCGCGA CGATCACGCT GCAGAACTAC
TTCCGGCTCT ACGACAAGCT CTCCGGCATG ACCGGTACCG CCATGACCGA GGCGGCCGAG
TTCCACCAGA TCTACTCGCT CGGGGTCGTC CCCATCCCGA CGAACAAGCC GATGGTCCGG
CTCGACCAGC CGGACGTCGT CTACAAGACC GAGATCGCGA AGTTCGACGC CGTGGTGGAG
GACATCGCCG AGCGGCACGA GAAGGGCCAA CCGGTCCTGG TCGGCACCAC CAGCGTCGAG
AAGTCCGAGT ACCTCTCGAA GCAGCTTCGC AAGCGTGGTG TGCCGCACGA GGTGCTCAAC
GCCAAGCACC ACGAGCGGGA GGCGGCCATC ATCGCCGAGG CGGGCCGCAA GGGCGCCGTC
ACGGTGGCGA CGAACATGGC CGGTCGTGGT ACGGACATCA TGCTCGGCGG TAACCCGGAG
TTCATTGCCC AGGCCGAGCT GCGCCAGCGC GGCCTCTCGC CGATCGAGAC CCCCGAGGAC
TATGAGGCGG CCTGGCAGGA GGCCCTGGAG AAGGCCAGGC AGTCGGTGAA GGCCGAGCAC
GAGGAGGTCG TCGACGCCGG CGGCCTGTAC GTGCTCGGCA CCGAGCGGCA CGAGTCCCGG
CGCATCGACA ACCAGCTGCG TGGCCGGGCC GGCCGGCAGG GCGACCGCGG TGAGTCGCGC
TTCTACCTCT CCCTCGGTGA CGATCTCATG CGGTTGTTCA ACGCGGCCGC GGTCGAGGGC
ATCATGGATC GGCTGAACAT CCCCGAGGAC GTCCCGATCG AATCGAAGAT CGTGACTCGG
GCGATCCGGT CGGCCCAGAC CCAGGTCGAG GGGCAGAACT TCGAGATCCG CAAGAACGTC
CTCAAGTACG ACGAGGTCAT GAACAAGCAG CGCACCGTGA TCTATGAGGA GCGCCGCAAG
GTTCTCGGCG GTGCCGATCT CCACGAGCAG GTGCGTCACT TCGTTGACGA CACCGTCGAG
GGATACGTGC GCGGCGCCAC CGCCGACGGG TACCCGGAGG AGTGGGATCT CGACACGCTC
TGGACGGCGC TCGGGCAGCT CTACCCGGTC GGTGTGGTGG CACCCGATGT CGATGATCGA
GACGGGCTCA CTGCCGATCA CCTGCTCGAG GACATCCAGG TCGACGCGCA GGAGGCGTAC
GACCGGCGGG AACTCGACCT CGGCGACGGC CCCGACAGCG AACCGATCAT GCGGGAGCTG
GAGCGACGGG TCGTCCTCGC GGTCTTGGAC CGCAAGTGGC GCGAGCACCT CTACGAGATG
GACTACCTGC AGGAGGGCAT CGGGCTGCGG GCGATGGGAC AGCGGGACCC GCTGGTCGAG
TACCAGCGTG AGGGTTTCGA CATGTTCCAG ACGATGATGG AGGGCATCAA GGAGGAGTCC
GTCCGGCTGC TGTTCAACGT CGAGGTCCAG GTTGCGGGGC AGGAGGAGGC CGCCACGTCG
GTGGGCGTCG AGCCGGCCGT GTCCGCTGCT CCCGCACCGC CGGCCGCAGC CGCGACCCTG
CCCGCTCCGG CGGTGCCGAC GATTCCGGAC GGCGCCGGTC CCGTCGCGGA CGCGCAGCCC
GTTCGCCCCG CGGCGGCCCG TCAGACTCCG CCACCCCCTT CACCGGTTCC GTCCGCACCG
CTGCCGGTCT TCGTCAAGGG GCTCGAGCCG CGGCGGCCGA CCGGTGGCCT GCGCTACACC
GCGCCGTCGG TCGACGGTGG ATCCGGGCCG GTCACGACGG TGGATGGCAG GTCGGGACTG
GGCCGCCCGG CTGGAGACGG TGCGCTCAGC GCCGCCCGCG GCGAGGCCGG CACGGCGCAG
CCCGGTGCGG GCACGCGTCC CGCTCGCAAT GCGCCCTGCC CGTGTGGGTC GGGCCGCAAG
TACAAGCGCT GCCACGGCGA CCCGGCGCGC CGCAACACCG AGTGA
 
Protein sequence
MVLDKILRAG EGRILRKLKA IAEQVNLIED DFTGLSDGEL RGMTDEFRQR LADGKETLDD 
LLPEAFAAVR EAARRTLGQR HFDVQIMGGA ALHLGNIAEM KTGEGKTLVS TLPTYLNALA
GKGVHVITVN DYLAQRDAEN MGRVHRFLGL TVGVIHPQMP PPVRRAQYAC DITYGTNNEF
GFDYLRDNMA WSSEELVQRG HNFAVVDEVD SILIDEARTP LIISGPADHP TRWYTEFARI
APLLERDVDY EVEEGKRTVA ITESGVEKVE DQLGIENLYE SVNTPLVGYL NNSLKAKELY
KRDKDYIVTD GEVLIVDEFT GRVLHGRRYS EGMHQAIEAK EKVEIKQENQ TLATITLQNY
FRLYDKLSGM TGTAMTEAAE FHQIYSLGVV PIPTNKPMVR LDQPDVVYKT EIAKFDAVVE
DIAERHEKGQ PVLVGTTSVE KSEYLSKQLR KRGVPHEVLN AKHHEREAAI IAEAGRKGAV
TVATNMAGRG TDIMLGGNPE FIAQAELRQR GLSPIETPED YEAAWQEALE KARQSVKAEH
EEVVDAGGLY VLGTERHESR RIDNQLRGRA GRQGDRGESR FYLSLGDDLM RLFNAAAVEG
IMDRLNIPED VPIESKIVTR AIRSAQTQVE GQNFEIRKNV LKYDEVMNKQ RTVIYEERRK
VLGGADLHEQ VRHFVDDTVE GYVRGATADG YPEEWDLDTL WTALGQLYPV GVVAPDVDDR
DGLTADHLLE DIQVDAQEAY DRRELDLGDG PDSEPIMREL ERRVVLAVLD RKWREHLYEM
DYLQEGIGLR AMGQRDPLVE YQREGFDMFQ TMMEGIKEES VRLLFNVEVQ VAGQEEAATS
VGVEPAVSAA PAPPAAAATL PAPAVPTIPD GAGPVADAQP VRPAAARQTP PPPSPVPSAP
LPVFVKGLEP RRPTGGLRYT APSVDGGSGP VTTVDGRSGL GRPAGDGALS AARGEAGTAQ
PGAGTRPARN APCPCGSGRK YKRCHGDPAR RNTE
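
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial code named in the Gene Information fields), GTG is one of the permitted start codons, and an initiating codon is read as methionine. The sketch below translates only the first 30 and last 45 bases of the gene sequence and compares them with the ends of the protein sequence. It is a simplified illustration, not the database's own pipeline: the helper names `clean` and `translate` are hypothetical, and the function assumes the first codon it is given is a valid initiator whenever `is_gene_start` is true.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each position.
# Translation table 11 uses these same assignments during elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(dna, is_gene_start=True):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # Simplification: treat the first codon as an initiator and read it as Met.
    if is_gene_start and residues:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases and last 45 bases of the gene sequence listed above.
head = clean("GTGGTACTAG ACAAGATCTT GCGTGCCGGC")
tail = clean("TACAAGCGCT GCCACGGCGA CCCGGCGCGC CGCAACACCG AGTGA")

print(translate(head))                       # MVLDKILRAG
print(translate(tail, is_gene_start=False))  # YKRCHGDPARRNTE
```

Both results match the first and last residues of the protein sequence, and the final TGA codon accounts for the gene being one codon longer than the protein.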