Gene Franean1_5838 details


Gene Information

Locus tag: Franean1_5838
Symbol:
ID: 5674161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7081047
End bp: 7084103
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 68%
IMG OID: 641244688
Product: preprotein translocase subunit SecA
Protein accession: YP_001510090
Protein GI: 158317582
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase)
TIGRFAM ID: [TIGR00963] preprotein translocase, SecA subunit
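
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(7081047, 7084103)
print(length)                     # 3057
print(protein_length_aa(length))  # 1018
```

Both results match the "Gene Length" and "Protein Length" fields of the record.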


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.198512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.737923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTGGCG ATCGAGGGGC TAGCATCTTG GTTGTCGGTG AGCCTCGCCG ACATGCCTGC 
CGGAGGAGTT TCGCCGTGGT TCTAGACAAG ATCTTGCGTG CCGGTGAGGG CCGGATTCTG
CGCAAGCTCA AGGCGATCGC CGAGCAGGTG AACCTCATCG AGGACGACTT CACCGGTCTG
ACCGACGCCG AGCTGCGCGG GATGACCGAC GAGTTCCGCC AGCGCCTGGC CAGCGAAGAG
GAAACCCTCG ACTCGTTGCT GCCCGAGGCG TTCGCGACGG TGCGCGAGGC GGCGCGGCGC
ACGCTCGGCC AGCGCCATTT CGACGTCCAG ATCATGGGTG GCGCGGCGCT GCACCTGGGC
AACATCGCCG AGATGAAGAC CGGTGAGGGC AAGACCCTGG TCTCCACGCT GCCGTCCTAC
CTGAACGCCC TGAGCGGCAA CGGCGTCCAC ATCGTCACGG TGAACGACTA CCTGGCCCAG
CGCGACGCCG AGAACATGGG CCGTGTCCAC CGCTTCCTCG GCCTCACGGT CGGTGTCATC
CATCCCCAGA TGCCGCCGTC GGTGCGCCGC CAGCAGTACC GGTGCGACAT CACCTACGGC
ACGAACAACG AGTTCGGGTT CGACTATCTG CGCGACAACA TGTCCTGGAG CGCCGAGGAA
CTGGTCCAGC GCGGCCATCA CTTCGCCGTC GTGGACGAGG TCGACTCCAT CCTCATCGAC
GAGGCCCGCA CGCCCCTGAT CATCAGCGGC CCGGCCGACA ACCCGACCCG CTGGTACACC
GAGTTCTCGC GGATCGCCCC CCTGCTCGAG CGGGACGTCG ACTACGAGGT CGAGGAGGGC
AAGCGGACGG TCTCGATCAG CGAGGTCGGC GTAGAGAAGG TCGAGGACCA GCTCGGCATC
GAGAACCTGT ACGAGTCGGT GAACACCCCC CTCGTCGGTT ACCTCAACAA CGCGCTGAAG
GCCAAGGAGC TCTACAAGCG GGACAAGGAC TATATCGTCA CCGACGGTGA GGTCCTCATT
GTCGACGAGT TCACCGGTCG CGTCCTGCAC GGCCGCCGTT ACAGCGAGGG AATGCACCAG
GCGATCGAGG CCAAGGAGAA GGTCGAGATC AAGCAGGAGA ACCAGACCCT GGCGACGATC
ACTCTCCAGA ACTACTTCCG GCTCTACGAC AAGCTGTCCG GAATGACCGG TACGGCCATG
ACGGAGGCCG CCGAGTTCCA CCAGATCTAC GCCCTCGGCG TGGTGCCGAT CCCGACCAAC
AAGCCGATGG CTCGCACGGA CCAGGCCGAC GTCGTCTACA AGACCGAGAT CGCGAAGTTC
GACGCGGTCG TGGAGGACAT CGCCGAGCGG CACGAGAACG GTCAGCCCGT CCTGGTCGGC
ACGACCAGCG TGGAGAAGTC GGAGTACCTG TCCAAGCAGC TCGCCAAGCG CGGTGTCCGG
CACGAGGTGC TGAACGCCAA ACACCACGAG CGCGAGGCGA TGATCATCGG CGAGGCGGGC
CGGCGCGGCG CGGTCACCGT GGCGACGAAC ATGGCCGGCC GAGGCACCGA CATCATGCTC
GGCGGCAACC CCGAGTTCAT CGCCCAGACG GAGCTTCGCC AGCGCGGCCT GTCTCCGATC
GACACCCCGG ACGACTACGA GGCGGCCTGG CCCGAGGCGC TGGAGAAGGC CCGTGCGTCG
GTCAAGGCCG AGCACGAGGA AGTCGTCAAC GCCGGTGGCC TCTATGTGCT CGGCACCGAG
CGGCACGAGT CCCGCCGGAT CGACAACCAG CTGCGCGGCC GTGCCGGCCG TCAGGGCGAC
CGCGGCGAGT CCCGGTTCTA CCTCTCCCTC GGTGACGACC TCATGCGGTT GTTCAACGCC
GCGGCGGTCG AGGGCATCAT GGACCGCCTC AACATCCCGG ACGACGTCCC GATCGAGTCC
AAGATCGTGA CTCGCGCGAT CCGTTCCGCG CAGACCCAGG TCGAGGGCCA GAACTTCGAG
ATCCGCAAGA ACGTCCTCAA GTACGACGAA GTCATGAACA AGCAGCGCAC CGTGATCTAC
GAGGAGCGCC GCAAGGTCCT GGAGGGCGCC GACCTGCACG AGCAGGTTCG CCACTTCGTT
GATGACACCG TCGAGGGCTA TGTGCGCGGC GCGACGGCCG ACGGGTACCC GGAGGAGTGG
GACCTCGAAA CCCTCTGGTC GGGCCTGGGC CTGCTCTACC CGGTCGGCGT CGACGCACCG
GGCACCGACG ACCGCGAGGG GCTGACCTCC GACCTCCTGC TCGAGGATCT TCAGGCGGAC
GCGCAGGACG CCTACGACCG GCGCGAGGCC GACCTCGGCG ACAAGCCGGA CGGCGAGGCC
GTCATGCGTG AGCTGGAGCG CCGGGTGGTG CTCGCGGTAC TCGACCGCAA GTGGCGTGAG
CACCTCTACG AGATGGACTA CCTGCAGGAG GGCATCGGCC TGCGGGCGAT GGGGCAGCGA
GACCCGGTCG TGGAGTACCA GCGCGAGGGC TTCGACATGT TCCAGACGAT GATGGAGGGC
ATCAAGGAGG AGTCGGTCCG CCTCCTGTTC AACGTCGAGG TGCAGGTAGC GGGGCGGGAC
ACCGACCAGA GCGGCACCGC GCCCGAAGGA GCGCCCGCAG CCCCGGCCGG CACGACGCCC
ACGCCGGCCA CCCCGGCCGA GCGCGGCTCC GTGTCCGTCG GCGCGGTGCC TCTGGTCACC
CCGGCCGCGC CCGCACCGGC GCAGCCCGCG CGGACCGCTC CCGAGCCTCC GCCGGCCCCG
GTGCCGGCGT CCCCGCCGCC CGTCTTCGTC AAGGGCCTCG AACCGCACCG CCCGACCGGT
GGACTGCGTT ACACGGCGCC GTCTGTCGAC GGCGGCTCGT CCACGGTGAC GACCGTCGAC
AACGGGTCCG AGCTGTCCCG TGCCGGCGGT GGTGGGCCGC GCTCGGCCGG CGGCGGCGTG
TCCCGCGGCG CTGGCGAGGG CGGAACGGCC CGCCCGGCAC GCAACGCGCC CTGCCCCTGC
GGCTCGGGGC GGAAGTACAA GCGCTGCCAC GGCGATCCGG CCCGCCGCAA CGACTGA
 
Protein sequence
MGGDRGASIL VVGEPRRHAC RRSFAVVLDK ILRAGEGRIL RKLKAIAEQV NLIEDDFTGL 
TDAELRGMTD EFRQRLASEE ETLDSLLPEA FATVREAARR TLGQRHFDVQ IMGGAALHLG
NIAEMKTGEG KTLVSTLPSY LNALSGNGVH IVTVNDYLAQ RDAENMGRVH RFLGLTVGVI
HPQMPPSVRR QQYRCDITYG TNNEFGFDYL RDNMSWSAEE LVQRGHHFAV VDEVDSILID
EARTPLIISG PADNPTRWYT EFSRIAPLLE RDVDYEVEEG KRTVSISEVG VEKVEDQLGI
ENLYESVNTP LVGYLNNALK AKELYKRDKD YIVTDGEVLI VDEFTGRVLH GRRYSEGMHQ
AIEAKEKVEI KQENQTLATI TLQNYFRLYD KLSGMTGTAM TEAAEFHQIY ALGVVPIPTN
KPMARTDQAD VVYKTEIAKF DAVVEDIAER HENGQPVLVG TTSVEKSEYL SKQLAKRGVR
HEVLNAKHHE REAMIIGEAG RRGAVTVATN MAGRGTDIML GGNPEFIAQT ELRQRGLSPI
DTPDDYEAAW PEALEKARAS VKAEHEEVVN AGGLYVLGTE RHESRRIDNQ LRGRAGRQGD
RGESRFYLSL GDDLMRLFNA AAVEGIMDRL NIPDDVPIES KIVTRAIRSA QTQVEGQNFE
IRKNVLKYDE VMNKQRTVIY EERRKVLEGA DLHEQVRHFV DDTVEGYVRG ATADGYPEEW
DLETLWSGLG LLYPVGVDAP GTDDREGLTS DLLLEDLQAD AQDAYDRREA DLGDKPDGEA
VMRELERRVV LAVLDRKWRE HLYEMDYLQE GIGLRAMGQR DPVVEYQREG FDMFQTMMEG
IKEESVRLLF NVEVQVAGRD TDQSGTAPEG APAAPAGTTP TPATPAERGS VSVGAVPLVT
PAAPAPAQPA RTAPEPPPAP VPASPPPVFV KGLEPHRPTG GLRYTAPSVD GGSSTVTTVD
NGSELSRAGG GGPRSAGGGV SRGAGEGGTA RPARNAPCPC GSGRKYKRCH GDPARRND
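
The sequences in this record are printed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The Python sketch below shows one way to strip the block formatting and compute a GC percentage for a nucleotide string. The helper names `clean_sequence` and `gc_percent` are hypothetical. The "GC content" field of the record is rounded to a whole percent, and this sketch does not claim to reproduce the database's exact method.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example input: the first two blocks of the gene sequence above.
first_blocks = "GTGGGTGGCG ATCGAGGGGC"
print(clean_sequence(first_blocks))
print(round(gc_percent(first_blocks), 1))
```

To apply this to the whole gene, paste the full block-formatted text of the "Gene sequence" field into a string and pass it to `gc_percent`. The length of the cleaned string can also be compared with the "Gene Length" field as a sanity check.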