Gene Franean1_2745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2745 
Symbol 
ID5671136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3248284 
End bp3250809 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content74% 
IMG OID641241657 
ProductType IV secretory pathway VirD4 protein-like protein 
Protein accessionYP_001507077 
Protein GI158314569 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0325821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATCG TGACCATGGA CCTGCTCGCC GCGGCGGCCA CCACGCCGTC GTCGCCACTC 
ACGACCTACC TGACCGACCC CGCCGGCTTC CTCCACCAGC TGCTCGGCCA CCTACGTGCC
TGGGCCGTGG TCTGGGGACC CGTCGCCGGC CCGCTGGTCG CTCTCACCGC CGCCGGCCTG
CTCACCCTGC GCCGGCGGCT ACGCCGCCGC TACCAGCAAC GACTCACCGC CGGCGCCCGC
CTCGTGACCG TCCTGGCCCC GCCCACCGTC GACCCGGCGG GCGCGGGCGC GCTGTGGGCG
AACCTGCTCG GCCTGCTGCG CCCGTCCTGG CGACGGCTGG TCGGCCAGCC GCACCTCGTG
TGGGAGTACC TGTTCGACGC CGACGGGGTC CGCATCCAAA TCTGGGTCCC CGGTGTGGTG
CCCGAGGGCT TCGTGGAGAG GGCCGTCGAG GCGGCCTGGC CCGGTGCCCA CACCCGCACC
ACCCCTGCCC GAGCGCCGCT GCCCGTCCTG GCCCGGCCCG GCCGGCGGCT GCTGGCCGCA
GGCGGGGAAC TCCGCCTGGG CCGGCCGGAA GCGCTCCCGA TCCGTACGGA TCATGACGTT
GACCCGGTCC GTGCACTGCT CGCCGCGCCC GGCGGGCTGG CGCGCACCCA GCGGGCGGTC
GTGCAGATCC TGGCCCGGCC GGTCACCGGC CGCCGCGTCG CCAAAGCCCG CCGGTCAGCC
CGCCGCGTGC GTGCTGGCGG CTCGGCCACC CTGCTCGGCG GGCTGCTCGA CCTGCTCACC
CCCCACACGG GCCGAACCCG GCGCACTCGG CGGACCCCCG CACCGACGAA GGTCGATCAT
CAGACCTCGC TCGCGCTGTC GGCGGAGGAC CGCGCGATCG TCACGAAGGG TCGCGGCGCC
CAGTTCGAGG TCCGTGTCCG CTACACCGTC GTCGCCGTTC TCGACGACAC CGCTGACGAG
GACACCGCCG CGCGGGTCGG CGGGCAGCTG CGCGGCCGGG CACACGCGAT CGCCTCGGCG
TTCGCGGCCT ACGGCGAGCA CAACTACTTC CAACGCGCCC GGGTGCGCCG CCCGCTGCCC
GTCCTCGCCG CGCGCCACCT CGGCCGCGGG GACCTGCTGT CCGTCGCCGA GGTCGGCGTG
CTCGCCCACC TGCCGGTCGA CGAGGCGACT CCCGGCCTGC AACGCGCCGG CGCGAAAGCC
GTCGCCCCGC CACCCGGCGT CGCTGGCGCC GCGCCGAATG TCCGTCCACT CGGCCGTTCC
GACGGTGGAC ATGCCCGCCC GGTCGGTCTG CGGGTTCCCG ACGCCCGCCA CCACCTACAC
ATCCTCGGCG CCACCGGCTC CGGCAAGTCC GAACTGCTCG CCCGCATGAC CTTGGACGAC
GTCGCCGCGC GCCGGGGCGT GGTCAACGTC GACCCGAAGG GCGACCAGGT CATCGACATC
CTTGCCCGCT ACCCCCTCGA CGCCGTCGAC CGCCTCGTCC TGTTCGACGC CGAATCGTCG
GGCCGGCCGC CGTGCCTCAA CCCGCTTGAC CAGCCCGACC GGACACGCGC CGTCGACAAC
CTGGTCTCCA TCTTCTCCCG GGTCTACCAC GAGTCCTGGG GCCCACGGAC CGACGACATC
TTCCGCGCCG GACTGCTCAC CCTCGCCGCC CAACCCGAGG TCCCTGTCCT GACCCAGCTA
CCCCGGCTCC TGACCGACGG CGCCTACCGG CAGCGCCTCG TCGGCGAGAT CAAAAAAGGC
GACGGCAACG ACATCCTGGC CGGCTTCTGG CAGTGGTACG AAGCACTCTC CGAACCCGCG
CAGGCGCATG CCGTCGCCCC GCTGATGAAC AAACTCCGCG GGTTCCTACT ACGGCCGTTC
GTGCGCGCCG CGATCGCCGC CGGCCCCTCG ACGGTGGACA TGGACACCGT GCTCAACGAC
GGCGGGGTCT GCCTGGTCCG CATCGCCCAA GACGCCCTCG GGGTCGAGAC CGCCGCGCTC
ATGGGCTCCA TCGTCGTGTC CGCCGTCTGG CAGGCCACCA CCCGGCGCGC CCGCATGCCC
CAGGGAAAGC GACCCGACGC CAGCCTGTAC TTGGACGAAG CACACAACTT CCTCACGCTT
CCGTACGCGC TGGAAGACAT GCTCGCCGCC GCCCGCGGCT ACCGACTCGG GATCACTCTC
GCGCACCAGA ACCTCGCCCA GCTGCCCCGG CACCTGGAAG AAAGCATCGC CGCGAACGCC
CGCAGCAAGA TCTACTTCAC GATGTCGCCC GCGGACGCGA AACGGCTCGT CCGCCACGTC
GAGCCTCGCC TGTCTGAGCA CGACCTGGCC AACCTCGGCC GCTTCCATGC CGCCACCCGC
CTCGTCGTCG TCGGCGAAGA GGCACCGGCG TTCACGCTGC GCACCGAGAA GCTCCCCGCC
CCGGTACCGG GCCGCGCCGC GCAGATCCGC CGCGAGCTGC GCCGCCGCGC GCCCACCCCC
ACACCGACGC CGCCGGACCC GGCGGCCCCG CAGCCGCAGG CCGACCCCCG CCGATCCGCC
CGCTGA
 
Protein sequence
MEIVTMDLLA AAATTPSSPL TTYLTDPAGF LHQLLGHLRA WAVVWGPVAG PLVALTAAGL 
LTLRRRLRRR YQQRLTAGAR LVTVLAPPTV DPAGAGALWA NLLGLLRPSW RRLVGQPHLV
WEYLFDADGV RIQIWVPGVV PEGFVERAVE AAWPGAHTRT TPARAPLPVL ARPGRRLLAA
GGELRLGRPE ALPIRTDHDV DPVRALLAAP GGLARTQRAV VQILARPVTG RRVAKARRSA
RRVRAGGSAT LLGGLLDLLT PHTGRTRRTR RTPAPTKVDH QTSLALSAED RAIVTKGRGA
QFEVRVRYTV VAVLDDTADE DTAARVGGQL RGRAHAIASA FAAYGEHNYF QRARVRRPLP
VLAARHLGRG DLLSVAEVGV LAHLPVDEAT PGLQRAGAKA VAPPPGVAGA APNVRPLGRS
DGGHARPVGL RVPDARHHLH ILGATGSGKS ELLARMTLDD VAARRGVVNV DPKGDQVIDI
LARYPLDAVD RLVLFDAESS GRPPCLNPLD QPDRTRAVDN LVSIFSRVYH ESWGPRTDDI
FRAGLLTLAA QPEVPVLTQL PRLLTDGAYR QRLVGEIKKG DGNDILAGFW QWYEALSEPA
QAHAVAPLMN KLRGFLLRPF VRAAIAAGPS TVDMDTVLND GGVCLVRIAQ DALGVETAAL
MGSIVVSAVW QATTRRARMP QGKRPDASLY LDEAHNFLTL PYALEDMLAA ARGYRLGITL
AHQNLAQLPR HLEESIAANA RSKIYFTMSP ADAKRLVRHV EPRLSEHDLA NLGRFHAATR
LVVVGEEAPA FTLRTEKLPA PVPGRAAQIR RELRRRAPTP TPTPPDPAAP QPQADPRRSA
R