Gene Franean1_4431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4431 
Symbol 
ID5672783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5292752 
End bp5295556 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content63% 
IMG OID641243300 
Producthypothetical protein 
Protein accessionYP_001508716 
Protein GI158316208 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4354] Predicted bile acid beta-glucosidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.402809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGA GAGAATTATC CGGCGCACAC CGTGGCGGTG GCGTCTCGCG CAGGACCTTC 
GTGCGCGGTA CAGCCGCGGC GGCCGGCTTG GCGGCGTTCG GCGGTGCGCT TGCTGCCTGC
ACCGGAAGCG AACCCGCGGC TGCGGTGAAG CCGGCGGTTC ACCCTGTGCC CAAGGCGGCA
TATGTCCGCA AGCTAGGCGC GGTGCCAGCG GGGTCCTGCA ACGCGCTCGG GGATCAGCAG
TGCAGGACGG GAGTCCCGAA CCCCGTGTTC CGCAGTGTGG GGGCGGGCCT GTCCGTACCC
GGCCTGGGCC TCCCGCTCGG GGGCGTCGGC GGCGGCTCTT TCATGCTCAA CCAGTGTGGG
ACGTTCGGAC CGTGGAACAT GGGCGGGCAG CCGACCACGG AATTCTGGGA GATGCGCACC
CTCGCGCAGG CGGCGTTCCA CGCACGTGAG GAGGTCGTCG GGGGCGGCGG GGGTGTGTCG
GTCAGGACAC TGGCCGTACC GCACACCAAC ACCGCTCCTG ATCGCACTTT TGGCGACGTC
CTGCCTGCCT GGAACACGCT GAAGCCCGGG GACGGCAGCT ACGCTGTACT CTTCCCGTTC
GGGTACATGA CCTACAGCGG CTTCCAGTCA AAGGTCTCCA CCAAGATCTG GTCGCCGATC
GTGGCCAACG AGGACGAGCG CACGTCGATG CCGGTGGCGT TCTTCGACAT GCTCATGAAC
AATCCCACCG CCAAGCCGAT CAAGATTTCT GTCATGCTGA CGTTCCCGAA CGCCCCGGCG
TTCGCCACGG GTTCGGTGCG GACTGGTCTT TACAGCAGGT TCGATCGCGA TTCCGCGTCG
GGTATAGGCG GGGTGACCCT CGGCTCGGAC TCCCCGGAGA ACACGCCGGA CACTGTGACG
TCCGAGTGGA CCATTGCCGC GCATCCATTC GCCGGGCAGA CACTCACCTA CTGCACCTCG
TGGGACGGAT CGGGCGACGG GAGCGACATC TACGCCCCGT TCTCCGCGGC TGGCGCGGAC
GGGAAGCTGC CGAACGGCGA CATCGACCAG TCGGCATCGG CCGGTGCGGT GGCCGTGGCG
CTCACCCTGG AGCCCGACCA GACACAGACT GTTCGCTTCG CCCTTTCCTG GGACTTCCCG
CAGGTCTATT ACGACGGCGA GGACGCGACG ACGAGGGCCG TCTGGATGCG TCGGTACACG
GCGTTCCTCG GCGGAAAGAC ATCGCGGACC AACGACTACG TCCAGGATTC GTACCCCTTC
AGGCAGGGTT TCACCATCGC CCGGAAGGAG CTGGCCCGGT ACGATGACTC TCTCGCCGCC
GTCGAGTCGT GGTGGAAGCC GATCGCCGAG AATCCACAGG TTCCGCCGTG GCTACGCAAG
GCTTCTCTGA ACGAGCTGTA TCACATGATC TTCAACGGTT CGTTCTGGGA GTCCGGGCTC
GTCAGCAACA CGATGCCGAT GAGTGTCGAA GAGGGAACCT CGCCTCGTCT GGGATCCGCG
ATCCCGGAAA CCCACATCTA TTTCCACGCC GATGGCGGGG ACGGTGGAGC GCAGACGAAC
GAAGTCGACA TGGACAGCTC CGGCTACCTC GTTTTCGCGA AGCTGTTCCG CAGCTTGGAG
CTGGGTCGTG TTCGCCCGCT GCTTCAGATG GTCAGGCAGA ATCCGCTGGG AATCGGGCGC
GTGATTCAGC AGACCTTCAG AAGTTCGGGA CCCTACATCA CCCAGACGGC GTCATTCCAG
AATCTCCCGT TCTCCAAGCC GCCCACAGCG GGAAACCCTC CCGCTCCGCC CACCAGAGAT
CTCGGTGATC TGTTCGCGGA CGAAGCCGGA GATCCCTTTC GTGACTGCCC GCACAAGCTC
ATCTACCGAA CGTACGCGCT GATCAAGTTC TACGACGACG ACGATCTGCT GGAATACGGA
TACGCGCCGA TGCTGAAGGC GCTGACATAC TCGCAGTTCT TCCGTCCGAC CGGCTCCCAC
CTGCCGGCAG ACCCGGCATC CAACAACCCG CCGAACACTA TGGATCAGGC TGTCGTGAAC
GGTCACGGAA TCTACAACTG CGGGCTGTAT CTGCTGTCGC TTCAAATCCT CTCGACGCTG
ACGCCCCAGG CTGCCCGACT CGGTGTTGAC GAGGCCACAC CTGAGATACA GAAGGAACTC
GACGAGGAAC TGGCGGCAGC GAAGGAGGAA TTCGAGAGGA TCTTCTGGAA CCCGGCCACC
GGTCGATACC GCTACTGCGA CGGCACCGGC GGGATCGGAG ATCGTACCGG TACTATCAGG
GGTCGTTTCA AGCCGGTGCC GCCGCCGGAC GCCATCTGGC TCGAGTCCTT CGCCGGTCAG
CTCGTCGCGA TGGAGCTCGG CCTGCCTGAC GTCGTCGATC TGGACCATGC CCGTACTCAC
CTGAAGAACA CTCTGGATTC ATTCGTCCGG TTCAGGGATC CCGAAGGGAA CCTGATGGGT
GGCCCGATTA TCCTCAAGCC GGACTTCAGT ATCTACCCTA GTTCGCTGAG GACCACAGAA
ATCAATGAAG TGATTCCGGG TATCGCCTTC CTGGCCGCCG CGGGAGCATT CCGAATCGGC
GCCAAGGTCA AGGACAAGGA CATCACGGAA AAGGCGTTGA AGCTCGGAGA GGGGTGTGCG
CTCCAGATCT ACGACATCGA GAGCAACGGT TACGCCTTCG CAACCCCCGA GAGCTGGTTC
GTGGACGACC ACCATATCTC CAGGTTTCCT GGATACACGC GAACCCGCTC TGTCTGGTCG
CTCTACGACG CGGTCAGCGA AATCTCGGTG AAGAAACCGT CCTGA
 
Protein sequence
MGERELSGAH RGGGVSRRTF VRGTAAAAGL AAFGGALAAC TGSEPAAAVK PAVHPVPKAA 
YVRKLGAVPA GSCNALGDQQ CRTGVPNPVF RSVGAGLSVP GLGLPLGGVG GGSFMLNQCG
TFGPWNMGGQ PTTEFWEMRT LAQAAFHARE EVVGGGGGVS VRTLAVPHTN TAPDRTFGDV
LPAWNTLKPG DGSYAVLFPF GYMTYSGFQS KVSTKIWSPI VANEDERTSM PVAFFDMLMN
NPTAKPIKIS VMLTFPNAPA FATGSVRTGL YSRFDRDSAS GIGGVTLGSD SPENTPDTVT
SEWTIAAHPF AGQTLTYCTS WDGSGDGSDI YAPFSAAGAD GKLPNGDIDQ SASAGAVAVA
LTLEPDQTQT VRFALSWDFP QVYYDGEDAT TRAVWMRRYT AFLGGKTSRT NDYVQDSYPF
RQGFTIARKE LARYDDSLAA VESWWKPIAE NPQVPPWLRK ASLNELYHMI FNGSFWESGL
VSNTMPMSVE EGTSPRLGSA IPETHIYFHA DGGDGGAQTN EVDMDSSGYL VFAKLFRSLE
LGRVRPLLQM VRQNPLGIGR VIQQTFRSSG PYITQTASFQ NLPFSKPPTA GNPPAPPTRD
LGDLFADEAG DPFRDCPHKL IYRTYALIKF YDDDDLLEYG YAPMLKALTY SQFFRPTGSH
LPADPASNNP PNTMDQAVVN GHGIYNCGLY LLSLQILSTL TPQAARLGVD EATPEIQKEL
DEELAAAKEE FERIFWNPAT GRYRYCDGTG GIGDRTGTIR GRFKPVPPPD AIWLESFAGQ
LVAMELGLPD VVDLDHARTH LKNTLDSFVR FRDPEGNLMG GPIILKPDFS IYPSSLRTTE
INEVIPGIAF LAAAGAFRIG AKVKDKDITE KALKLGEGCA LQIYDIESNG YAFATPESWF
VDDHHISRFP GYTRTRSVWS LYDAVSEISV KKPS