Gene Franean1_5501 details

Gene Information

Locus tag: Franean1_5501
Symbol:
ID: 5673832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6660998
End bp: 6666016
Gene Length: 5019 bp
Protein Length: 1672 aa
Translation table: 11
GC content: 74%
IMG OID: 641244356
Product: hypothetical protein
Protein accession: YP_001509762
Protein GI: 158317254
COG category:
COG ID:
TIGRFAM ID:
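
The coordinate and length fields in this record agree with each other, and a little arithmetic shows it. The sketch below is a minimal Python illustration. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue; the final (stop) codon is not translated.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(6660998, 6666016)
print(length)                     # 5019
print(protein_length_aa(length))  # 1672
```

Both results match the Gene Length (5019 bp) and Protein Length (1672 aa) fields listed above.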


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.627379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.180231
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTTGA CCGCTCCGGC GTCCGGAGCC GCCCATCCCG ACGGGCCGGC CCCGGGCGCC 
ACGACGGGCC CCGGCGGCGC GCCGCCCTCC GATCACCCGC GCCGGCAGGG GCGGCTCGGC
CGGGTCCGCC GGCTGCTGCC GTCCTGGCCC GCGCTGGTAC TCGCCGCGCT GGCCTACATC
CCGCTGCTGG CGACGGCGCC GGGGCGGATC GGCGCCGACA CGAAGGCCTA CCTCTACCTC
GACCCGGGCC GGATGCTGGC CCGCGCGGTG TCGATGTGGG ACCCCGACGT CGGCATGGGG
ACCGTCACCC ACCAGAACAT CGGCTACCTG TTCCCGCAGG GCGCGTTCTA CTGGCTCGCC
CAGCTCGCCG GGCTGCCCGA CTGGGTGGCG CAGCGGCTGT GGACGGGCTC GATCCTGTTC
GGCGCGGGCG CCGGCGTGCT GTTCCTGCTG CGCACGTTCG GCTGGGCGAA CCGCTACGCG
TTCATCGCGG CCCTCGGGTA CATGCTCACC CCGTACACGC TGGAGTACGA GGCGCGGATC
TCGGCCATCC TGCTGCCGTA CGCGGGCCTG GGCTGGCTGA TCGGCATCAC CGTGCGGGGG
CTGCGCGAAG CCGGCGCGGA CGACCGCTCC CGCGCCGCGG ACACCCCGCG CCTGGTGCGC
TGGCGCTCGG GCTGGCGCTG GCCGGCGGCG TTCGCGCTGA TGGTCACCCT GATCGGCAGC
ATCAACGCGT CCAGCCTGAT CTTCATCCTG TTCGCTCCGC TGCTGTGGGT GCCGTTCGCG
GTGTGGGGCA CCCGGGAGGT CCGTCTCGGC ACCGCCCTGA CCCTGTGCGG GCGGGCGGTC
GCGCTGGTCG TCGTGACGTC CGCCTGGTGG ATGGCCGGTC TGTACACCCA GGCCGGCTAC
GGGCTGAACG TGCTCGCCTT CACCGAGACG GTGAAGACCG TGGCGAGCAG CTCGCAGGCG
TCCGAGGTGC TGCGCGGCCT GGGTAACTGG TTCTTCTACG GCGAGGACGC GCTCGGCCTG
TGGATCGGCC CGGCCAAGGA CTACACCGGC AGCCTGGTGA TCATCGCGAT CAGCTTCGCG
GTGCCGATCC TGGCCCTGGT CGCCGCGTCC TGCCTGCGGT GGGGCCAGCG CGCGTACTTC
GTGGCGCTGA TCGCGCTGGG CACGACGATC GCGGTCGGCG TCTACCCGTA CGACCACCCG
TCCCCGCTGG GCCGGGTGTT CCGCGACTTC GCCGAGGGCT CCACCGCGGG CCTGGCGCTG
CGGTCGCTGC CGCGTGCCGT CCCGATGGTC GTCCTGGGGC TCTCGGTGCT GCTCGCCGGT
GGGCTCGCCG TCCTGGACCA GCGGTACGCG GCCCGCCGCG CACAGGCCCG GGCCGGCGCG
TCGTCCACGA CCGATACCCC GCGGCCCGCG GCCGGTGCTC TACGGCTGCG CGGGCGCACG
GTGCCGACGC TGGCGTTCGG CGGGGTGGCG CTGCTGCTCG TGCTGAACAT GTCACCGCTG
TTCCGCGGGC ACTTCATCGA GCCGCTGCTG GACCGGCCCG AGGACATCCC CGGCTACGAG
CAGGAGCTCG CCAGCGCGCT GGACGCCGTC GCCCCCGACG CGACCGGGGA ACAGACCCGG
GTGCTGGAGC TGCCCGGCGC CGACTTCGCG CACTACCGCT GGGGCACGAC GCTCGACCCG
GTCATGTCCG GGCTGATGGA CCGGCCGTCC GTGGTCCGCG AACTCATCCC CTACGGCGAC
GCCGGCTCCG TGGACCTGCT GCGCTCGCTG GACCGGCGGA TGCAGGAGGG CGTGCTCGAC
CCGGCTTCGA TCCCCGACAT CGCCCGGCTG ATGAGCGCCG GCGACGTCGT GCTGCGCAGC
AACCTGGCCT ACGAGCGGTT CCGGACCCCG CGCCCGCGCG CGACCTGGGA CCTCGTGGCC
AACCAGCGCC CCGCCGGGCT GGCCGAGCCG CGCACGTTCG GACCTCCGGT GCTGGAGGAC
CCGGGCATCC CCTACACCGA CGAGATCACC CTGGGCACCA ACGCCGACGT CATCGACCCG
CCGGCCCTCG CCGACTTCCC CGTCGACGAC CCGCACCCGA TCGTGCGTGC CGAGCGCACC
GACGCCCCGC TGCTGGTCAG CGGCAACGGG GAGGCCCTCG TCGACGCGGC GGCCACCGGC
CAGCTCGACG CGGTGCTCAA CGACGGCCGC ACCATCCTCT ACGCCGGTGA CCTCGCCAGC
GACCCGCAGC GGCTGCGCCA GGCCCTGGAC GACGGCGCCG AGCTGCTGGT CAGCGACACC
AACCGGCTGC GCGCCGAGCG CTGGACGGGC ATCCGGGAGA ACTTCGGCTA CGTCGAGCAG
CCCGGCGTCA CCCCGCTGGC GAAGGACGCC AACGACAACC GGCTGGTGCT CTTCCCCGAC
GCCGGCCACT CGGCGCAGAC CGTCACCCAG ATGCACGCAC CGGGCTCGGA GGCCCAGGTC
GCGGACGTGC GGGCCACCGA TTACGGCAAC ACCTTCTCCT ACGGGGTCTC CGACCGCCCG
GTGCTCGGCG TCGACGGCAG CCTGGACTCG GCCTGGCGGG TCGGCGCGTT CACCGACCCG
GCGGGCGCCG CCTGGCAGGT CGACCTGGCG AAGCCGACGA CCACCGACCA CATCCGCGTC
GTCCAGCCGC TCAACGGCCC GCGCAACCGG TGGATCACCA GGGCGACCCT GACCTTCGAC
GGCGGCTCTC CGGTCACCGT CGACCTGCGG GACTCCTCCC GCACCCAGGC CGGCCAGACG
ATCACCTTCC CGTCCCGGAC GTTCACCACA CTGCGCATCC ACGTCGATGC CACGAACTTC
GGGGTGCGGC GGACCTACGA CGGGCTGTCC GCCGTCGGCT TCGCCGAGGT GGAGATCCCC
GGCACCAACG GGAAGCCGCT GACCGCGGAG GAGGTGCTGC GCCTGCCCAC CGACACCCTC
GACGCCGCCG GCGCCTCCTC CCTCGACCAC CGGCTGAGCC TGCAGATGTC CCGCGACCGG
GCGAACCCCG CCGAGCCGTT CCGCCGCGAC CCCGAGCCGA CGATGGCACG CTCGTTCACG
CTGCCCACGG CGCGCACGTT CGCGCTCACC GGCACCGCCC GGATCTCCGC CTACGCCTCC
GACCAGCTCG TGGACGCGAC CCTCGGCCGG GCCGGAGCCG CGCCGACGGT CACCTCCTCC
GGCCGGCTGC CCGGCGCGCT CGCCGCACGC GGGTCGAGCG CGTTCGACGG CGACACCACC
ACGGCGTGGA GCCCCGGCAT CGGCGACCAG ATCGGCTCCT GGCTCCAGGT GACCTCGCCG
ACCCCGGTCA CGTTCTCCTC GATGAACCTC GCCCTCGTCA CCGACGGGCG GCACTCCGTA
CCCACGAAGA TCGGCATCAT GGTGGACGGG CGTCGGGTCG GCGCGGTCGC CGTCCCACCG
GTCGCCGACA CGCCGGTCAG CTCGCAGACC CGCAACGCCG CCCAGAACGT CAGCCTCACC
TTCCCCGCGG TCACCGGCAG CGCCATCCGG CTGGTCGTGG ACGACGTCCG CACCGTCACC
AGCGTCGACA CGATCAGCGG CGCGGTGATG GACCTGCCGG TCGGCATCGC CGAGGTCGTC
GTACCGGGCC TGGACGGCAC CGCCGGCGGA ACGGGCCGCG GCTCGGCCGC GACCACCGCT
CCCGGCACCG CCGGCTCACC CGCCACCGCC ACGGCCACCG GTGCCGACAC CGGCGCTGTC
ACCGGAGCCG CAGCCGGCCT GACGATCACC GCGGTTCCCG GGACCACCGG GACGGCGGCG
ACCGGTACCG CGGACATTCC CGCCCCCTGC CGCACCGACC TGCTGACCAT CGACGGCGCA
CCGGTCGGGA TCCAGGTCAC CGGAAGCATC AAGGACGCAG CCGGCCGCGC CGGACTGACG
GTGCGGGCCT GCGGGACACC GGTCACCCTC AACGCGGGTG ACCACGTGGT GCGCACCTCC
AACGGCGCGC TGACCGGGAT CGACGTCGAC CGGCTGCTGC TGGCCTCCGA CGCCGGCGGC
GGAGCCTGGC TCGACGCCAC CGGGGCGGGC CCCGGGACTG CCGCCGTTCC GGGCTCGGCC
GGCACCGGCG CGGGTGGCAC CGGTGTGGCC GCGGCCGCCA CGAGCGGCAG CGTGGGCGGG
ACCGCACCGA CCGTGAAGGT CGAGGCGACC AGCGACACCT CGTTCCGCGT GGATGTCAGC
GCCGCGACCC CGGGCAAACC GTTCTGGCTC GTGCTGGGCG AAAGCCGCTC CCCCGGCTGG
ACGGCCCGCG TCAACGGCCA GGACCTCGGC GAATCCCACC TGGTCGACGG ATACGCGAAC
GGCTGGCGGA TCGACCCGAG CGCCGGGAGC TTCACCGTCC TCCTCGACTG GGCACCGCAG
CACATCGTCG GCATCGCGCT GAAACTGTCG GTGGTGACGG GCATCCTGAG CCTGGCGATC
CTGCTGGTTG GGGCGTACCA GCGCCGCCGC GGCCGGAGCT GGCGCCATCT CGTCGGCTAC
CAGTACATCC CCGCGGGCAC CGGCACCGGC GGAGTCGCCC GAGCGGGCAC CGCGGGTCCA
GGCCGGGCTG ATCTGCCGAC GGCGGATCCC TGGCGGGCCG GGCCGGGCCC GGCCCTCCGG
GTGTCGGGCC GGTCGACGCT GCTCACGGCC CTGGCCACCG GGGCGGCGAG CGTGGTGCTG
GTCGCCCCGC TGGCGGGACT CGTGGTTGCC GTGCTGACCG CCGCCGCGCT GCGGATCCCC
CGCGCCCGGC CGCTGCTGCG CGTCGCCCCC GCCGCGTGCC TGGCTCTCAG CGCGCTCTAC
GTCCTCCAGG TGCAGGCCCG GCACGACCTG CCAGCCAACG GCTCCTGGGT GGCCGCGTTC
GACCGAGTCG CCATGATCTC CTGGCTCACG GTCCTCCTCC TGGCCGCCGA CATGATCGTC
TCGCTGGTCC GCGCCCGCGC CGCTGCCCGG ATCGCCTCGG GCACCGCCCC CGGGACCTCC
ACCGCTCCCG CCGCACAGGG CTCCCCGTCC GAGCTGTGA
 
Protein sequence
MTLTAPASGA AHPDGPAPGA TTGPGGAPPS DHPRRQGRLG RVRRLLPSWP ALVLAALAYI 
PLLATAPGRI GADTKAYLYL DPGRMLARAV SMWDPDVGMG TVTHQNIGYL FPQGAFYWLA
QLAGLPDWVA QRLWTGSILF GAGAGVLFLL RTFGWANRYA FIAALGYMLT PYTLEYEARI
SAILLPYAGL GWLIGITVRG LREAGADDRS RAADTPRLVR WRSGWRWPAA FALMVTLIGS
INASSLIFIL FAPLLWVPFA VWGTREVRLG TALTLCGRAV ALVVVTSAWW MAGLYTQAGY
GLNVLAFTET VKTVASSSQA SEVLRGLGNW FFYGEDALGL WIGPAKDYTG SLVIIAISFA
VPILALVAAS CLRWGQRAYF VALIALGTTI AVGVYPYDHP SPLGRVFRDF AEGSTAGLAL
RSLPRAVPMV VLGLSVLLAG GLAVLDQRYA ARRAQARAGA SSTTDTPRPA AGALRLRGRT
VPTLAFGGVA LLLVLNMSPL FRGHFIEPLL DRPEDIPGYE QELASALDAV APDATGEQTR
VLELPGADFA HYRWGTTLDP VMSGLMDRPS VVRELIPYGD AGSVDLLRSL DRRMQEGVLD
PASIPDIARL MSAGDVVLRS NLAYERFRTP RPRATWDLVA NQRPAGLAEP RTFGPPVLED
PGIPYTDEIT LGTNADVIDP PALADFPVDD PHPIVRAERT DAPLLVSGNG EALVDAAATG
QLDAVLNDGR TILYAGDLAS DPQRLRQALD DGAELLVSDT NRLRAERWTG IRENFGYVEQ
PGVTPLAKDA NDNRLVLFPD AGHSAQTVTQ MHAPGSEAQV ADVRATDYGN TFSYGVSDRP
VLGVDGSLDS AWRVGAFTDP AGAAWQVDLA KPTTTDHIRV VQPLNGPRNR WITRATLTFD
GGSPVTVDLR DSSRTQAGQT ITFPSRTFTT LRIHVDATNF GVRRTYDGLS AVGFAEVEIP
GTNGKPLTAE EVLRLPTDTL DAAGASSLDH RLSLQMSRDR ANPAEPFRRD PEPTMARSFT
LPTARTFALT GTARISAYAS DQLVDATLGR AGAAPTVTSS GRLPGALAAR GSSAFDGDTT
TAWSPGIGDQ IGSWLQVTSP TPVTFSSMNL ALVTDGRHSV PTKIGIMVDG RRVGAVAVPP
VADTPVSSQT RNAAQNVSLT FPAVTGSAIR LVVDDVRTVT SVDTISGAVM DLPVGIAEVV
VPGLDGTAGG TGRGSAATTA PGTAGSPATA TATGADTGAV TGAAAGLTIT AVPGTTGTAA
TGTADIPAPC RTDLLTIDGA PVGIQVTGSI KDAAGRAGLT VRACGTPVTL NAGDHVVRTS
NGALTGIDVD RLLLASDAGG GAWLDATGAG PGTAAVPGSA GTGAGGTGVA AAATSGSVGG
TAPTVKVEAT SDTSFRVDVS AATPGKPFWL VLGESRSPGW TARVNGQDLG ESHLVDGYAN
GWRIDPSAGS FTVLLDWAPQ HIVGIALKLS VVTGILSLAI LLVGAYQRRR GRSWRHLVGY
QYIPAGTGTG GVARAGTAGP GRADLPTADP WRAGPGPALR VSGRSTLLTA LATGAASVVL
VAPLAGLVVA VLTAAALRIP RARPLLRVAP AACLALSALY VLQVQARHDL PANGSWVAAF
DRVAMISWLT VLLLAADMIV SLVRARAAAR IASGTAPGTS TAPAAQGSPS EL