Gene Franean1_4900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4900 
Symbol 
ID5673240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5883970 
End bp5886114 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content76% 
IMG OID641243755 
ProductFHA domain-containing protein 
Protein accessionYP_001509171 
Protein GI158316663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.542451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0483191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGCCG TGCTCCCGCC AGGGCCGTCA CCGGGGCCGG CCGGGCCCGC CGGGAGCGGT 
GACGGCCCCC GCATCGGCCC GGTCTACCTC GCCGGGCCGG CGTACGAGGC CTCGTCCGGC
GCGGCGGAAC CCACACGGCC CGCCGAGCCC ACCGCGCCGC CCACGCACCC CGCACCGCCT
GCGTACTCCG CGCCGTTCTC GCCGCCCGCG CCGCCCGCGC ACTCCGTCCC GCCGGCGGCG
CCGCCGGCGC ACCGGGCGGA GGCGCCGACC TCCGGGAGCT CCTGGTTCGA GCCGGGACGT
CCCAGCTACC CGGCGCTGCA CCTGCCCACC CCGTCGGCGC CGACCCCCGC CTCGCCGGAG
CCGGGCCAGT CGCCGGAGCC GGCGTCCGCG GGACCGCCGA TCGCGGACGA GCCAGCGGCG
AGTCCGTGGA CAGCGAGTCC GTGGGCGGCA GCTCCGCCGG CACCACACCC GCAGGCACTG
GACCCGCAGG CACTGGACCC GCAGGCACTG GACCCGCAGG CACTGGACCC GCTGGGACCG
GGCACACAGG CGACGGCATG GGTACCGGAC TCATGGGCCC CTGACCGATG GACGCCTGAC
CGGTGGTCCC CGCGGCCGTC GGAGCCGTCC GAGCCGTCGG CTCCGACACC GCTGACACCG
ACAGCGGCAC CGGCACCGGC ACCGACCGCA GCAGCCCCGG AGCCGCTCGA GACTGCTACG
GGGCCGGCCC GCGATCCGGT CGGGCGAACC GGCCCGCTCC AGGAACGCCT CTACGCCCGC
GGCAGCCGGC GCAGGTCCGC GCGCGGCCCG GCCGGCGCCA CCGACGGCCG GCCGGCGATA
TCCGCGGGCA CGCCGACCCG GGCCCAGGCG GCCATGGCCT GGGCGCAGCG CTGGTACCTG
CGGGACGTGC TCATCGCCTG GATCGCGGCG CGGGCCGTGG TGGGCGCGGC GCTGGCGCTG
ACCAGGTTCG TCGCGGACAC CGTCGCCGGT GACAACGGGG CCACGCTGGA CACCACCGAC
CTGCTCGGCT GGGACGCCGG CTGGTACCTC AACATCGCCG ACAACGGCTA CGACGCAGCC
GGCGCCGAGA GCCGCCGTTT CTTCCCGCTG CTCCCCCTGC TGGTCCGGCT CTTCACCGCC
CTGCCCGGCC TGGGCGGCCA CGGCGGCCAG GTGCTGCTCG CCCTGGTGAA CCTGATCGCT
GTCGTGTTCG CCCTCGCTCT GGTCGGCATC GCGCGGGTCG AGGGGTTCGA CGACGACACG
GTGCGACGGG TGATCTGGAT CTCCGCGCTG GCCCCCCCGG CGTTCGTCCT GGTGATGGGC
TACGCGGAGG CCCTGGCGGG GCTGCTGGCC GTCACGGCGT TCCTGGGCGC GCGGACGCGG
CGCTGGGAGC TGGCGGTGGT CGCCGGCCTG CTCGGCGGCC TGTGCCGGCC GCTGGGCCTG
CTTCTCGCCG TCCCCGTCGC GATCGAGGCC GCGCGGGGGC TGCCGCTACC CCTGGCCCGG
CGTCTCGGCG CCCCACCACC GAACCGCCTT GGTGCCCCGC AGCCGAACCG CCTTGGTGCC
CCGCAGCCGA ACCGCCTTGG TGCCCCGCAG CCGGACCGCC TCGGTCCGGC TGAGCCGGAA
GACCTCGGTG CCCCGTCGGC ACCACGCGCC GCCGCACGGG AGGCGCTTGC GCGGCTGTGC
GCGGTCGCCG CGCCCGTGGC CGGGGCGGGG ATCTACCTGC TGTGGTCGGC GGCCATCCAC
GGCGACGGCC TCGCTCCGTT GACCATGCAG CGGGACGCGG CCCGCCACGG CAGCACGGCC
AATCCGCTGC TGACCGTCAT CGACGCGGCA AAGGGCGCCC TGCACGGCGA GCTCGGCACC
GCGCTGCACG TTCCCTGGCT GCTGCTGGCG CTCGTCGCGC TGGTCGTCAT GGCCCGTGCC
CTGCCGGTGT CCTACTCCGT GTGGTCGGCC CTGGTGCTCG CCGCGGTGCT GACGGGCAGC
AACCTCGACT CATCGGAGCG ATACCTGTAC GGGGCTTTCC CGTTCCTGCT CGTCGCCGCG
CTGGTGACGG CACGGCGCGA GGTGTGGATG CTGGTGATCA CCGTCTCGAC CGCGGCGATG
ACCGTGTACG CGACACTGGC GTTCACGCTG TCCTATGTTC CCTGA
 
Protein sequence
MPAVLPPGPS PGPAGPAGSG DGPRIGPVYL AGPAYEASSG AAEPTRPAEP TAPPTHPAPP 
AYSAPFSPPA PPAHSVPPAA PPAHRAEAPT SGSSWFEPGR PSYPALHLPT PSAPTPASPE
PGQSPEPASA GPPIADEPAA SPWTASPWAA APPAPHPQAL DPQALDPQAL DPQALDPLGP
GTQATAWVPD SWAPDRWTPD RWSPRPSEPS EPSAPTPLTP TAAPAPAPTA AAPEPLETAT
GPARDPVGRT GPLQERLYAR GSRRRSARGP AGATDGRPAI SAGTPTRAQA AMAWAQRWYL
RDVLIAWIAA RAVVGAALAL TRFVADTVAG DNGATLDTTD LLGWDAGWYL NIADNGYDAA
GAESRRFFPL LPLLVRLFTA LPGLGGHGGQ VLLALVNLIA VVFALALVGI ARVEGFDDDT
VRRVIWISAL APPAFVLVMG YAEALAGLLA VTAFLGARTR RWELAVVAGL LGGLCRPLGL
LLAVPVAIEA ARGLPLPLAR RLGAPPPNRL GAPQPNRLGA PQPNRLGAPQ PDRLGPAEPE
DLGAPSAPRA AAREALARLC AVAAPVAGAG IYLLWSAAIH GDGLAPLTMQ RDAARHGSTA
NPLLTVIDAA KGALHGELGT ALHVPWLLLA LVALVVMARA LPVSYSVWSA LVLAAVLTGS
NLDSSERYLY GAFPFLLVAA LVTARREVWM LVITVSTAAM TVYATLAFTL SYVP