Gene Franean1_5316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5316 
Symbol 
ID5673650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6403091 
End bp6404875 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content75% 
IMG OID641244173 
Productalpha amylase catalytic region 
Protein accessionYP_001509580 
Protein GI158317072 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0561676 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.278605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCCG TTGTCGCATA CCGTGCGGGC GAACCCGGGC AGCCCGGCGG CCGCTCCGGC 
CCGGCCGCGC CACCAGCGGG AACCGAGGAG ACCGATTACT CGGCGTGGTG GCACGACGCC
GTGATGTACG AGGTCTACGT CCGCAGCTTC GCCGACGCCG ACGGCGACGG GGTCGGCGAC
ATCGAGGGCA TCCGCCGGCG CCTGCCCGAT CTCGCCGACC TGGGCGTCGA CGGGATCTGG
GTCAGCCCCT TCTACCGCTC CCCCATGGCC GATCACGGCT ACGACGTGGC CGACCACACC
GACGTCGATC CCTTGTTCGG CACTCTCTCG GACATCGACG CGCTGCTGCG CGACGCCCAC
GAGGCCGGCC TGAAAGTCGT CGTCGACCTG GTCCCGAACC ACTCCAGCAG CGCGCACCCC
GCCTTCCAGG CGGCGCTCGC GGCCGGGCCG GACGCGCCGG AGCGGGACCT CTACCTTTTC
CGGGACGGGC GCGGCCCGGA CGGTGCGCTC CCCCCGAACA ACTGGATCTC GGTGTTCGGT
GGCCCCGCCT GGACGAGGGT TCCGGACGGC CAGTGGTACC TGCACCTGTT CGCCCCGGAG
CAGCCGGACT GGAACTGGCA GCATCCGCGG GTGCGGGCGG CGCACGCCGA GATCATCCGA
TTCTGGCTCG ACCGGGGGGT CGACGGCTTC CGGATCGACG TCTCGCACGG CCTGGTCAAG
GACGGCGAGC TGCGCGACCA TCCGACCGGC GCGCTCCCCA CGCCGGAGAC CGGCTTCCGG
GAGGAGATCG AGCCGCACGC GTGGGATCAG GACGGCGTCC ACGAGATCTA CCGCGAGTGG
CGGGCGATCG TCGACGCCCA CGACCGGCGC GACGGCCGAC AGCGGGTGCT CGTCGGCGAG
ACCTGGGTCG CCGACCCGGG GCGCCTCGCC CGGTACGTCC GTCCCGACGA GCTGCACCTG
ACCTTCACGT TCTCGCTGCT GTACGCGCCG TTCTCCGCCC CGGCGTGGCG GGCGGCGATC
GATGCCGCCC GGGCCGCGAC AGCCGCCGTC TGCGCGCCGC CGACCTGGGT GCTGGCCAAC
CACGACGTCG TCCGTCCGGT CAGCCGCTAC GGCGGCGGCG AGACCGGCCT GCGCCGGGCC
CGGGCGGCGC TGCTGACGCT GCTGGCGCTG CCCGGCACGG TCTACCTGTA CCAGGGCGAC
GAACTCGGCC TGCCGCAGGT CGACATCCCG CCCGAGGCCC GCCAGGACCC GGTCTGGGAA
CGGTCCGGCC ACACCTCGCC CGGCCGGGAC GGCGCACGCG TGCCGCTGCC GTGGTCCGGC
GACGCGCCTC CGTACGGGTT CAGCGCCGGG GCAGCCGAGC CGTGGCTCCC GCAACCGCCC
GACTGGGCGA CGCTGACCGC GTCCGCCCAG TCCGTGGACC CGATGTCGAC CAGGGTGCTC
GTCCGCGGCG CGCTGGCGCT ACGCCGCGCG CTCCCCTTCC TCGGCGGACC GGCCGGGCCG
GGCAGCGCGG CCGAGCCGGC CGAGCCGGGG CACTCGGGCG AGCCGGGGCG GCCGGCAGGC
ACGCGGCCCG GGTTCCGCTG GCGGGACGAT CTGCCCGCGG ACTGCCTGGC CTTCGACCGG
ACGTCCGCCG CCGGCGCCCT GACCTGTGTG ATGGCCACGC GGAGCGAGAT ACGCCTGGAG
ATCGCCGGCC GGCTGGTGCT GGCGAGCGGG CCAGTCGGCT ACGACGGCGC GACGCTCGTC
CTGCCACCGG ACACCACCGC GTGGGTCATC CCCCGTTCGG GTTGA
 
Protein sequence
MDPVVAYRAG EPGQPGGRSG PAAPPAGTEE TDYSAWWHDA VMYEVYVRSF ADADGDGVGD 
IEGIRRRLPD LADLGVDGIW VSPFYRSPMA DHGYDVADHT DVDPLFGTLS DIDALLRDAH
EAGLKVVVDL VPNHSSSAHP AFQAALAAGP DAPERDLYLF RDGRGPDGAL PPNNWISVFG
GPAWTRVPDG QWYLHLFAPE QPDWNWQHPR VRAAHAEIIR FWLDRGVDGF RIDVSHGLVK
DGELRDHPTG ALPTPETGFR EEIEPHAWDQ DGVHEIYREW RAIVDAHDRR DGRQRVLVGE
TWVADPGRLA RYVRPDELHL TFTFSLLYAP FSAPAWRAAI DAARAATAAV CAPPTWVLAN
HDVVRPVSRY GGGETGLRRA RAALLTLLAL PGTVYLYQGD ELGLPQVDIP PEARQDPVWE
RSGHTSPGRD GARVPLPWSG DAPPYGFSAG AAEPWLPQPP DWATLTASAQ SVDPMSTRVL
VRGALALRRA LPFLGGPAGP GSAAEPAEPG HSGEPGRPAG TRPGFRWRDD LPADCLAFDR
TSAAGALTCV MATRSEIRLE IAGRLVLASG PVGYDGATLV LPPDTTAWVI PRSG