Gene Franean1_1052 details

Gene Information

Locus tag: Franean1_1052
Symbol:
ID: 5669466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1235451
End bp: 1237469
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 70%
IMG OID: 641239981
Product: alpha amylase catalytic region
Protein accession: YP_001505414
Protein GI: 158312906
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
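
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1235451
END_BP = 1237469
GENE_LENGTH_BP = 2019
PROTEIN_LENGTH_AA = 672


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the gene ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1237469 - 1235451 + 1 gives 2019 bp, and 2019 / 3 - 1 gives 672 aa, which matches the listed gene and protein lengths.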


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.776114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATCG GACGTTCAGT AGGCCGGGTT GTCATCACTG ACGCGCGCCC GGTCGTCTCC 
TGTGGCCAGT GGCCGTCGCG GGCGGTGGAG GGGGAAACCC TCACCGTCAG CGCGACGGTC
TTCCGTGAGG GGCATGACCT CATCGGGGCC AATGTCGTTC TTTCCGGCCC CGACGGCCAG
GGAGTTCCGT TTACGCGGAT GAAACTCGCC GGCGCGGGCA CGGACCGCTA CGAGGCCGAT
GTCGTCATGG GCCGTGAGGG CCTGTGGGGC TATCGCGTGG AGGCCTGGGC TGATCCGCTG
GCCACCTGGC GGCACGGCAT CGAGCTCAAG GTCGGTGCCG GCCAGACGGT GGACGAGCTG
GCGGTCGACT TCGAGGACGG CGCGCGGCTG CTGCTGCGCG CGCTGCCCGG AGTGCCCGAG
CCCCGCCGGG TCGACATCGC GCGGGCCGTC GCGGTCCTGC GCGACGACGA CGTGACGGAT
CCGCACGAGC GCATCGCCGT CGCCTGCTCC CCGGAGCTGG CCGGCCTGCT CGACGAGCAC
CCGCTGCGCG AGCTCGTGAC CCGCACCCCG TTGTACCGGG TGTGGGTGGA CCGCGAGCGG
GCGCTCTACG GCAGCTGGTA CGAGCTGTTC CCCCGGTCCG AGGGCGCGAG CCTCGACCCA
CCGCGATCGG GCACCTTCCT CACCGCCGCG GAACGACTGC CGGCCGTCGC CGCGATGGGC
TTCGACGTGG TCTATCTGCC ACCGATCCAC CCGATCGGCG AGATCAACCG CAAGGGCCCG
AACAACACCC TCACGCCCGG CCCGGAGGAC CCCGGTTCAC CGTGGGCCAT CGGCAGCTCG
CAGGGCGGCC ACGACGCCGT CCACCCGGAT CTGGGGACCC TGGACGACTT CGACCTCTTC
GTCGCGCGGG CCCGCTCGCT CGGCATGGAG GTCGCGCTCG ACCTGGCGTT GCAGTGCGCG
CCGGACCACC CCTGGGCGAA GCACCATCCG GAGTGGTTCG TCGTGCGCAG CGACGGCTCG
ATCGCGTACG CGGAGAATCC GCCGAAGAAG TACCAGGACA TCTATCCCCT GAACTTCGAC
GCCGACCCGA TCGGTCTATA TCACGAGATT CTGCGGGTGG TCCGGTTCTG GACGGCCCGC
GGCGTCCGGA TCTTCCGGGT GGACAATCCG CACACGAAGC CGGTCGAGTT CTGGGAGTGG
CTCATCGGCC AGGTCAAGGC GACCGAGCCG GACGTGCTGT TCCTGGCCGA GGCGTTCACC
CGCCCGGCGA TGATGCACAC GCTCGCCAAG ATTGGCTTCA CCCAGTCCTA CACCTACTTC
ACCTGGCGCA ACGAGCGCCG TGAGCTGGAG GAGTACGCGC AGGAGCTGGT CGACGCGGCG
CACTACATGC GGCCCAACTT CTTCGTCAAC ACCCCGGACA TCCTGCCCGG GTTCCTGCAG
ACGGGCGGGC CGGCGGCGTT CCGCATCCGG GCGGTGCTCG CCTCGATGCT CTCCCCGACC
TGGGGAGTGT ACGCGGGGTA CGAGCTGTAC GAGAACTCCC CCGTGCGGGC CGGGAGCGAG
GAGTACCTCG ACTCGGAGAA ATACCAGTAC AAGCCGCGGG ACTGGGCCGG CGCGGAACGC
GCCGGGGCCT CGCTGGCGCC CTACCTCACC CGGCTCAACC AGATCCGGCG CGACCACCCG
GCCCTGCACT GGATGCGCAA CCTGCACATC CACGAGTCCG CGACGCCCGA GATCACAGTC
TTCTCCAAAC GGCACACCAC GGCCCGGGCG GGCGGACCCG CCCTCGGCCG CCTCCGCCCC
GACGACGACC TCGTCATCGT CGTCGTCAAC CTTGACCCGC ATTCGGCCCG GGAGACCACC
GTGCGACTGG ACATGCCCGC TCTCGGGCTT GACTGGGGCG ACCGCTTCGA AGTGCACGAC
GAGATGACCG GCGTCACCTA CCAGTGGGGC CGTGAGAACT ACGTCCGCCT GGAGCCGACC
GAGCCCGCCC ACATCCTGAC CGCGCGGCGC CTGCCGTGA
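
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as a sample.
FIRST_LINE = (
    "ATGATGATCG GACGTTCAGT AGGCCGGGTT "
    "GTCATCACTG ACGCGCGCCC GGTCGTCTCC"
)

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq))  # 60 bases on a full line
    print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions over all lines of the gene sequence should give the full length and a GC percentage that can be compared with the "Gene Length" and "GC content" fields.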
 
Protein sequence
MMIGRSVGRV VITDARPVVS CGQWPSRAVE GETLTVSATV FREGHDLIGA NVVLSGPDGQ 
GVPFTRMKLA GAGTDRYEAD VVMGREGLWG YRVEAWADPL ATWRHGIELK VGAGQTVDEL
AVDFEDGARL LLRALPGVPE PRRVDIARAV AVLRDDDVTD PHERIAVACS PELAGLLDEH
PLRELVTRTP LYRVWVDRER ALYGSWYELF PRSEGASLDP PRSGTFLTAA ERLPAVAAMG
FDVVYLPPIH PIGEINRKGP NNTLTPGPED PGSPWAIGSS QGGHDAVHPD LGTLDDFDLF
VARARSLGME VALDLALQCA PDHPWAKHHP EWFVVRSDGS IAYAENPPKK YQDIYPLNFD
ADPIGLYHEI LRVVRFWTAR GVRIFRVDNP HTKPVEFWEW LIGQVKATEP DVLFLAEAFT
RPAMMHTLAK IGFTQSYTYF TWRNERRELE EYAQELVDAA HYMRPNFFVN TPDILPGFLQ
TGGPAAFRIR AVLASMLSPT WGVYAGYELY ENSPVRAGSE EYLDSEKYQY KPRDWAGAER
AGASLAPYLT RLNQIRRDHP ALHWMRNLHI HESATPEITV FSKRHTTARA GGPALGRLRP
DDDLVIVVVN LDPHSARETT VRLDMPALGL DWGDRFEVHD EMTGVTYQWG RENYVRLEPT
EPAHILTARR LP