Gene Franean1_1484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1484 
Symbol 
ID5669888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1773858 
End bp1777292 
Gene Length3435 bp 
Protein Length1144 aa 
Translation table11 
GC content70% 
IMG OID641240404 
Producthypothetical protein 
Protein accessionYP_001505830 
Protein GI158313322 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0469742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG AACTGGAGGC GCTCGCCGCC ATCAAGCTGG ATTGGGTACA CGGGCTCGAG 
GACGTCTGGC GCGACTCGCC GGTGCACGTC GACGGCGTCC ACGTCGAGCC GCGCCGGCGC
ATTCTCGCGG CTCTGGACCG GCTCGGCGAG CGGAAGCCGC CAGGCATCGT CGTCCAGGGT
CAGGCCGGTG CGGGGAAGAC ACACCTGCTC GGCTGGGTAC GTCAGCAGGT CGAACGGCGG
GGCGGTTACT TCTTCCTGAT CGGCCCGGTG AGTGGGGACG CGTTCGCCGA GAGCGTCGTC
TCGGCGATGC TCACCGGCCT CTTCCGGACC GTCGACGGCG AGGCGGAGAT CCGGCGCTTC
CTTCTGGCGC TGTGCGCGCA GTTCCGCGTG CCGGCGGAGC TGACCTCCGC GGTCGTCGGG
GACCAGCCTC TGGAGCCGAC TCATCTCAAG GACCTCGTGG TCGCGTTCTT CGACCACGAC
CGGGCGCTGG CTCGCGAATG CCAGCACACC CTGCGCGCCC TGGTCCTGTA CAGCTCGGAC
GACTATGACG CCCAAGCGGT CGGCGAGACC TATCTGCGGT CGATGGACGA GCAGGAGCCC
GGCGAACGGG CCCGCCGCGG CATGCAGGCG CGGACCCGCC CGCCGATGGA CATCGCCCGC
GATCTGTCCC GGCTGCTCGC GCTGACCGGA CCGAGCGTCA TCGCGGTTGA TCAGGTCGAT
GGAGTCGTCG CCGAGCTCAC GGCACGCGAC GAGGCAGCGG ACAGCGGCCT GGGCGGCTTC
GGCGGGTCGG GTGCGGTGCG GCTGAACATG CTGGCGGACG GCCTGCTCGT CCTGCACGAG
CGCATGGAAC GGACGCTGTG CCTGCTCGCC TGTCTGCCGG TCAGCTGGAT TCAGTTCCGT
GAACGAGCGG TGGCCTCCAT GCGTGACCGG TTCTCGGCGA TCATGCCACT CAACCGGCTG
GACGACCCGG AGATCACGGC CAGGATCATC GAACGCCGGT TCCAGGAGGA CTTCGACCGC
ATCGGCTTCG CCCCGCACTA TCCGACCTGG CCGGTAGCCC GGTCAGCCTT CGACCGCGCG
CCCCAGTACA CCCCGCGCGA GCTGTTGCAG CGCATCGACC GGCATATCCA GGAGTGCTTG
TCCACGGGCG TCGTCCGTGA ACTGTTCGAC TTCGCCGCCG CGGACGAGAA TGACATCGTC
GATGTTCAGG ACAACGGTGA CCACGACATC ACCGACATCA CCGTTAGCAC GAATTTCACC
GATGGCACCG ACCTGGACGG CGAGCGCCTG GATCGGCTGG ATCGGCTGGG GCGCGAGGAC
CGGAGCGGCG GTGGACGCGG CCGAGCGGCT GACGAGGTCG GCGCGACCAG TCCTGAGGTG
CTCAGCGCGC TCGACGCCCA GTTCGCCGAG CTACGAGCAG GCGCGCCGCG CGTCGCGGCA
GTCAACGAGG ACATCCTGGT GCCCGTGCTG CTGACCGCCG GACTGTCCGC GTGGATCATC
GAGAACGGCG GGGCGCACGA CTACAGCCTC GAACCGATGC CGAGCACACG CCCAGCGCTG
CACGCCCGGC TGACACGCAC TGTCGACGCC GACACCGGCG AGCAGCTGCA CTGGTGCTTC
CGGGCAATCA CAGCCAGCCA CGGCAACGCG ATTCTGCCGC GTATCACCGC GGCCTGCACC
GCCGCCGCCA TGGTCGCCGG CAACCCGCGG CGACGGCTGT TCCTGCTACG GAACGATCCG
TGGAGTCCAG GGAAGAAGGT GCAGCAGGCC CTGGCCGCGT TCAGCGCCGC CGGCGGCCGT
ACCCTGCGGT TGCACGCGGG CGACCTGGAC ACTTTCATGG CGTTGCGGAC CCTGCTCGAC
CAGCGGCCAG CCGGGCTGCG CGCGTGGCTG GCCTCCCGGC GCCCCGCCAG CGGCACCTGG
CTTCTGCGGG AGGCACTCGG CGACGCGGCC GGCACCGACC CGACCGGCTC CGCCGCCCTG
GCGCCCCGCG CCCCCACCAT GACGCCCGGT CCCGGCCGCC TGGGCAGCGC TGGTGACCCT
GACCCTGACC CTGATTTTGA CTTTGGCCCT GAGGCCGAGC CCGGCCCCGA ATCACCGCCA
CAGGTCGCGC CGGGGCCACG CTCCGCGGCA CCCGCCGGTG TTGTTCGTTC GTCATTCGCT
TCGGCGGACG ACCTGGGTGG CGGTGTCGCG GCCCCGTCCG CGCTCACGCT CGGCACGGCG
GTACGGGGAG ACGCGCCGGC CCGGATCGAT CTGGCCGTGC TGCGCCGCCA TGTCACGATC
TTCGCCGGAT CGGGCTCGGG CAAGACGGTG CTGATCCGCC GGCTGGTCGA GGAGTGCGCG
CTTCGCGGGG TGTCCTCGAT CGTGCTTGAT CCGAACAACG ACCTGGCCCG CCTCGGTGAC
GCCTGGCCAA GCCCGCCGGC CGGCTGGAGA GACGGCGACG CCCAGTTCGC CGCCGAGTAC
CTGGCCGCCA CGGACGTGGT GATCTGGACG CCGCGTCGCG ACGGCGGGCG GTCGCTGACC
TTCCAGCCGC TGCCGGACTT CGCCGGGCTG CGCGACGACC CGGACGAGTT CGAGGCGGCG
ATCGACGCGG CCGTCGCGAC GCTGGCCGTC CGAGCGAACG TCACCGGGAA CGCGAACCGA
GCCAGCCTCG GACGGGCAGT TCTGACCGCG GCGCTACGCC ATTTCGCCCG CGGTAACCGG
ACTGGTCTGA CCGCGTTCAT CGAGCTGCTG GCCGAACTGC CGGACGGGGT GGCCGACGAC
CTCGACCCGG CGGGGAAACG CTCTGTCGAG CTGGCGCAGA CGCTACGAGC GGCGATGGTC
AACGATCCGT TGTTCGGCGG AAAGGGCGAG CCGGTCGACC CTGGGACGCT GCTTGTCCCC
GCGCCGGGGC AGCGTGCCCG AGTTTCGGTG ATCAGCCTCG TGGGGCTGCA ACATGATGAG
CAGCGCCAGA GCTTCGTCAA CCAGCTCCAG CTCGCCCTGT TCTCCTGGGT GAAGCGCAAT
CCGGCCGGAA ATCGGCCCCT CGGCGGGTTG TTCGTCATGG ACGAGGCGCA GACGTTCGCC
CCGTCCGGCC CGTCCACCGC CTGCACCACG AGCACCCTGG CGCTCGCCTC GCAGGCCCGC
AAGTACGGCC TTGGGCTGGT CTTCGCCACC CAGGCGCCGA AGAACCTGCA CAACGGAATC
CCCGGCAACG CCACCACACA GATGTTTGGC CGGCTCAATG CCCCGGTCCA GATCGAAGCG
GCCCGGATGA TGGCCCGCGC CTGCGGCGGC GACGCCCCGG ACATCGGCCG GCTCGCCGTC
GGCGAGTTCT ACGCCACGAG CGAGACCCTG CCATTCATGA AGATCCGCAC ACCGATGTGC
CTGAGCCACC ATCCGCCTTC GCCGCTGACC GCCGAGGAGG TCATCACCCG CGCCCGCCCG
GACTTGACCG CATGA
 
Protein sequence
MSAELEALAA IKLDWVHGLE DVWRDSPVHV DGVHVEPRRR ILAALDRLGE RKPPGIVVQG 
QAGAGKTHLL GWVRQQVERR GGYFFLIGPV SGDAFAESVV SAMLTGLFRT VDGEAEIRRF
LLALCAQFRV PAELTSAVVG DQPLEPTHLK DLVVAFFDHD RALARECQHT LRALVLYSSD
DYDAQAVGET YLRSMDEQEP GERARRGMQA RTRPPMDIAR DLSRLLALTG PSVIAVDQVD
GVVAELTARD EAADSGLGGF GGSGAVRLNM LADGLLVLHE RMERTLCLLA CLPVSWIQFR
ERAVASMRDR FSAIMPLNRL DDPEITARII ERRFQEDFDR IGFAPHYPTW PVARSAFDRA
PQYTPRELLQ RIDRHIQECL STGVVRELFD FAAADENDIV DVQDNGDHDI TDITVSTNFT
DGTDLDGERL DRLDRLGRED RSGGGRGRAA DEVGATSPEV LSALDAQFAE LRAGAPRVAA
VNEDILVPVL LTAGLSAWII ENGGAHDYSL EPMPSTRPAL HARLTRTVDA DTGEQLHWCF
RAITASHGNA ILPRITAACT AAAMVAGNPR RRLFLLRNDP WSPGKKVQQA LAAFSAAGGR
TLRLHAGDLD TFMALRTLLD QRPAGLRAWL ASRRPASGTW LLREALGDAA GTDPTGSAAL
APRAPTMTPG PGRLGSAGDP DPDPDFDFGP EAEPGPESPP QVAPGPRSAA PAGVVRSSFA
SADDLGGGVA APSALTLGTA VRGDAPARID LAVLRRHVTI FAGSGSGKTV LIRRLVEECA
LRGVSSIVLD PNNDLARLGD AWPSPPAGWR DGDAQFAAEY LAATDVVIWT PRRDGGRSLT
FQPLPDFAGL RDDPDEFEAA IDAAVATLAV RANVTGNANR ASLGRAVLTA ALRHFARGNR
TGLTAFIELL AELPDGVADD LDPAGKRSVE LAQTLRAAMV NDPLFGGKGE PVDPGTLLVP
APGQRARVSV ISLVGLQHDE QRQSFVNQLQ LALFSWVKRN PAGNRPLGGL FVMDEAQTFA
PSGPSTACTT STLALASQAR KYGLGLVFAT QAPKNLHNGI PGNATTQMFG RLNAPVQIEA
ARMMARACGG DAPDIGRLAV GEFYATSETL PFMKIRTPMC LSHHPPSPLT AEEVITRARP
DLTA