Gene Franean1_0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0433 
Symbol 
ID5668856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp512058 
End bp513767 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content69% 
IMG OID641239365 
Producthypothetical protein 
Protein accessionYP_001504804 
Protein GI158312296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACGC ATTTCATCAC CGTCGGGCAG GTGCTCGACG CGCCGGGCGG ACGAGACGTA 
CTGGAACGGT TCCTGCCGCA GGCCGTCGAC CGCGCCGACG TCCGTGAGCT TCTCGTCCTG
TTCTTCCTGC GGGTGACCCC GGGGCTGCGC GATGACGAGC AGGCCAGGGC GGCGTTCTGG
GCCGAGATCG ACGCGCTGAT GGAACCGGTC ATTCTCCGGC CGCACGCCGC CGCCATCGCG
CCATCGGCGG TCGGCGTATC GGCGCCGCAC GCGTCGGCGC CGTGGACCGT CGCAGGCTCA
CCCACGCGGT GGGGGCTGCT GGAGATCCAG CTGAGCGGTC CGTCGGACGG GAACCCGTTC
ATCGACGTCG AGCTGACCGC CGAGTTCCGG TGCGGCGAGC GGTCCTGGAC GGTCGGCGGC
TTCTATGACG GCGACGGAAC CTACCGGCTG CGGGCGCTCG CCGAACAGGA GGGGACGTGG
CAGTTCGTGA CCTCCTCCAC GACCCCGGCG CTGGACGGGA TCGAGGGCGA GGTGACCGTA
GGCCCGGCGG CGCCGGGAGC GCACGGGCCG GTCCGGGTGG ACGGCTTCCA TTTTGCGTAC
GCGGACGGAA CCCGTTACCG GCCGTGGGGG ACCACCGCGT ACGCCTGGAA CCACCAGGAC
GACAAGACGC AGGAGCAGAC GCTGGAAACG CTGGCCGCGT CGCCCTTCAC CAAGCTGCGT
ATGTGCCTGT TCCCCAAGCA TTTCGTCTTC AACAACGCCG AGCCCGTCCA GTTCCCGTTC
CCACGGGTCA ACGGCTCCTT CGACCACACG CGGTTCGACG TCGAGTTCTT CGCGCGCCTC
GACGAGCAGG TTCGCCGTCT CGGTGAACTC GGCATCGAGG CCGACGTCAT CCTCTTCCAC
CCGTACGACA AATGGGGTTT CTCCGATCTG GGACGGGCGG TGGACGAGCG AGTCGTCAGG
TACCTCGTGC GGCGCCTGGC CGGCTACGCG CACGTCTGGT TCTCGCTGGC CAACGAGTAC
GACGCGGTAC CCGGCAAGAC GATCGCGGAC TGGGACCGGA TCGGTGAGAC CCTCACCGCC
GAGGACCCGC ACGGGCACCC GGTGTCCATC CACAACTTCA TCGAGCACTT CGACCACACC
CGGCCGTGGA TCACCCACGC CAGCGTCCAG CACGGCCGGG TCGAGGAGGT CACCGGCTGG
CGCGAGCGCT GGGGCAAGCC GGTGGTCATC GACGAGACCG GTTACGAGGG CGACCTCGAG
TTCGACTGGG GCAACCTGAC CGGCGAGGAG ATGCTGCGCC GATTCTGGGA AGGCGCGGTG
CGCGGCGGTT ACGTCGGTCA CGGCGAAACG TACTGGAACG CCGAGGAGAG GATCTGGTGG
GCGAAGGGCG GTCAGTTGAC CGGCACGAGC CCGCGCCGGA TCGGCTTCCT GGCCGAGATC
GTCGCCGCCT CACCGACCGG TGTGCTCGAG CCTCTACCGT CCGACTACGA CCTGCCGTGG
GCCGGTGTCC AGGACGAGTA CCTGGTCAAC TACTACGGCC TCGGCCGGCC GAGGGAGCGT
CATATCCTCC TGCCCCCGGG CCGCTGGCAC GTTGACGTCC TGGACACCTG GGAGTGCACC
GTCGAGCGGC TGCCCGGCAC GTACGAAACC CTCGCGGTCG TCCCGCTGCC GGCCAAGCCC
TACCAAGCCG TGCGGCTGGT GAAGGCCTGA
 
Protein sequence
MLTHFITVGQ VLDAPGGRDV LERFLPQAVD RADVRELLVL FFLRVTPGLR DDEQARAAFW 
AEIDALMEPV ILRPHAAAIA PSAVGVSAPH ASAPWTVAGS PTRWGLLEIQ LSGPSDGNPF
IDVELTAEFR CGERSWTVGG FYDGDGTYRL RALAEQEGTW QFVTSSTTPA LDGIEGEVTV
GPAAPGAHGP VRVDGFHFAY ADGTRYRPWG TTAYAWNHQD DKTQEQTLET LAASPFTKLR
MCLFPKHFVF NNAEPVQFPF PRVNGSFDHT RFDVEFFARL DEQVRRLGEL GIEADVILFH
PYDKWGFSDL GRAVDERVVR YLVRRLAGYA HVWFSLANEY DAVPGKTIAD WDRIGETLTA
EDPHGHPVSI HNFIEHFDHT RPWITHASVQ HGRVEEVTGW RERWGKPVVI DETGYEGDLE
FDWGNLTGEE MLRRFWEGAV RGGYVGHGET YWNAEERIWW AKGGQLTGTS PRRIGFLAEI
VAASPTGVLE PLPSDYDLPW AGVQDEYLVN YYGLGRPRER HILLPPGRWH VDVLDTWECT
VERLPGTYET LAVVPLPAKP YQAVRLVKA