Gene Franean1_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0643 
Symbol 
ID5669060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp747907 
End bp748977 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content74% 
IMG OID641239570 
Producthypothetical protein 
Protein accessionYP_001505008 
Protein GI158312500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.955311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.552464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACGG GGCGGCCCCC GGACCACTCC GCGGGATACG GGACCGTGTT CACCCCGGCG 
GTGCTCGAGG CGCCGCCGCT CCCGGTTCCC CGTGGCCCCC TCTCCGAGTA CCTCGTCGAG
CTGCTCGGCG GCGACGTCCG GCCCGCCGCC GGGTGGCCGA AGCCGGCGGA CGACGCGCTG
TTCGGCGAGG ACGGGGCCCT CGCGCTGCAC TGCCTGTACG AGCTTCACTA CCGGGGATTC
CGCGGGGTCG ACGACCGGTT CGAGTGGGAG CCCTCGCTGC TCGCGTTGCG GGCGGAGCTC
GAGGGCGACC TGGAGCGCCG CCTGATCGAC CTGGCCGGCC CGGACCCGTC CCCCGCGGGG
GACATCGCCG CGGAGCTGCG CCGGGTGATC TCCCAGCCGG GAGGCCGTTC CCTGTCCGGC
CGCCTCGCCG AGCGGGGCAG CCTCGATCAG TTCCGCGAGT ACGCGGCGCA CCGCTCGCTC
CTCCAGCTGA AGGAGGCCGA CCCGCACACC TGGGCCGTTC CCCGCCTCAC CGGCGCCGCC
AAGGCCGCAC TCGTGGAGAT CCAGGCGGAC GAGTACGGCG GTGGCACCGA GCGGGACATG
CACCAGAACC TCTTCGGCCT GACCATGCTC GAGCTGGGCC TGGACCCCTC GTACGGCGCC
TACGTCGACC GCCTGCCCGG AGGCACCCTG GCCACCGCCA ACGTCCCGAG CTTCTTCGGC
CTGCACCGGC GGTGGCGGGG CGCGCTCGTG GGGCACCTGG CGGTCTTCGA GATGACATCG
GTCGAGCCGA TGGGCGCCTA CGCCGCGGCC CTGCGGCGGC TGGGCCTGCC CTGGAGCGCC
CGGCACTTCT TCGAGGTCCA CGTCGTCGCC GACGCCCACC ACCAGAATCT CGCGGCGGAG
TCACTCGCGG GTGGCCTTGT CCGCGCCGAG CCGGCGCTCG CCCGCGACGT CCTGTTCGGT
GCCCGGGCGA CCATGGCCGT CGAGGGCTAC TGCACGGAGA ACATCCTCGC CGCCTGGGAC
CGCGGGGGGA CGGCCCTGCT TCCCGTCCAG GGAGAAGCGG TCGCCCGCTA G
 
Protein sequence
MRTGRPPDHS AGYGTVFTPA VLEAPPLPVP RGPLSEYLVE LLGGDVRPAA GWPKPADDAL 
FGEDGALALH CLYELHYRGF RGVDDRFEWE PSLLALRAEL EGDLERRLID LAGPDPSPAG
DIAAELRRVI SQPGGRSLSG RLAERGSLDQ FREYAAHRSL LQLKEADPHT WAVPRLTGAA
KAALVEIQAD EYGGGTERDM HQNLFGLTML ELGLDPSYGA YVDRLPGGTL ATANVPSFFG
LHRRWRGALV GHLAVFEMTS VEPMGAYAAA LRRLGLPWSA RHFFEVHVVA DAHHQNLAAE
SLAGGLVRAE PALARDVLFG ARATMAVEGY CTENILAAWD RGGTALLPVQ GEAVAR