Gene Franean1_5789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5789 
Symbol 
ID5674112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7032077 
End bp7033540 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content78% 
IMG OID641244639 
Producthypothetical protein 
Protein accessionYP_001510041 
Protein GI158317533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.712723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.707323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGACC GACCCGCGGC GCCGGACCAC GACGGCGAAC AGGCGGCCCG TCCGCTCGCC 
ACCTTCGACG CGCTGCTGAC CCCCACGGGA GCGGACCTGC TCGCCGCGGC GGCCGCGGCG
CATGCCGACG GCACGGAGCT GCGCGCCGGT TCCCGGCTGC GCGCCGCCGG CCATCCCCCC
GACCTCGTCG CCGCCGCGTT CGCCCAGGCC GAGCTGCGGG CCCGGGCCGC GGCGAAGTTC
AGCCAGGCCG AGCGGATGTT CTTCACCCGC GCCGGGCTGG AGCAGGCCTC GTCCGAACGG
GCCGCGGCGC ACCGGGCGGC GAGGTTCGCC GGCCTGGGCC GCCTCGCCGA CCTGTGCACC
GGCATCGGCG GTGACCTCCG CGCCCTGGCG GCCGAGCACC CGGTCCTCGC CGTCGACCGC
GACCCACTGC ACCTGCGGAT GGCCGTCCAC AACGCGCACG CGCTCCGGGG CGGCGGGCGG
GAGGGCGGGT CCGGTGGGGT CGAGGCCCTT GCGGCGGACG TCCGGGATGT CAGCCTGACG
GGCGTCGACG GCGTCTTCGT CGATCCCGCG CGGCGGGGCG GCGAGCGGCG GATGTCCGCC
GGCGCCGGTG AGCCCCCGCT GGCCTGGTGC TTCGGGCTGA CCGAGCGGGT CGAGGCCGTC
GCGGTCAAGG CGGCTCCGGG GCTCCCGGTG GAGCTCGTCC CGCCAGGCTG GGAGATCGAG
TTCGTCGCGG ACAGACGTGG CCTCAAGGAG TCCGTGCTGT TCTCGCCCGC GCTCGCGACG
GCGTCCCGGC GGGCGACGGT GCTGCTCGGC CCGGGTGACC GGTCGGTGCC CGGGACCACC
GGGCCACGGG TGGTGACGCT GACCGGTGAG CCGGACCCCG TCGGTGACGA CCGCGACCAC
GGCCGCCCGG GCGGCGGTCA CAGCGGCGGT GGCGGGCCCG TCCCGCTCCC GGTCCGGCCG
CCGGGGGAGT ACCTCTTCGA CCCCAATCCC GCCGTCACCC GGGCCGGGCT GGTCGGCACG
CTCGGCCGCC TGGTCGGCGG GTGGCAGATC GACCCGAGGA TCGCGTTTCT CGGCGCCGAC
CGGCCGCTGG CGACCCCGTT CGCGCGGACG CTGCGGGTCG AGGCCGAGAT GGCCTTCGAC
GTGCGCCGGG TGGCGGCGGC CGTGCGCGAG GCCGGCATCG GCGCACTCGA CGTGCGCCGG
CGCGGACTCG CCGGGGATGT GGACGCCATC CGGCGGCGGC TGATGCCGGC CCGGCGGCAC
CTGGTCCCGG GGGGCCGCAC CATGACGGTG GTGATGACCC GGGTGCGGGA CGAGCCGTGG
GCTCTTCTGT GTACCGACAT CACGCAGGCC GATCGGGCGG ATCAAGCCGA TCGGGCTGAT
GCGGCGGATC GGGCCGGGCG GGCGGATCCG CCGGATCTGG CGGACCGGAC GGCGGGGGCC
CGGGGCGGTC CGGCCAGGGA CTGA
 
Protein sequence
MSDRPAAPDH DGEQAARPLA TFDALLTPTG ADLLAAAAAA HADGTELRAG SRLRAAGHPP 
DLVAAAFAQA ELRARAAAKF SQAERMFFTR AGLEQASSER AAAHRAARFA GLGRLADLCT
GIGGDLRALA AEHPVLAVDR DPLHLRMAVH NAHALRGGGR EGGSGGVEAL AADVRDVSLT
GVDGVFVDPA RRGGERRMSA GAGEPPLAWC FGLTERVEAV AVKAAPGLPV ELVPPGWEIE
FVADRRGLKE SVLFSPALAT ASRRATVLLG PGDRSVPGTT GPRVVTLTGE PDPVGDDRDH
GRPGGGHSGG GGPVPLPVRP PGEYLFDPNP AVTRAGLVGT LGRLVGGWQI DPRIAFLGAD
RPLATPFART LRVEAEMAFD VRRVAAAVRE AGIGALDVRR RGLAGDVDAI RRRLMPARRH
LVPGGRTMTV VMTRVRDEPW ALLCTDITQA DRADQADRAD AADRAGRADP PDLADRTAGA
RGGPARD