Gene Franean1_3979 details

Gene Information

Locus tag: Franean1_3979
Symbol:
ID: 5672340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4765455
End bp: 4766675
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 64%
IMG OID: 641242858
Product: hypothetical protein
Protein accession: YP_001508275
Protein GI: 158315767
COG category:
COG ID:
TIGRFAM ID:
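
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function name `check_gene_record` is hypothetical and not part of the record.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1                  # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon, so it adds no residue
        "protein_matches_length": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_gene_record(4765455, 4766675, 1221, 406)
print(result)
```

Under these assumptions all three checks hold for this record: 4766675 - 4765455 + 1 = 1221 bp, and 1221 / 3 = 407 codons, which is 406 residues plus one stop codon.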


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.98735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.25715
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCTCC CGCTGGCTGC CGGGCCTGGC GCCGAGAGAC GGGAGAGACC TGGTGCTGTT 
CGTCGGGACG ACTGGGCGGA GGACCACCAC GACGTGGATG TGCAGGACGA GACAGGCAGG
CGGCTGACGA AGGCCCGGCT GCCCGAGGGC ACGGCCGGGA TCGCCCGGCT GCACACGCTG
CCCTCCGCCG GAGCGGTATC CCGTCCACTC CCGGGGATCT CGCCTTCGAG CGGCTCACCC
CGACCGGCAT GGCGATCACC GCCGCCACCT GCCAGGCGCC CTGTCCCAAG GCAGTTGACC
TCATCGACCG ATGTGGACGC GTCGACCTGC CTATATTATG AGTATGTGAC CCGATCTGTA
CCGGAAAAGA CGCTAGAGCA TTGGGCCAGC CAGTACATCA CATACCGGTA CAGGTCGAAG
GCCGCCCTCT GGTGGCCCGC GAGGGGGGAG GACATCCAGT TCGGTCGACT TCCGTCAAGA
CCCGGAAAGA TCGTACAGAT CGAGCTGAAG ACCACTGAAG TCGTCCGTCG CGGCGTCCAC
GAGGTGGAGG TCGACCTCGG ACAGCTCTGG GAGTATCGGC GTCTACCGTT GGGAAAGCAA
CCATTCTACG CCTTCCCCCG GCCGGGTGCC GACTGGCCCG GAGATCTCGG CGAGGCCGCC
GCCAAGACAG GCCGTGCCGT CTCCGAACTC GGATACCGCA GGGGTAAGGA ATTGTGGTTC
GCGAACTGGA TGGTCGTGAT GACCACCGAG CAGGTCGCGG ACGTCATGAG CAAGGAGCTT
GCACTGCACG GTTCGGAGAA ACGAGGCGAG CGGCGGCCGC TGGTGCGTTT CGCCGGCAAA
TCCGAACCGA GATGGGGACT TGAAGCGGCC GATCCGGAGG TGATCCGCTG GCGAGACTTC
TGGTCGATTC TCGACCGGTG CGGTCGGGAT CGCTGGCCAC AGTTGATTCG CCTGCCCGCC
GTCTTCCTTG ACATCCGCGA CGGTCGGGAG CTTTCCGGCG GTCGAGAGGT CTACACTCGT
CAGCAGCTCA GGGAGCTTCT GGACGGCGCG GCAGGGGCGC AGGGAGATGT CGGGCACCTG
GTGATTCTCG AACCCAATCC TGATGGTCAC TACCGACGTT CAGATTTCTT GAACAGCAAC
TCTGCCAGAG TAGGTGCAGC TCCCAGTAGA GGCGATACCG AGCAAAACGA CCATCGTGCG
GTCGTCTCGC TTGATGCTTG A
 
Protein sequence
MDLPLAAGPG AERRERPGAV RRDDWAEDHH DVDVQDETGR RLTKARLPEG TAGIARLHTL 
PSAGAVSRPL PGISPSSGSP RPAWRSPPPP ARRPVPRQLT SSTDVDASTC LYYEYVTRSV
PEKTLEHWAS QYITYRYRSK AALWWPARGE DIQFGRLPSR PGKIVQIELK TTEVVRRGVH
EVEVDLGQLW EYRRLPLGKQ PFYAFPRPGA DWPGDLGEAA AKTGRAVSEL GYRRGKELWF
ANWMVVMTTE QVADVMSKEL ALHGSEKRGE RRPLVRFAGK SEPRWGLEAA DPEVIRWRDF
WSILDRCGRD RWPQLIRLPA VFLDIRDGRE LSGGREVYTR QQLRELLDGA AGAQGDVGHL
VILEPNPDGH YRRSDFLNSN SARVGAAPSR GDTEQNDHRA VVSLDA
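
The protein sequence can be reproduced from the gene sequence with translation table 11. The following is a minimal sketch in plain Python, assuming the sequence is pasted with its spaces and line breaks; the helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical. Table 11 uses the standard codon-to-amino-acid assignments, and the set of alternative start codons below is given as an assumption based on the NCBI definition of that table. An initial start codon such as GTG is read as methionine.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a complete coding sequence; the first codon becomes M if it is a start codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGATCTCC CGCTGGCTGC CGGGCCTGGC"))  # MDLPLAAGPG
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives values that can be compared with the Protein sequence block and the GC content field.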