Gene Franean1_4861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4861 
Symbol 
ID5673201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5829681 
End bp5831099 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content75% 
IMG OID641243716 
ProductHNH endonuclease 
Protein accessionYP_001509132 
Protein GI158316624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.254841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAAC AGGTACCTTC CCCGACAGCC GCGTCCCCCA CGCCTGACAT CGGCCTGATG 
CCGGGCCTGG ACGCGGAGAC CGCGCCGCTG GCGGATCTGG AAGCCGCGAT CTGCGGGTGG
GCAGGGCGGC TCGCGGCTGC GACCTGCGGC TGGCTGGTCC TGCTCTCCGC CTTCGAGCGG
CGGGGCGGCT GGTCGGGGAT CGGGCTGCGC TCATGTGCGC ACTGGCTGTC CTGGCGCTGC
GGGATCGGGC TGCGTGCCGC CCGGGAGCAC CTGGCCACGG CCCGCGCCCT CGAACAACTC
CCCGCGGTGC GGGCGGCGTT CGCCGACGGA GCGGTCTCCT ATTCGAAGGT CCGGGCGATC
ACCCGGATCG CCGACCCGAC CACCGAACTG CTCTGGCTCG AACACGCCCT GCACTGCACC
GCAAGCCAGC TGGAACGCCT CGTCCGCACC CTCCGCCAGA CCACCACCGA CCCCGCCGAC
CGTGCCAGGA CGCAGGCGGC CCGGCGGGTC TCCTGGCGCA CGGACGACGA CGGCATGCTG
CACCTGACCG CGGTCCTGCC TCCCGACGAA GGCGCCCAGC TCGTCGCAGC GCTCGACGCC
GCCCGCGCCA GCCTCGACAC CACCACCACC GGCACCACCG GCACCGGCAC CGGCACCGAC
GCCGACGCCG ACGCCGACGC CGGCCAGCCG CCTCCCGACG GGGAGGTCGT TGCCGCGCCG
CGGGACCGCC GACGCGACGC CGACGCGCTC GTCGCTCTCG CCGAGGGGTT CCTGCAACGG
CCAGCTCCCG GACTGACCTC GCCCGCCCAC ACGCTCACTG TGCACGTCGA CGCGGCGACC
CTGCTGGACG CCGCACGGCC ACCGCGACCC GGGCCCGGGT CGCGCGCGGA GATCTCACCC
GGGATCGGCC TGTCCTCCGC CGTCCTGCGC CGGCTCGGTT GCGACGGGCT GATCCGCGCC
CTGGTCACCG ACACCCACGG CAACCCGCTG CGGCTGGGCC GGCGCCGCCG GCTGCCGAAC
CGGCAGCTCC GGGACGCGGT CCACGCCCGG GACAGGGGCA CCTGCCAGTA CCCGGGCTGC
GCACACACCC GGTGGCTGCA CATCCACCAT CTCGTTCCCT GGATCGAGGG CGGCGGCACC
GACATCGACA ACCTCACCCT CGTCTGCGGC GCACACCACC GCACCCTGCA CGACGAGGAC
ATCAAGCTCC GCAGAACCAC CACCGGGCGG ATCGTCGCCC TGCTTCCCGA CGGCCGCACG
CTCGACCCGG CGCCGCCCGC CAATCCGGGG GCCCGACCCG CCGAGGTCCT CGCCGAGGCC
ACTCGGCACG TGGCGCCAGA CGCGATCGTC ACCTGGAACG GCGGCCCGTT CCACCTCGAC
GACTCGATCC GCGCACTCCT GCAGGATCAG GCCGCGTGA
 
Protein sequence
MIEQVPSPTA ASPTPDIGLM PGLDAETAPL ADLEAAICGW AGRLAAATCG WLVLLSAFER 
RGGWSGIGLR SCAHWLSWRC GIGLRAAREH LATARALEQL PAVRAAFADG AVSYSKVRAI
TRIADPTTEL LWLEHALHCT ASQLERLVRT LRQTTTDPAD RARTQAARRV SWRTDDDGML
HLTAVLPPDE GAQLVAALDA ARASLDTTTT GTTGTGTGTD ADADADAGQP PPDGEVVAAP
RDRRRDADAL VALAEGFLQR PAPGLTSPAH TLTVHVDAAT LLDAARPPRP GPGSRAEISP
GIGLSSAVLR RLGCDGLIRA LVTDTHGNPL RLGRRRRLPN RQLRDAVHAR DRGTCQYPGC
AHTRWLHIHH LVPWIEGGGT DIDNLTLVCG AHHRTLHDED IKLRRTTTGR IVALLPDGRT
LDPAPPANPG ARPAEVLAEA TRHVAPDAIV TWNGGPFHLD DSIRALLQDQ AA