Gene Franean1_3806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3806 
Symbol 
ID5672170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4518953 
End bp4520536 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content77% 
IMG OID641242685 
ProductGTP-binding protein, HSR1-related 
Protein accessionYP_001508105 
Protein GI158315597 
COG category[R] General function prediction only 
COG ID[COG1159] GTPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0425817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.277341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCC CCGGGCAGGG CGGGCTGCGG GAACAGGTCC GCGCGCTGCT GTCCGACGCG 
GTCGACGCCT ACCGCGGCAC CCCCGCCGAG GCGCTGCTCC GTCATGAGCT CGGCAGGGTC
GACGGGCCGC TGCGGGTGGC CATCGCCGGG CGGGTGAAGG CCGGCAAGTC CACCCTGCTC
AACGCCCTGG TCGGCGAGCA GGTGGCGGCG ACGGACGCGA CCGAGTGCAC CCGTGTCGTG
ACCGCCTACA CGGAAGGCGA GCGGGAGGCC GCCTGGGCGT ATCTGCGGGA GGGATCAGTG
GTCGAGGTGG CGCTGCGCCG GACACCGGGC AGCACCCACG TCGACCTGGC CGGCCACGGG
GAGGGCGAGG TCGAGGCGCT GCGGGTCGAG CTGCCGAGCC CGCGGTTGCG GCGGCTGACG
CTCATCGACA CGCCGGGCAT CGCGTCCCTG TCGGAGGAGC TCTCCCGGCA CACGGAAAGC
TTCCTGCTGC CGGGCTCGGG GTGGCAGGCC GGACGCCCGC CGGGGCGGGC GCCCCGCGGG
CCGTCCGCCC CGGCCGAGGC GGCGGGCGAT GCCGCCGCCG GAGGCATCGG GGTCGGCGCC
GACGCCGTGC TCTACCTGCT GCGCTTCCTG CACGCCTCGG ACGTCGGCTT CCTGGAGTCG
TTCCGCCGGA CCGGGGTGGG CGAGGCGACA CCCGCCCATG CCATCGGCGT GCTCTCCCGG
GCGGACGAGG TCGCGCCCGG CCGCACCGAG TCGGTGGACC TCGCCCGACG CGCCGCCGCC
GACATCGGCC GGGACGACCG GGTACGCGCG CTGGTGCAGA CCGTCGTCCC GGTCGCCGGC
CTGCTCGCCC AGGCGGGCAG CCGGCTCACG ACGGACGAGC TCGCCCAGTT CCTGGTGCTG
GCGGCGGAAC CGGCCGAGGT GGCGGACGAG TTGCTGCTGT CGGCTGACCG GTTCGCCTTC
CCGGCGGCGG CCACCGGCGT CCCGGCACCG CTGCGCGCGC TGCTGCTCGA CCGGCTCGGG
CTGGCCGGCG TGCGGCTGGC GGTGGCGCTC GTCCGGCTCG GGGAGGCGCG CGACCCGGCC
GGGCTGGCCG CCGAGCTCGC CGCCCGCAGC GGCCTGCCCG AACTGCGGGC GCTGCTGCGC
GACCAGTTCA CCGACCGGGC CGACGTGCTG AAGGCACAGC ATGCGCTGGG CGTGCTGGAC
GACGTACTGG ACGAGTTTCC GCACCGCTCG GTCGCGGCGC TGCGGGCCCG CCGGGAACGT
CTCGAAGCCG GCGCACACGC GCTCGCCGAG CTGAGGCTGC TCGGCGACCT GCGTACCGGC
GTGGCCGACG TCGCCTCGCT CGGTGAGGAC CGCCGCCGGG AGATGGAGCG GCTGCTCGGC
ACGGACAGCA CCGGCGCGCA CGCCCGGCTC GGCCTGCCCC CCGGCGCCGG CCCGGACGAG
GTGCGGGCGG CCGCGCTGGA GGCCCTGGAC ACCTACCGGC GCCTGTCCGA GAACAGGCTG
GCGGCCCGTC CCATCCGGCG GGCCGCGACG GTGGTCCGCC GGAGCTGTGA GGGCCTGCTG
CGCACCACCA GCCAGACGCG GTGA
 
Protein sequence
MTGPGQGGLR EQVRALLSDA VDAYRGTPAE ALLRHELGRV DGPLRVAIAG RVKAGKSTLL 
NALVGEQVAA TDATECTRVV TAYTEGEREA AWAYLREGSV VEVALRRTPG STHVDLAGHG
EGEVEALRVE LPSPRLRRLT LIDTPGIASL SEELSRHTES FLLPGSGWQA GRPPGRAPRG
PSAPAEAAGD AAAGGIGVGA DAVLYLLRFL HASDVGFLES FRRTGVGEAT PAHAIGVLSR
ADEVAPGRTE SVDLARRAAA DIGRDDRVRA LVQTVVPVAG LLAQAGSRLT TDELAQFLVL
AAEPAEVADE LLLSADRFAF PAAATGVPAP LRALLLDRLG LAGVRLAVAL VRLGEARDPA
GLAAELAARS GLPELRALLR DQFTDRADVL KAQHALGVLD DVLDEFPHRS VAALRARRER
LEAGAHALAE LRLLGDLRTG VADVASLGED RRREMERLLG TDSTGAHARL GLPPGAGPDE
VRAAALEALD TYRRLSENRL AARPIRRAAT VVRRSCEGLL RTTSQTR