Gene Franean1_5863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5863 
Symbol 
ID5674186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7112784 
End bp7115963 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content79% 
IMG OID641244713 
Producthypothetical protein 
Protein accessionYP_001510115 
Protein GI158317607 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0962596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.213292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAC CAACGCGGGA TGCCGACCAG CACCGGTGGG CGGTTCGTTC CCGAGGAGCC 
GTTCGTCCCG GTCAGCGGGT CGTAGCGGTC GTCGTGACGC GGCGGGATCA GCCCGCGGCG
CCCGGCGGCC ACGACCCGCG ACGAAACGGT CTCGGCAGTA CTGGCACCGC TGCGGCGGTC
GCTGCGGGCC CGGTGGGTCC GGTGGTCGCG CTGACCGCGT CGACCCGCCC GCCCGAGCGG
ATCGTGGTCG TCCGTCTCGG CGGCCCCGCG ATCCCGGCGC AGCGAATCCC GGCGCAGTGG
GTTCCGGCGC AGTGGGTTCC GGCGGCAGAC AGCCCCACGC AGCCGAGCCC GGACCCTGAC
GTCCCGAGCC TTCCCGCCCG AGACGTCCCG GGCCTGCCCG CTGGGGACGT CCTGAGCCTG
CCTGCCGGGA CGTCCTTCGG TGTCGCGGTC GCGGCCGGTA TCCGGCACGG CGGGCTGGGC
GGCGCGCCCG ACACCTTCTA CTGGCTGATC CACGACGGCG TCCTGCCACG GCCCGACGCC
CTGGAACGAC TGCTCACCTA CGCCCGCGTC GACCCGGGCG CGGCGGTCCT CGGTCCCAAG
GTGCTCGACG CCGCCCGACC GGAATTCCTG CTCGAGGCGG GTGTGACGGT CGACCGGGCC
GGCCGGCGGA TCACCGGTGT CGTCCCGGGC GTGCCCGACC ACGGCCAGCA TGACGCCGTG
CGGGACGTGC TGGCCGTCTC CTGCACGGGG ATGCTCGTGC GAGCCTCGGC ATGGGAGCGG
CTCGGCGGCC TGGCAGCGGA CATGACCGCC GGCCTCGATC TGGATCTCGG CGCCCGCGCG
GCGCGGGCGG GCCTGCGGGT CGTCGTCGTG CCCCCGGCGG CGGTGCTGCT CGCGCCGCCC
GCACGTACCG ACGCGCCCCT CGCTGACAGG CCCCTCGCGT CCGCCGCGTC CCGCGCGGGC
GACCGTGTCG GGAGGGTCGC GGGAGCGCGG GTGCGACTCG CGCTGACGGC GGCGCCCTTC
CTCGTACCGG CCGTGCTCGC GTTGGTGGTC GCGGGCGCGG CGCGGGCCGG TGCCCGCCTG
CTCCTCCCGC CCCGGGACAG CGGTGATCTG AGCGGCCCGC GTTCCCGCGG CCGGTGGCGC
GACGCGTTGA CCGAGCTGTG GATCGCCGGT GCCGTCCTCG CCGGCCCGCG GCCCCTGTTC
CGGATGCGCG TGCGGTTCGG CCGGTGCGTG ACGGTGCCGC GGCGCGCCGT GCGCGAGCTG
CTGTCGGCCC GACCCCGTGC GGTCGCCGAC CGGGCGCCGA GTGGCCCCCG CGCGGCCGCC
CCGCCGCGGG CGCTCGCGGT GGTGATGGCC CTGTTCCTGG GGGCGGCTGG CCTGGCCGTC
CGCCGGTTGC CACCGGACGC CGTCGGCGGT GGCGTGCCGA TGCCCGAGTA CGCCGGTGAC
CTGTGGTCCG TGGTGTGGTC CGGTTGGCAG GGCTCCGCGG GCGGCGTGCT CGGCTGGCCG
GGCTCCGCAC CGCCCTGGAC GGCTCTGCTC GCCGCGCTGT CGTCCGTCGC CGCCCCGGTC
GGTCTCAGCG TGGCCGCGAC CTGCTCGGTC GTGCTCGTCG TCGCTCCGGC CGCGGCGACC
TACCTCGCGT ACCGGGCCTC GGGACGGCTC GTCGGCTCGC GCGGCCCGAG GCTCGGCCTG
GCGGCCCTGT ACGGCGTGTC CCCGCCGGTC ACGGGATCAG TGCTGGCCGG CCGGATCGAG
ACCGCCGTCG CCCTGGCGGT GCTGCCCGCC GTTCTCGCGG CGGGGGACGA GTTCCTCCGT
GGCCGGACGG TCCGGGCCGG AGCGAGCGGT TCCGGTTCCG GAACCGGCCT CGACGCCGAC
GCCGGTGGGC GCGGGCGCGC GGGCTGGCGG CTCGCCGTCT CCCTGGCCCT CGTCGTCGCG
TGCATGCCGG TGCTGGCACC GATCGCGCTG GTCGGCCTGC CCACGGCGGC CTGGGCCGTC
CGGAGATCAC GGCCGCCGGG CACGCAACCG CCGCCGGTCG GGCCGGTGCC GGTACTTGGT
GTGCTGATCG CGGGAGCCGT CCCCCTGCTG CCCGGCCTCG TCACGGGGGG CGTCGGGTGG
TGGCCCGCCG CTGTGTCGCC CCTGTCGGGC TCGGAGCTGA CCGGCCTGGT GGCCGGCGCT
CCCGGTGACG AGAGGCGGGC CGCGGCGCTC TGCCTCTTCG TGGCGCTGTG CCTCTTCGCG
GCGCTGTGCC TGCTGCTCAC CGCCTGCCCC CGCCTGCTCG GCCGTCGGGC GTCCTCCGGG
GTCGGTGGTG TGGCCACCGG CTGGGCGCCG GCCACGGCGC GGGTCGGGGC GGTCCTGCTC
GGCGGATGTG TGCTGGCGTT CGCCGCCGGC CTGGCCACCG CGACGCCGAT TCCGGGTCCC
GCGCAGGAGG AGAGCCCGGC CCGCGCGGTC GCCGCGCTGA GCCGGACGGG CGGACCAGGC
GCGCGCGTCC TGGTGCTGCG CCGGTCCGGC CCGTCGGGGC CGGTCGCGTA CAGCCTGGCC
GCCCAGGGCG GGCCGAGGTT CCCCGCCGCC GCCCCGCACC GGCCGCCCTC CGCTGCCGAC
CGGGCGCTCG CGACGCTCGT GGCGGACATC TCCGCCGGTC TCCCGGACGC CGCCGACGCG
CTGCCGGCGT TCGGTGTGGC CGCGGTCGTC GTCCCGGCGG GTTCGGCCGA CCCGGCGCTG
GTGGCCGCAC TCGACGCGGT GGACGGCCTC TCCCGGGAGA GGCGCGGCCC TGACGTGCTG
CTGTGGCGCC CGGTCGCCGC CGCCCCCGGT GCCGGGGAAG GCACGACCCT GCTCCGGCTC
GGCTCGCCCG GTCCGGCCGG ATCGACCCGG CTGCCCGGCG GCGCCGCGGG GCGCCGGGTG
GTGCTCGCCG AACCCGCCGA TTCCGGCTGG CGGGCGACCC TGGACGGCGC GCCGCTGCCC
GCGGCGGTCG CGGACGGCTG GGCGCAGGCG TTCGTCCTGC CGGCCGAGGG CGGCCTGCTC
GAGGTCTCCT ACGATCACCA CCGCCACCGC GCCGCCGTGC TGGCGACCGC CGCAGCCGGG
GCGCTCCTGC TGGCCGCGGG CCCCTTGGCC GCCGTCCCGC GCCGGCGCAG TGGCCGCCGC
GCGGCGGCCA CGCGGGCGGC GGCCACGCGG GCGGCGGCCA CGCGGGCGGA GACCACGTGA
 
Protein sequence
MTRPTRDADQ HRWAVRSRGA VRPGQRVVAV VVTRRDQPAA PGGHDPRRNG LGSTGTAAAV 
AAGPVGPVVA LTASTRPPER IVVVRLGGPA IPAQRIPAQW VPAQWVPAAD SPTQPSPDPD
VPSLPARDVP GLPAGDVLSL PAGTSFGVAV AAGIRHGGLG GAPDTFYWLI HDGVLPRPDA
LERLLTYARV DPGAAVLGPK VLDAARPEFL LEAGVTVDRA GRRITGVVPG VPDHGQHDAV
RDVLAVSCTG MLVRASAWER LGGLAADMTA GLDLDLGARA ARAGLRVVVV PPAAVLLAPP
ARTDAPLADR PLASAASRAG DRVGRVAGAR VRLALTAAPF LVPAVLALVV AGAARAGARL
LLPPRDSGDL SGPRSRGRWR DALTELWIAG AVLAGPRPLF RMRVRFGRCV TVPRRAVREL
LSARPRAVAD RAPSGPRAAA PPRALAVVMA LFLGAAGLAV RRLPPDAVGG GVPMPEYAGD
LWSVVWSGWQ GSAGGVLGWP GSAPPWTALL AALSSVAAPV GLSVAATCSV VLVVAPAAAT
YLAYRASGRL VGSRGPRLGL AALYGVSPPV TGSVLAGRIE TAVALAVLPA VLAAGDEFLR
GRTVRAGASG SGSGTGLDAD AGGRGRAGWR LAVSLALVVA CMPVLAPIAL VGLPTAAWAV
RRSRPPGTQP PPVGPVPVLG VLIAGAVPLL PGLVTGGVGW WPAAVSPLSG SELTGLVAGA
PGDERRAAAL CLFVALCLFA ALCLLLTACP RLLGRRASSG VGGVATGWAP ATARVGAVLL
GGCVLAFAAG LATATPIPGP AQEESPARAV AALSRTGGPG ARVLVLRRSG PSGPVAYSLA
AQGGPRFPAA APHRPPSAAD RALATLVADI SAGLPDAADA LPAFGVAAVV VPAGSADPAL
VAALDAVDGL SRERRGPDVL LWRPVAAAPG AGEGTTLLRL GSPGPAGSTR LPGGAAGRRV
VLAEPADSGW RATLDGAPLP AAVADGWAQA FVLPAEGGLL EVSYDHHRHR AAVLATAAAG
ALLLAAGPLA AVPRRRSGRR AAATRAAATR AAATRAETT