Gene Franean1_7048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7048 
Symbol 
ID5675359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8601130 
End bp8602725 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content74% 
IMG OID641245894 
Producttransposase IS66 
Protein accessionYP_001511285 
Protein GI158318777 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.652427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGTCG CGGTCGAGGG TGATGGCGTG ACGCTGGCGG GGGTGCTGGC GGAGAACGCC 
TGGCTGCGTG GCCAGTTGGC CGAGCGGGAC GCCGAGATCG CGGCGCTGCG GGCGCGGGAC
GCCGAGCGGG AGACCGAGCT TGAGGCGTTG CGGGCGGAGC TCGTGGTGCT GCGGAAGGTC
GTGTTCGGTC GGTCGTCGGA GAGGGGCGCC GGGCCGGCGC CTGCGCCGGC GGGGCGGGAT
GGCACGGACG GTGGCCAGCT GGCGGGCGGT CGGGAGGCCG CGGGCCGGGA GGCGCCGCGG
CGTGGGCCGG GGGCGCGGGC GGGTCGGCGG GACTACGGCG GTCTGCCGCG CCGGGATCTC
GACTGTGACT TCCCCTCGGG TGGCTATGCC TGCCTGGAGT GCGGGACGCT GTTCACGCCG
TTGGGTGAGC ACCGGGTCGA GCAGGTGGAC TGGCGGGTGC TCGTCGAGCT GCTGGTCTCC
CACCGGCGCC GCTACCGGCG GGGGTGTGGC TGCGGCGGGC CGGTGACGGT GACCGCGCCG
GGCCCGTCGA AGGCGGTCGG GCGGGGCCTG TTCACCAACC GGTTCCTCGC GATGCTGCTG
GTGGAGCGGT ATGTCGCGGG CCGGTCCCAG AACTCGCTGG TCACCGGACT GGCACGCCAC
GGCGCCCAGA TCTCGCCGGC GACCCTGACG GGGGCGTGCG CGCAGGTCGC GGGCCTACTC
GCCCCGCTCG CGGAGAAGAT CGTCGCCCGG TCGCGGGGGT CGTGGCACCT GCACGCCGAC
GAGACGACCC GGCGGGTGTT CACCCCGGAC AGCGCCGGCG GGCCGGCCCG CCGGTGGCTG
TGGGTGTTCC TCGGCCCGGA CTCGGTGTGC TTCGTGATGG ACCCGTCCCG CTCGGCGGCG
GTGCTCGCCG GGCATGCCGG GATCAGCGAG GCCACCGGCC AGCTCGACGG GGACGACGGC
GCCGGCGGCC CGCGCCAGCT GGTGATCTCC TCGGACTTCT ACGCCGTCTA CGCCTGCGCT
GGCCGCAAGG CGGACGGGAT CGTCAACCTG TTCTGCTGGG CCCACGTCCG CCGGTACTTC
ATCCGGGCCG GGGACGCGAA CCCCGCCCAG CTCGGGATCT GGGCCCGCCA CTGGCGCGAG
CAGTTCGGCG CGCTCTACCA AGCGCACGCC GAACTCGCCG ACGCCTGGCA GACCGCGGCC
AGCGCCCCGA GCCCGGCGGC CGAGCGCCGC CTCGCCGCCG CCCACGCCAC CTGGGACGCC
GCGATCGGGG CGATCGACAC CGCCCGCCGC GAGCAGACAG CCTCCCCCGG CCTACAGGAA
CCCGCGAAGA AAGCCCTGGC CACGATGGAC CGGGAATGGG ACGGGCTGAT CGCCCACCGC
GACTACCCCA TGATCGGGCT GGACAACAAC CCAGCCGAGA GAATGATCCG CAAACCGGTG
ATCACACGGC GCAATACCGG CGGCTCCCGC ACCGACGACG CCGCCTGTCG GCATGCCCAC
ACGCAACTTC CGACTACTTA CGTGAAGAGT GAAGAGAAGG TTTTCGGGTC GCCCTCGGGT
CGGGCATCCT CATCGCAGGG CCGGCCAGGG AGGTGA
 
Protein sequence
MGVAVEGDGV TLAGVLAENA WLRGQLAERD AEIAALRARD AERETELEAL RAELVVLRKV 
VFGRSSERGA GPAPAPAGRD GTDGGQLAGG REAAGREAPR RGPGARAGRR DYGGLPRRDL
DCDFPSGGYA CLECGTLFTP LGEHRVEQVD WRVLVELLVS HRRRYRRGCG CGGPVTVTAP
GPSKAVGRGL FTNRFLAMLL VERYVAGRSQ NSLVTGLARH GAQISPATLT GACAQVAGLL
APLAEKIVAR SRGSWHLHAD ETTRRVFTPD SAGGPARRWL WVFLGPDSVC FVMDPSRSAA
VLAGHAGISE ATGQLDGDDG AGGPRQLVIS SDFYAVYACA GRKADGIVNL FCWAHVRRYF
IRAGDANPAQ LGIWARHWRE QFGALYQAHA ELADAWQTAA SAPSPAAERR LAAAHATWDA
AIGAIDTARR EQTASPGLQE PAKKALATMD REWDGLIAHR DYPMIGLDNN PAERMIRKPV
ITRRNTGGSR TDDAACRHAH TQLPTTYVKS EEKVFGSPSG RASSSQGRPG R