Gene Franean1_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1251 
Symbol 
ID5669664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1506549 
End bp1508816 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content79% 
IMG OID641240183 
Productputative DNA-binding protein 
Protein accessionYP_001505611 
Protein GI158313103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00288406 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0497189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAGC TGCGGGCGGT CGCACTCAGC GAGGACGGCG GTTATCTCGT GCTCGCCGAC 
GCCGCCGGGC GCGCGGACGC GGAACAGTTC CGCGTCGCGG TGGACGATCG GCTGCGCGCG
GCGCTGCGCG GAGCCCGACG TAGTGAAGTA CGCGCGGAAA GCGCGCTGAC CCCCCGTGAG
ATCCAGGCCC GACTTCGGGC CGGTGAGACC GCCGCCGACG TCGCGCGGGC CGCGGGCATC
CCGGTGGAGC GGGTCGAGCG CTACGAGGGG CCGGTGCTGG CCGAACGCGC CCGGGTGGTC
CAGGAGGCGC GCGCCGCGCT GCTGCCCAAG GATCCGGGCG GGGTGCCCGG CCGTCCCCTC
GGCGAGGTGG TCGACGCACG GCTGCTCAGC GTGCAGGACG ACCCGGACAC CGCCCAGTGG
GACGCCTGGC GCCGGGTGGA CGGGATCTGG CTGGTCCAGC TCACCTCGGA CAGCAGGTGC
GCCCGGTGGA CGTGGGACCC GGTGGTGCGC CGGGTGCGCC CGCACGACGA CGCGGCCCGG
GCGCTGGTCG CCCCCGAATC AGCGGAGCCG CCCGCCCCGC AGCCGGCGCA GCCGCCCGTG
CGCGCCGCCG GCCCGGCGCT GACCCTGGTG CACGACCAGG GCGCGGCGGC GGCCTACCCG
ACGCAGTCCG CGGCCGGTGG ACCCGGCCAG CAGCTTCCGG GAACGCAGCC CTCGACGGCT
CCCCCCTCGG CCTTCCCCGC TCCGGCGGCC GTCAACGGGA CGGGCTACCT CCCCAGCGCG
GCTGCGCCGG GGTACGGGCC GTCGCCGGCC CAGGGGCCCG CGGGCGAGTC TCCGGACCGC
CGCCCGGCCG AGCGGGCCGG CGACGACTAC GCCACGGACC ACACCACTCC TGACCACGGG
GCGCCCGACT ACGCGACCCC CGGCTACGAG AACACCAGTT ACGAAGCCTC CGGCTATGGA
GGCTCCGCCC AGGAAGCCGC CGGCCAGGAC GCCGCGACGC GGGCGAACAC GGGGCAGGGA
GCCGGCGGCC ACGGCACCTC GCCGGACCAG GCTGGTCCGG ACGAGTACCA GGGCCCGGTG
GCCGCCCCCG CCGCGAGCCG TACGGCCGGC GCCGCGCGGC GCAGCATGCC CGGTGCGTGG
CCGGCGGCCC AGGGCGGGGC CCCAGGCCTG TCCGGGCGGC ACAGCGGCGC GGGAGGCCGG
CGCACCACGC CCCCGAGCCG GTTCGGCTCC CGGGAGCGGG CCGTTCCGCC GCCGACACCG
GCGGCCGAGC CGAGGTGGCT GCCGAACCCG GATACCGAGT ACTCCGAGGT GACCGCCACC
GCCGTCGACG CGGCGGCGGC CAACGAGGCG GAGTTCGGCC ACGCCGGCGC GGCTGACATG
GAGTCCACCA AGACAGACCT GGTCGCGGTG GCCGAGGCGG CCCTCGCCCC GGCGCCGGAA
TCGGCCGAGC CCGGCATCCC GAGCGAGGAC GCCCAGGACA CCCCGGGCGG CTCGGCCGAC
CCGGACGAGA CCGTTCAGGC CGCGCACGCC GCCGGTCTGG ACGAGGACGA CGAAGACGAC
GATGAGGACA CCGAGGACGG CGGCGACCGG TCGGTGAGCG AGGCGGCGCC CGGCGGGGCC
GGGCGGCCCG CGGCTCCCGC GCCGACAGGG CAGCCGGTGC TCCGACCGGC CGCGATCGTG
CCGCCTGCGG CCGCGGAGCC GGCGCGGGAA CCCACGACGC CGGTGGAGCG GGTGCCGCGC
CGTCCGGCGG CCGCCGCCGC CCGCCGGCCA GCTGTCGCCC GCACCGCTGG CGGTGCCCGT
TCACTCGGGG CGCTCGCGGC AGCCGCTGAG GACCTGGACG GCGGCACTGC GGCCTCGGCG
GCTCCGGCTG GTGGCGGCCG AGGCGCCGCT GCCCCGGGGC GCGGCGGAGC ACCGCGCCGG
CCGGCAGCGG GCCGGGCCGT GGACGGCCCG GCGGGTCGCC CCGCGCGCCC CGGCTCCACC
GCGGCGGAGC ACCCCGCCGA GCCGTCCACC GCGGCCGACA GCGAGTTCGA GACCACGGCG
GAGACCACCA CTGCCGCCAC GGTCGGCGAC ACCGCCTCCG AAGCCGCCAC AGAGGCCGCC
ACAGAGGCGG CCCAGCAGGC CGCACGGGCC GCCCAGCCGG CCGCCGCGGG CAACCGCGGA
CGGCAGCCGG CGGCAGGTCG TAGCGGCGAA CGTCCCGCAG GCGGTCGACG CGGACGCAAG
TCGGTGCCAG CATGGGACGA CATCGTGTTC GGCGCCCGCC GGCCCTAG
 
Protein sequence
MRELRAVALS EDGGYLVLAD AAGRADAEQF RVAVDDRLRA ALRGARRSEV RAESALTPRE 
IQARLRAGET AADVARAAGI PVERVERYEG PVLAERARVV QEARAALLPK DPGGVPGRPL
GEVVDARLLS VQDDPDTAQW DAWRRVDGIW LVQLTSDSRC ARWTWDPVVR RVRPHDDAAR
ALVAPESAEP PAPQPAQPPV RAAGPALTLV HDQGAAAAYP TQSAAGGPGQ QLPGTQPSTA
PPSAFPAPAA VNGTGYLPSA AAPGYGPSPA QGPAGESPDR RPAERAGDDY ATDHTTPDHG
APDYATPGYE NTSYEASGYG GSAQEAAGQD AATRANTGQG AGGHGTSPDQ AGPDEYQGPV
AAPAASRTAG AARRSMPGAW PAAQGGAPGL SGRHSGAGGR RTTPPSRFGS RERAVPPPTP
AAEPRWLPNP DTEYSEVTAT AVDAAAANEA EFGHAGAADM ESTKTDLVAV AEAALAPAPE
SAEPGIPSED AQDTPGGSAD PDETVQAAHA AGLDEDDEDD DEDTEDGGDR SVSEAAPGGA
GRPAAPAPTG QPVLRPAAIV PPAAAEPARE PTTPVERVPR RPAAAAARRP AVARTAGGAR
SLGALAAAAE DLDGGTAASA APAGGGRGAA APGRGGAPRR PAAGRAVDGP AGRPARPGST
AAEHPAEPST AADSEFETTA ETTTAATVGD TASEAATEAA TEAAQQAARA AQPAAAGNRG
RQPAAGRSGE RPAGGRRGRK SVPAWDDIVF GARRP