Gene Franean1_5136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5136 
Symbol 
ID5673470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6153647 
End bp6155113 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content73% 
IMG OID641243986 
ProductXRE family transcriptional regulator 
Protein accessionYP_001509400 
Protein GI158316892 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.033803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT CGACAGGACC GTCGGCTGGT CGTGCCACGC CGAGGACTGC GCAGGCGAAC 
GACCACAGGC CGGCATCGGC CGGGATCACC CGCACTCCGA ACGTGCGGTT GCGTACCTGC
CGGGAGGAAC GCGGCTGGTC GCAGGAACGA CTGGCCAGTG AGATCCGGCG TTTTTCCGTC
ATACACGAGG GCCGCGAGGC CGGCGTGACG GGAAATATGA TCTGCAAATG GGAGAAGGGC
GATAAAAAGC CCAGCCTTCG TTACCAGCGG CTCCTGCGGG CCCTGTTCGA GCGGTCGTCG
GCGGAGCTCG GCTTCGTGGA CGACGACCCC AACACAGGGC TTCCGGCGCA CGCGTCCGGC
GCGGACAATG CCTCCCGGGA GATCGTGCCG GCCGGCACGC TGCTCCTGCA GGCCGCCGAG
GCCGACGCCG GCCTGCACAA CGCTCTCCCG GTGGAGCGGC GTGGGTTCCT GCGGCTGTTC
GCGGCGGCCG GCGGTGTCGC GGTCGTCCCG CTGGGCATGG GTGGCGACGA CGCGCCCTGG
GAGCGGCTGT CCGCCGCGCT GCGCCGGCGC ACCACGGTCA CCCCCGAGCT CGCGGACGAG
CTGAGCCGCT GCACCGCGGG CCTGTACGGC CTCGAGGAGC GGGTTCCGGC CCGGGCCCTG
TTCTCCCGGG TCACCGGGCA TCTGGGGACG CTCACGCAGC TCCTGGAGTC CAGCGGCCGC
TCGCCGGTCC GCCGGGACCT CGCCTCCACG GCCGGCGAGA CCGCCGCCCT GGCCGGTTGG
CTCGCCTTCG ACATGAACGA CGTCCCCTCG GCCCTGGCCT ACTACCGGGT CGCCATCGAG
GCGGCGCGGG AGGCCGACGA CAGCGCGCTG TGGGCCTGCG TGCTGGGCTA CGAGAGCTAT
CACAGCGCGG GCATCGGCCG TCACGACCAG GCCTGCGCGC TGCTGGCCGA GGCGCAGCGC
CGTGCCGCGA CGGGCAGCAC CGTCATGACG AAGGCCTGGC TGGCCGGGCG GGAGGCCGAG
GAGCAGGCGG CCCGCGGTGA GGGGCGGGCC GCGCTGGCCG CCCTCGACCG CGCCCAGGAC
GCGTTCGACC GCGCCGACGA CGGCGACCGG GTCTGGACGC AGTTCTTCGA CCGCGGCCGT
CTGGACGGCA TCAAGGTCAC GACCTACACC CGGCTGCGGC GCCCGGCCGC GGCCCACGCG
GCGGCGACCG AGGCACTGCG CGCCACCACC CCGCACAGCG GCACCAAGAA GCGGTCCCTG
CTGCTCGGTG ACATCGCCGA GGTCCACATC CAGCGCCGGG AGATCGAGGC CGCCACCCAG
TACGCGACCG AGTCGCTCGC CATCGTCGCG GCGACTGACT TCTCCCTCGG GCTGACCAGG
GTCCGCCGCG TCCGGGAGCA TCTGCGGCCC TGGCAGCAGA CGCAGGCCGT CCGCGACCTC
GACGAGCAGC TCCGCGCGCT CACCTGA
 
Protein sequence
MQRSTGPSAG RATPRTAQAN DHRPASAGIT RTPNVRLRTC REERGWSQER LASEIRRFSV 
IHEGREAGVT GNMICKWEKG DKKPSLRYQR LLRALFERSS AELGFVDDDP NTGLPAHASG
ADNASREIVP AGTLLLQAAE ADAGLHNALP VERRGFLRLF AAAGGVAVVP LGMGGDDAPW
ERLSAALRRR TTVTPELADE LSRCTAGLYG LEERVPARAL FSRVTGHLGT LTQLLESSGR
SPVRRDLAST AGETAALAGW LAFDMNDVPS ALAYYRVAIE AAREADDSAL WACVLGYESY
HSAGIGRHDQ ACALLAEAQR RAATGSTVMT KAWLAGREAE EQAARGEGRA ALAALDRAQD
AFDRADDGDR VWTQFFDRGR LDGIKVTTYT RLRRPAAAHA AATEALRATT PHSGTKKRSL
LLGDIAEVHI QRREIEAATQ YATESLAIVA ATDFSLGLTR VRRVREHLRP WQQTQAVRDL
DEQLRALT