Gene Franean1_5545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5545 
Symbol 
ID5673875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6712760 
End bp6714091 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID641244401 
ProductXRE family transcriptional regulator 
Protein accessionYP_001509805 
Protein GI158317297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGTC GCCGCCCGCT ACCCACCGCA CCGACTGGCC TGTGGGACCG CCCCGAGATG 
GCCCAGGCCC TCACCGCACG CGATATGAAG ACCGTGCTGG AGATCTACCG GAAGTGGACC
GGTGCCTCCC AGACGCAGAT CGCCGCCATG ACCGGCATCG CGCAGCCGTC CATCAGCGCG
ATTTTCGGCG GGAAACGCCA GGTCACCACC ATCGAAAGCT TCGAGAAGTT CGCCGACGGA
CTCGGCATCC CCCGCGAACG TCTCGGACTC GCGGCCCTGA AGACCGCAGC TCCGGACACC
GCCGACAGCG CGACGAGTCC GGATCGGCGT AGCGTGCTCG CCGCCGGTGC ACTGTTCGCG
ATCGACGCGG AGTTGGACGA GGTCACCCGC CGGATGCAGC AGGTCGCCGC GTCCAACGTC
GATGACGACG CGCTGCAGCA GCTCGACATC AGCATCGAGG TTGTCGGTCG CCGCTACGAG
AACAGCGACG CCGCCACCGT CTACCCCGTC GCGTTGAAGC AGCGCCGGTG GGTCGCCGAC
CTCATGGGCG GGCACCAGCA CCCCGACCAG CGCCGCGAGT TGTACGCCAT CGGCGGGAAG
CTCTCCGGCC TGCTCGGCTA TCTCGCGTTC GATCTCGGGA ACGAGCTGGT CGCCCGCGCC
TACTGCAACG AGGCGATGAG CCTTGCCAAG ACCGCCGGAC ACCGTGATCT CGCCGCGTGG
GTCCGCGGCA CCCAAAGCTT CATCGCTTAC TACGGCGGTC GGTACCGCGA AGCCCTGGAC
CTGGCCCGCG ACGGCCAGCG CTACGCCCGC GGTGGCCCCG CCAGCATCCG ACTCGCCATC
AGCGGCGAAG CCCGCACCCT GGGCAAGCTC GGCGACATCG CCGGAGTCGA CGAGGCCGTC
GGGCGTGTTC TGGCTGCCCA TGCCCGGATC GAGGACACCG ACCCCGTCGG CTACTTCCTG
TCGTTCGAAC CGTTCACCGC GTCCCGCATC GCCGGCAACG CTGCGTCCGC CTATCTCGCC
GCCGGTGCCC CCGACCGGGC CCGCGAGTTC ACGGATCAGG CCATCCCCAT CTTCGCCGCC
GCCGGGTCCA CGGCCAGCCA CGCCCTGACC CTGGTGGACG CGAGCATGAC CTACCTTTCC
GGCCCCGACG CCCAACCGGA CCGCGCCGGA GCTCTCGTTG CCGAAGCGCT GGACGTCGGC
GCCGATCTGC GGTCCGAAGT GGTCGCCCGC CGGGCCCGGG ACTTCCTGCT CACCGCCGCC
CAGTGGCGCA CCGTCCCCGA GATCGCCCAG GTCAACGACG CCGTCAAAGC CTGGAGACTG
CCCACCAGCT GA
 
Protein sequence
MTRRRPLPTA PTGLWDRPEM AQALTARDMK TVLEIYRKWT GASQTQIAAM TGIAQPSISA 
IFGGKRQVTT IESFEKFADG LGIPRERLGL AALKTAAPDT ADSATSPDRR SVLAAGALFA
IDAELDEVTR RMQQVAASNV DDDALQQLDI SIEVVGRRYE NSDAATVYPV ALKQRRWVAD
LMGGHQHPDQ RRELYAIGGK LSGLLGYLAF DLGNELVARA YCNEAMSLAK TAGHRDLAAW
VRGTQSFIAY YGGRYREALD LARDGQRYAR GGPASIRLAI SGEARTLGKL GDIAGVDEAV
GRVLAAHARI EDTDPVGYFL SFEPFTASRI AGNAASAYLA AGAPDRAREF TDQAIPIFAA
AGSTASHALT LVDASMTYLS GPDAQPDRAG ALVAEALDVG ADLRSEVVAR RARDFLLTAA
QWRTVPEIAQ VNDAVKAWRL PTS