Gene Franean1_6314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6314 
Symbol 
ID5674633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7667317 
End bp7668648 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content71% 
IMG OID641245167 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001510562 
Protein GI158318054 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAC GCCGCCCGCT ACCCACCGCA CCGACCGGCC TGTGGGACCG CCCCGAGATG 
GCCCAGGCCC TCACCGCACG CGACATGAAG ACCGTGCTGG AGATCTACCG GAAGTGGACC
GGCGCTTCCC AGTCGCAGAT AGCCGCCATG ACCGGCATCG CGCAGCCATC CATCAGCGCG
ATTCTTCGCG AGCAACGCCA GGTCACCAAC ATCGAAAGCT TCGAGAAGTT CGCCGACGGA
CTCGGCATCC CCCGCGAACG TCTCGGACTC GCCGCCCCGA AGACCACAGC TCCGGACACC
GCCGACAGCG CGACGAGTCC GGATCGGCGC GCCGTGCTCG CCGCCGGAGC ACTCTTCGCG
ATCGACGCCG AGTTGGACGA GGTCAGCCGC CGGATGCAGC AGGTCGCCGC ATCCAACGTC
GACGATGACG CGCTGCACCA GCTCGACATC AGCATCGAAG TCGTGGGCCG CCGCTACGAG
AACAGCGACG CCGCCACCGT CTACCCCGTC GCGCTGAAGC AGCGCCGGTG GGTCGCCGAC
CTCATGAGCG GACACCAGCA CCCCGACCAG CGCCGGGAGC TGTACGCCAT CGGCGGGAAG
CTCTCCGGCC TGCTCGGCTA TCTCGCGTTC GACCTCGGGA ACGAACTGGT CGCCCGCGCC
TACTGCAACG AGGCCATGAG CCTGGCCAAG ACCGCCGGAC ACCGCGACCT CGCCGCGTGG
GTCCGCGGCA CGCAGAGCTT CATCGCCTAC TACGGCGGCC GGTACCGCGA AGCCCTGGAC
CTCGCCCGCG ACGGACAGCG CTACGCCCGC GGCGGCCCCG CCAGCATCCG ACTCGCGATC
AGCGGCGAAG CCCGCACACT CGGGAAGCTC GGCGACATCG CCGGAGTCGA CGAGGCCGTC
GGGCGCGCTC TGGCCGCCCA CGCCCGCATC GAGGACACCG ACCCCGTCGG CTACTTCCTG
TCGTTCGACC CGTTCACCGC GTCCCGCATC GCCGGCAACG CCGCCTCCGC CTACCTCGCC
GCCGGCGCCC CCGACCGCGC CCGCGAGTTC ACCGACCAGG CCATCCCCAT CTTCGCCGCC
GCCGGGTCCA CCGCCAGCCA CGCCCTGACC CTGGTCGACG CGAGCATGAC CTACCTCACC
GGCCCCGACC CGCAGCCCGA CCGCGCCGGA ACTCTCGTTG CCGAAGCACT GGACGTCGGG
GCAGACCTTC GATCCGAAGT GGTCGCCCGC CGGGCCCGAG ACTTCCTGCT CACCGCCGCC
CAGTGGCGCA CCGTCCCCGA GATCGCCCAG GTCAACGACG CCGTCAAAGC CTGGAGACTG
CCCACCAGCT GA
 
Protein sequence
MTRRRPLPTA PTGLWDRPEM AQALTARDMK TVLEIYRKWT GASQSQIAAM TGIAQPSISA 
ILREQRQVTN IESFEKFADG LGIPRERLGL AAPKTTAPDT ADSATSPDRR AVLAAGALFA
IDAELDEVSR RMQQVAASNV DDDALHQLDI SIEVVGRRYE NSDAATVYPV ALKQRRWVAD
LMSGHQHPDQ RRELYAIGGK LSGLLGYLAF DLGNELVARA YCNEAMSLAK TAGHRDLAAW
VRGTQSFIAY YGGRYREALD LARDGQRYAR GGPASIRLAI SGEARTLGKL GDIAGVDEAV
GRALAAHARI EDTDPVGYFL SFDPFTASRI AGNAASAYLA AGAPDRAREF TDQAIPIFAA
AGSTASHALT LVDASMTYLT GPDPQPDRAG TLVAEALDVG ADLRSEVVAR RARDFLLTAA
QWRTVPEIAQ VNDAVKAWRL PTS