Gene Franean1_5660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5660 
Symbol 
ID5673987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6871812 
End bp6873284 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content73% 
IMG OID641244514 
ProductXRE family transcriptional regulator 
Protein accessionYP_001509917 
Protein GI158317409 
COG category[K] Transcription 
COG ID[COG1396] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.176576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.652733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGGAC AAACCTGGCT GTGGACCCCA CCGCGACCGA CTCCAACAAG CGTTGGAATT 
GGATCGCTGC TACGGGCGTA CCGGCAGGCC CATGGCCTGA CCCAGCAGCA ACTCGCCGAT
CTCCTCGGCT TCGACCAGTC CTATGTGTCG AAGGTCGAGA GCGGGCGGCG GGCGATCCAC
GACATCTCCA CGCTGCGCCA CATCGCGCGC AACCTCGGTC TGTCACCCGA GGACGTCGGA
CTCGCCCCGG GCGGTCTGGC CGACCGGCGT CGTGAGCCAC CGCGCGGCTC CGCGGTCGAG
AAGGTCGCCG GCAGCCAGCG CGCCTGGCGG CTGACCCGGG ACCACCTGAA CCACCACCGG
ATCAGCCTGG CCCGCGCCGC GGCCCGGCTC TACCCGGAGG CCTACCGTCT CGGCAACGGG
CTGCTGGCCC GTCCCGGCTG GGTGTGGGAC ACCCCGGTGG ACATCAACGA CATCGGCCTG
CGCTGGGAGC AGTCGGCGGG CGAGCCGGCG ATCACCGGCG CCGAGCCGGA GGCCGACGCC
ACCCGCCCGC TGGTCGGCGA CGGCGGCGCG CAGTCGCGTT ACCAGCGCTA CACCCGGGCG
ATGCGCGACC TCGACCGGCC GACCCTGTTC GAGAACCGGC TCAGCTTCCG GCTGCTGGAC
GTCGCCGAGG CCGAGGGTGC CCCCCCTACC CTCACCTTCG GCCACACCAC CTACTTCGAC
GCCGTCGACG TCTGCGAGAC CGTGGCGCAC GAGACCGCGG CGGCGATGAT GGGCGGCGAG
CTCGCCTGGC CCGCGCTGCC GTTCCGGCGC CGCATCGGCG ATCCGTTCGA CCTCGCCGGG
CGGGTCGCGC TGCCCTCCAT CAACACCCTG ACGATCCGGC TCGACCGGGG CAGCGCCAGC
TTCGTCCTGC ACCGGCGCAG CGCCGGCTCC GTGGCGACGG CGGGCGGGGT GTACCACGTG
CTGCCCGCGG GGGTGTTCCA GCCGTCCGGG ATCACCCCGT TCCACCACGA GGCCGACTTC
GACCTGTGGC GCAACGTCAT GCGCGAGCTC AGCGAGGAGC TGCTGGGTAA CGCCGAGCAC
GACGGCAGCT CGTCCCGGCC GATCGACTAC GACACCGACG AGCCGTTCCG CTCCTTCGAG
CAGGCGCGCC GCGCGGGCAC GCTGCGGGTC TTCTGCTTCG GCATGGGCCT CGACGCGCTC
ACCCTGTTCG GCGAGATCCT CACCGTCCTG GTCGTGGAGG CCGACACGTT CGACACGCTG
TTCGCCGACA TGGTCCGGAC GAACGCCGAG GGGTCGGTCG TCTCGACGGG GCCGGGCCGG
CAGGCGCACG AGGGGATCCC GTTCACCCAG GCCTCCCTGC GCCGGCTCGT CGACACCGAG
CCGCTCGCTC CCTCCGCCGC CGCCTGCCTG GAGCTCGCCT GGCGCCACCG GGAGACCCTC
CTGCCCGGGT TGCGCATGCG GGCATCGGTC TGA
 
Protein sequence
MTGQTWLWTP PRPTPTSVGI GSLLRAYRQA HGLTQQQLAD LLGFDQSYVS KVESGRRAIH 
DISTLRHIAR NLGLSPEDVG LAPGGLADRR REPPRGSAVE KVAGSQRAWR LTRDHLNHHR
ISLARAAARL YPEAYRLGNG LLARPGWVWD TPVDINDIGL RWEQSAGEPA ITGAEPEADA
TRPLVGDGGA QSRYQRYTRA MRDLDRPTLF ENRLSFRLLD VAEAEGAPPT LTFGHTTYFD
AVDVCETVAH ETAAAMMGGE LAWPALPFRR RIGDPFDLAG RVALPSINTL TIRLDRGSAS
FVLHRRSAGS VATAGGVYHV LPAGVFQPSG ITPFHHEADF DLWRNVMREL SEELLGNAEH
DGSSSRPIDY DTDEPFRSFE QARRAGTLRV FCFGMGLDAL TLFGEILTVL VVEADTFDTL
FADMVRTNAE GSVVSTGPGR QAHEGIPFTQ ASLRRLVDTE PLAPSAAACL ELAWRHRETL
LPGLRMRASV