Gene Franean1_7144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7144 
Symbol 
ID5675447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8724418 
End bp8726166 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content65% 
IMG OID641245983 
Productputative transcriptional regulator 
Protein accessionYP_001511371 
Protein GI158318863 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.310406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACG TCGAGCTCGC GGAGATCGTC GACAACCTTC GCACCATCGG TACAGACATC 
GCCGACGTGG AGGTGAAGAA GGCCCACGGT GGGCTGCCGA AGTCGCTGCG CGAGACTCTC
TCCGGGTTCT CGAACACGCG AGGCGGTGTG GTCGTCCTTG GTCTGGACGA GACCCAGGGC
TTCGCGCCGA CGGGACTTCC CGATCCGGCG AGGCTCGCGG CTGATCTTGG TTCCATGTGC
TCCGAGGACA TGGAACCGCC GCTCCGACCG TTGATCAAGG TCCACGACTT CGAGGGCGCC
CAGATCCTGG TCGCCGAGGT CCCCGAGCTC GATCCCGCGC GGAAGCCCTG CTACTCGCGT
GGGGCCGGCA TCACCAAGGG TAGCTACGTC AGGGTCGGCG ACGGTGATCG TCGGCTGTCG
GCCTACGAGG TCCAGATGAT GCTCTCCTCA CGTGGTCAGC CTCGTGAGGA CGAGCAAATA
GTCTCTGGCG TTGGTCTCGA TCATCTGGAT GCGGCCATGG TCGATGCGTT GGTCGCCCGG
CTGAGGACCA GCAGGCCCTA CGCGTTCAAG GACTTGGACC GCTTGGCTGT GCTGCGTCGC
GCCAAGGTGC TCGTGACTGG AGACAGCGGC GAGGACGTGG CGTCGCTGGG AGGCCTGCTC
GCCCTGGGCA GGTATCCGCA GGAGCACTTC CCGCAGCTGA TGGTCACCTT CGTCCACTAC
CCGACCGAGA CCGGCGGCCG GTCCACCGAG AGGTTCCTGG ACAACGTGAC GTTGGAAGGC
CCGGTCCCGG TCATGGTCCG CGACACGCTG GCCACCGTCC GCCGGAACAT GTCTCGTCGG
GCTGTCGTCG GGGGTGCGGG TCGGCAGGAC GTCTGGGAGT ACCCAGAGAC CGCTCTGCGT
GAAGCCGTCG TCAACGCACT GGTCCACCGC GACCTGTCCG GGGGCGCTCG AGGCGCCCAA
GTCCAGGTCG AGATGTACCC CGACCGCCTG GTGATCCGTA ATCCGGGCGG TCTGTTCGGT
CCTGTCACGG TCGACAGTCT CGGCGAAGAA GGTGTCTCCT CAGCCCGCAA CGCCACCCTC
ATCAAGATCC TCGAAGATGT CCCGCTGCCC GGCGAGACCC GCACCGTCTG CGAAAACCGC
GGGTCGGGCA TCCGCGCCAT GCTCGACTCG CTGCTCGCCG CGGGAATGAG CCCACCGGAC
TTCAACGACA AGATCTCGTC GTTTGTCGTC GTCTTCCCCA ACCACACCCT GCTCGGCGAG
GAGACCGTCG CTTGGATCAC CGGGTTGGGT GAGAAGGGCC TGACCGACAG CCAATGCGTC
GCCCTCGCGC TCCTACGCCA GGAAGAAATC CTTGACAATC GCGCCTATCG CAACGCGACT
GGCGTCGACT CCCGTGTAGC CACCAGCGAG CTTCGGGACC TGGTCGCTCG CGAACTCGTC
ACCCAGACCG GAACCCGCCG CTGGGCCAGG TACCAGCTGT CCTGGCGAAC AACCTCAACG
AAAGGCCAGT CCTCTCGGGC CGACCGCCGA CCGGAGCTGC TCGTCGCCCT CGGAAACGAG
ACTCTCTCAC GCTCCGAACT CGTTACACGG ACCGGTCTCA GCGACCAGAC CATCCGACGA
TGGCTGAAGA TCATGCGGGA CGAAGGCTCG GTCGACATCG TCGGTAGCAG CCTCAAGAGC
AGACACGTTC GATACCGCCG CACCTACCAG GATCTGCTCT TCCACTCCGA AGGCACAGAC
CGCGACTGA
 
Protein sequence
MLDVELAEIV DNLRTIGTDI ADVEVKKAHG GLPKSLRETL SGFSNTRGGV VVLGLDETQG 
FAPTGLPDPA RLAADLGSMC SEDMEPPLRP LIKVHDFEGA QILVAEVPEL DPARKPCYSR
GAGITKGSYV RVGDGDRRLS AYEVQMMLSS RGQPREDEQI VSGVGLDHLD AAMVDALVAR
LRTSRPYAFK DLDRLAVLRR AKVLVTGDSG EDVASLGGLL ALGRYPQEHF PQLMVTFVHY
PTETGGRSTE RFLDNVTLEG PVPVMVRDTL ATVRRNMSRR AVVGGAGRQD VWEYPETALR
EAVVNALVHR DLSGGARGAQ VQVEMYPDRL VIRNPGGLFG PVTVDSLGEE GVSSARNATL
IKILEDVPLP GETRTVCENR GSGIRAMLDS LLAAGMSPPD FNDKISSFVV VFPNHTLLGE
ETVAWITGLG EKGLTDSQCV ALALLRQEEI LDNRAYRNAT GVDSRVATSE LRDLVARELV
TQTGTRRWAR YQLSWRTTST KGQSSRADRR PELLVALGNE TLSRSELVTR TGLSDQTIRR
WLKIMRDEGS VDIVGSSLKS RHVRYRRTYQ DLLFHSEGTD RD