Gene Franean1_1367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1367 
Symbol 
ID5669776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1646827 
End bp1649718 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content72% 
IMG OID641240294 
ProductSARP family transcriptional regulator 
Protein accessionYP_001505721 
Protein GI158313213 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGGTC ACCGGCCGAC CGGTCGCCAC GATCTCATCC CACCGCGGTG CACCCGTCGA 
GCCTTGCGGG CCGGCCCCGC GTGGTTGGCG GTACGTGTCG TCGGCCGAGC GGTGCTCGCG
CTCGCCGCGC TGACAGCGTT CTCGATGATC CCGCTGGTGC TGTGGATGTG GCGGGACGTC
CCGCTGCCCT CCACCTGGGA ACCAGCCCGG TGGCTGACCC TCGCCCGCCG CGGCTACCTG
CACCCCGACA TCGTGCCCAA CACGGTCGCG GTCCTGATCT GGATCGCCTG GGGGCAGGTC
GCGCTCGCCC TCCTCCGCGA GATCGCCGCC CAGTTCCAGC GCGGTGCGTC CGCCACCCGT
ACCGTGCTGC TTCCCGGTGT GGTCCAGCAC CTCGCGGGCA GCTGGATCGC CTCCCTGTCG
ATCCTGTTCA GCCTCCTGGC TGGACGTTCC GCAATGGTCG CCGTGTCCCT CGACCTACCG
GCCTCCCACC CGGCCGCGGG GAGCGTTCAG ACCATCGCGG CGAACGCCGC TGAGCCCATG
GGGGGACCGG CGCCCAGCAC CTCGGCAGGC AACGCGCTAC CGGCGAACGG GATGCCGACC
GAAGGAACCG ATCCAGGCGG GACCACCTGG CGTCGGCACG TCGTGGCACC CGGCGAGACT
CTGTGGGGCA TCGTCGGCGG CGAGTACGAC GACCTGGAAC CAGACGCCTT CCCGCCGGCC
GTCGCGGAGG TCTTCGAGAC CAACCGCGGC ACCACCGACT CGCTCGGCCG CGCCCTGGAT
CGGCCCGACC TGATCAACCC GGGTATGGAA CTGCGGCTAC CGCAGCTTTC CACGGGGCAG
CTGCCTGGCG GGACCTATAC GGCCCCGCAG CCCATCTCAC CGCCCGCTCC TGCCCCTGCG
CCTGCTCCTC CTGCACCCAC TCCCGCGCCT GCCAGCCCGG CGAGCCCAGC GAGCCCGAGG
GCGTCAGCAT CGCCGCCCGT CACTGTGCCC CACGACGTTC CCGCTGTTCC TCCCGGGGAC
GAACCAGAGG TGCCCCTCAG CACGTGGATC GGTGGCGCCG GTCTCCTCGC CACCACCCTC
GTCGGCCTGT GGGCCGCGCG TCGACGCCGC CGCGACAGCA CCGTCACCCC GGCGCAGGAG
ATCCCCGACC CGGATCCGGC CACAGCCGCC CTGCACGCGG CTCTGCTGGA CGCGGACGAC
CCGGATCTCC TCGACCGACT CGACGCCGCC CTACGCAGCA TCGGCGCGGC TCACCGTGAT
CAACCCGACG GGCCGAGCCC CCAGATCCTG CTCGTCCAAC CGGACCAGAG CATCGAGGTT
CTCCTTCATC CCGACGGCGC ACAGACCCTG CCACCGCCAT GGGATGCCGG GCCCGATCCG
CGGATCTGGA CCCTGCCCGC CACCGCGGCG CTCACTGCCG CTGCGGACAT CCCGCCACCT
TGCCCGGCGC TGGTACAACT CGGCACGACC GCCGCGGGCG CGGCCCTGTA CGCCGACCTC
GAAGCCCTCG GCACCCTCGG CATCGCCACG AGCCCGACAG ACCCCAACCG GCTGGCCGAC
TTCTGCCGGG CGATCCTTGC CGCGATCGTC GCCTCCCCTT GGGCCGATCT CACCACCGTG
CGCACCGTCG GCCTCGACCC GCATGCCTTC GCCGCAGAGG AACGAGTCCA GGCAGCCAAG
GACGTCACCG AGCTGACGGA CGACGCTCGC GCCGAGGCCG CCGCCATCGA CACCGTCCTG
CGGGAGCGGG GCTACCCCAG CACGCTCAAC GCCCGGATCG CCGAGCCGGG CGAGGAATTC
GACCCGACCA TCGCCCTCCT CGCCACTCAC CTGGACACCG ACGCCGGCCG TACCGAAGCG
GCCGACCTTG CCGCCGCGGC AGGCCATGGC CGGCGGGGGC TGGCCGTCGT CCTACCGGCC
CAGCCCGACA TCCTTACCGC GTGGATGCTG CGCCCCGACC CGGCCGGCCG CGGATCGCGC
CTGGACCCCC TCGGCGTGCT CCTGACCCCG GTCGGGCTCA CCGACACCGA CGAAGCCGCC
GTCGCCGCCT TCATCGCGGA CGCGGAGGCA CCTCCCATCG ACCTGCCTCC ACCAGATCCG
CCCGCCGAGA CTCCTGCGGT GTTCGTCTCG CCGCCCTGGG AGGTGATGGT GCGCCTGCTC
GGCCCCATCG ACATCACCAC CCGCGACGGC CGCACCCCGC CGCGGGCCGA GACCCGCGAA
CGCACCAACG AGGTCCTCGC CTGGCTGGTC ACCCACCGGC ACGGCACCCG CACCGATCTG
GAATCCGCGC TCTGGCCGCG GGGAGCAAAA TCCAAAACCC TGGCGAACGT GCTTGCCCGG
GCACGGCGCC TGCTCATCAA CCTCGTCGGT GAGGATGCGA AGAACTGGGT GCCCCGATAT
GACCGCGGCC CCCTCACCCT CGACCCCCGG GTGGTCTCCG ACCTGGACAT TCTCCAAGCC
CTCCTGCGCC ACGCCATCGA CCAGCGTGAT CACCACGAGA CGGCGATCGC CACGCTGCAG
GAGGCCCTCA AGCTGGTCAG AGGTGTGCCC GTCGGCTATC CGTGGCTGGA TGCCCAGATG
GGCTCGATCC TGACCACCGC CCCAGTCAAC GCCGCGATCC TGCTCGCCGA ACACCAGCTC
GCCGCCGGCG ACACCGCCGG CGTACTGGCG ACCACCGCCC GAGGCATGGA AATCCTCCCG
GCCCACACCA CCCTGTTCGC GCTGCGAATG CGCGCCCACG CCGCCGCCGG CGACCCCGAC
GCGGTCAAAG CCGAATACCG CTCCTACCTC CGCGCCGAAA AGGCCGAACC CCTCTGGGAC
GGCGAAACCG ACCGCGACCT CGAAGCCCTC CACTACAAGC TCACCCGCCG CCGCACAGCC
GGATCCGGCT GA
 
Protein sequence
MRGHRPTGRH DLIPPRCTRR ALRAGPAWLA VRVVGRAVLA LAALTAFSMI PLVLWMWRDV 
PLPSTWEPAR WLTLARRGYL HPDIVPNTVA VLIWIAWGQV ALALLREIAA QFQRGASATR
TVLLPGVVQH LAGSWIASLS ILFSLLAGRS AMVAVSLDLP ASHPAAGSVQ TIAANAAEPM
GGPAPSTSAG NALPANGMPT EGTDPGGTTW RRHVVAPGET LWGIVGGEYD DLEPDAFPPA
VAEVFETNRG TTDSLGRALD RPDLINPGME LRLPQLSTGQ LPGGTYTAPQ PISPPAPAPA
PAPPAPTPAP ASPASPASPR ASASPPVTVP HDVPAVPPGD EPEVPLSTWI GGAGLLATTL
VGLWAARRRR RDSTVTPAQE IPDPDPATAA LHAALLDADD PDLLDRLDAA LRSIGAAHRD
QPDGPSPQIL LVQPDQSIEV LLHPDGAQTL PPPWDAGPDP RIWTLPATAA LTAAADIPPP
CPALVQLGTT AAGAALYADL EALGTLGIAT SPTDPNRLAD FCRAILAAIV ASPWADLTTV
RTVGLDPHAF AAEERVQAAK DVTELTDDAR AEAAAIDTVL RERGYPSTLN ARIAEPGEEF
DPTIALLATH LDTDAGRTEA ADLAAAAGHG RRGLAVVLPA QPDILTAWML RPDPAGRGSR
LDPLGVLLTP VGLTDTDEAA VAAFIADAEA PPIDLPPPDP PAETPAVFVS PPWEVMVRLL
GPIDITTRDG RTPPRAETRE RTNEVLAWLV THRHGTRTDL ESALWPRGAK SKTLANVLAR
ARRLLINLVG EDAKNWVPRY DRGPLTLDPR VVSDLDILQA LLRHAIDQRD HHETAIATLQ
EALKLVRGVP VGYPWLDAQM GSILTTAPVN AAILLAEHQL AAGDTAGVLA TTARGMEILP
AHTTLFALRM RAHAAAGDPD AVKAEYRSYL RAEKAEPLWD GETDRDLEAL HYKLTRRRTA
GSG