Gene Franean1_5118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5118 
Symbol 
ID5673452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6127848 
End bp6131888 
Gene Length4041 bp 
Protein Length1346 aa 
Translation table11 
GC content74% 
IMG OID641243968 
ProductSARP family transcriptional regulator 
Protein accessionYP_001509382 
Protein GI158316874 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0180644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.388014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATC CGGACCGGCT CCAGTTCGCC GTCCTGGGGC CGTTGGAGGT CACCGCGGGC 
GGTGTTCCCG TCGTGCTCGG CGGCACCCAG CGAAAGGTGC TGCTCGCGGC GCTGCTGCTC
GAGGCCGGGC AGGTGCTGTC CACCCAGCGC CTGATCGAGG TGATCTGGAG CGAACCAGCC
CCGGAAAGGG CACTCGCCAC CCTGCGGACA CACGTCAGCG AGCTGCGCCG CCGGCTGGAG
AGCCCCAGCT CAGCCCAGGT ACTCATCCGC CGGGGAGCCG GTTATCTGCT CGATGTGGGG
CCGGCGCAGA TCGACGCCGA ACGCGCCCGG CGGCTGTTGG AGCAGGGCCG GCGCGCCATC
GAGGACGGCG ACCCGGTCAG CGCCATCGCG CCGTTGCAGG AGGCGCAGGC GCTCTGGCGC
GGCCAACCGT TGGTCGATCT CGTCGACTAT CCCTTCACAA CGGGCTACGT CGAAAAACTG
ACAGAGCTTC ACCTTGACAT CCAGAAGACG CGAATCTCAG CCGATCTGTC GCTGGGCCGA
CACCGCGAGG TCATCGGTGA TCTGCGGATG CTCGTCACGA GACATCCGCA CGACGACGGG
CTGCGCCGGG AGCTTGTTCT CGCCTTCTAC CGCGACGGCC GCACCGAGGA CGCCGCCCGG
GCCTGCCGGG AAGGCCTTGA GGCACTCCAC GATCTCGGTC TGGAATCCCC GCTGCTGCAA
CGGCTCCAGG AAGACGTCCT GCGCGCCGAT CCGGGCCTGG CCTGGACTCC GCCACGTGCG
CTGGGCCGGC CGGTGTCGGC GCCGCCCACC GGGCAGGGCG TCTACCAGCT CCCGCCCGAC
ATCGAGGAGT TCACCGGCCG CGACACGATC CGGGGCCAGA TCCTGGCCAC CCTCACCGAC
CCGGTGGCGG GCGCGAAGGG GACGGTCGTC GCCGCGATCG CCGGCAAGGC CGGGGCGGGG
AAGACCGCGC TCGCGGTGTA CGTGGCCCAC CGCGTCCGCG CCCAGTTCGC TGACGCCCTG
TACGTGGACC TGCGCGGCAC CACCGAACCG CTTGACCCGC ACCGCGTGCT GGGCCGCTTC
GTCCAGGCGA TGGGGGTGAG CCGGGCCGCG GTTCCCTCCG ACCCCGACCA TCTGGTCGAG
GTGTACCGGT CGCTGCTGGT CAACCGCCGG GTGCTGATCC TGCTGGACAA CGCGGCGAAC
GAGGCGCAGA TCCGCCCGCT GCTCCCGACC AGCCCCGGCA GCGTGGTCAT CGTCACCAGC
CGCTCGCGGA TGCACGGGCT GACCCCCTCC TACTGGATGG TCGACGTCCT GCACCCCGGC
GATGCCGTGG AGCTGCTCTC CAAGGTGGTC GACGAGCGGC GGGTGAGCGC GGAGCCGGAG
GCGGCACGCG ACATCGTGGG GCTGTGCGGG TACCTGCCGC TCGCCATTCA GATCGCGGGG
CGCAAGCTGG CCGCCCACCC GCACTGGCGG CTGACGCGGT TGGCCTCCCG GCTGGCCAAC
GAGCGGGACA GGCTGTCCTG GCTGGAGGCG GGCGACCTCG AGGTCCGTGC CAGCTTCGGG
CTCAGTTACG AGGGCCGCCC CGCCGACGAG CAGCGGGCGT TCCGCCTGCT GGCGCTGCCC
GAGCTGGCCG ACTTCGCCCC GTGGGCCGCC GCCGCCGTCC TGGACGTCGG CCTGGACGAG
GCCGAGGACG TCGTGGACCG CCTCGTCGAC GCCCAGCTGC TCGAACGCCG CGGTGTCGAC
CGGGCCGGCA CCGAGCGGTA CCGGTTCCAC GACCTGCTCC GGGTCTTCGC CCGGGAAAAG
AACGAGGCGT CCGCCGGCGC GGCCCACCCC ACCGTCCCCC CGGGCCCGGG CTTCACCCGG
ACGGCGACCA CCGACGTCCC GCGCGCCGCC CCCACAGGCA CGGACACCGG AGCGGCGCCG
GCAGCCCAGG TGCCGCGGCT GGCTCCCGAG CACGCGCAGG CCCTCGACCG GCTGCTGTAC
GCCTATCTGG CGGTGCTGCG CGGCGCGGTC GACGAGTTCG CGCCGGGCAC CGCGCGCACC
ATCGAGCCCG CCGCCCACAC CCCCGCCGCG GGCGGCTCCG GTGGACCGGG CCACGCCGTC
GACCCGGACG TGGTCGCGGC GCTCACCGCC CAGCCGCTGC TGTGGTTCGG CGGCGAGCGG
GTCAACCTGC TGGCCCTCGT CGACCAGGCG CACCGGCACG GCCTCGACGA GCCGACCTGG
CTGCTCGCGA TCGAGGCCGC CGAGTTCTGC GCCTTCGGCG CGCACTGGTC GGACTGGGAG
CGGATCCACA CCCTCGCGCT CGCGGCGGCG CGGCGGCGCG GCAACCGGCT GGCCGAGGCG
GTCCTGCTGT GCGGTTTGGG TGAACGCGAC ATCACCCTCG CCTTCGAGCA CGCCTTCTGG
CGGCTGGACG CGCCCGGTGC CGACAACGAC GGGGAGCCGG CGGCCGTCCT CGGTGCCCGC
GCCGCCGACC ACCTCGACCT CGCGACCGAG CGCCTGGAAC GTGCCCGTTC GATCTTCGTC
GAGTACGGCG ACGCGCTCGG CGAGGCCCGC ACCCTGCACG GGCTGGCCGA CGCCGCGCGC
GGGCGCGGGG ACGCCGAGGG CGCGGTGGAG CACTTCGAGC AGTGCCTGGC GCTGACCCGC
CGCGGCGGCG CGCGGCGCGC GGAGGCCGAG GCGCTGATCT GCCTGGCGAT GGCGCACGGC
GACCGCGACG AGCTGGGCGA GGGCATCACC TGCCTGTCGA TCAGCCTGTC GATCGCGCGC
GAGCTGACCA GCCGGTCGCT GGAAGCCCTC GCGCTGCGCC GGCTCGGCGA CCTGCACCGG
CACCGGCCGG AACGCGCCCT GGCCCACTAC AACGAGAGCC TGCCGCTGCT GAGCGCGCTG
CCCGACATCC TGTGGGAGCC GCGGATCCTG GTCCGGCGGG GCGACATCCT GGCGCGCCTC
GACGATCATC TCGCCGCGCG CCGGTCCTGG CAGCAGGCGG TCACCCTGCT GCGCCAGCAC
GGTTCCACCG AGCTCGCCGC GGCCGAGTCG CGGCTGGCCC GCCCGGCGAG CACCGCGCCG
ACCCAGTTCG CCAGCGGCCG GCTGCTCGGC GACTTCGACC CGGCCTACTT CATCGGACGG
GTCGCCACGA GCCGGCGCAG CGTCCGCCTG CTCAACACGT GGACGGACCT GCTGGCCCCG
GCGCACGTCG AGGCCTTCGC CGAGGCTCTG CTCACCGCGG TCGACGCGGG GGCGATGGTG
CAGATCCTGC TCCTGGACCC CGGCTCCCCG CCCGCGGCCG GCCGGGCCGA GGACCTGCTG
CACAGCATCG ACGTCCCGAA TGTGATCAGG GCGAACCTGC GCGTGCTCGA CACCGTGCGG
GCGCGGCTGG TCCCCGCGCT GCGGCCCCGG CTCTCCGTGC GGATCTATGG CGACCAGCCA
CTGACGACCT ACCACCGGTG GGACAGCGGG GCCCTGATCT CGACCTTCCC GTTCGGCCAC
TCGTCGGCCG CCACGACCCA GCACGAGGCG GCGATCTCCT CGACCCTCGT CCAGTTCGTC
GAGCAGCACT TCGAGAAGAT CTGGCATCCC GACCGCAGTG TCAGCCTCGA CGACTATCTC
CGGGTCCCGC TGCGCATCCT GCGCACCGAC GGCCCCGACC GGGACGCCCG CACGATTCAG
GCGAGGTACG CGCGGCTGGG CGACACGGTC TACCTGGCCG ACGAGGAGCT GACCCGGCTC
GTCCACGACG GCGGCGCCGA CGGCCTCGTC ACCGAGATCG GCGGCACCGG CCGTCACCCG
CTCACCGAGC TGCACCGGTG CCGGGTCGTG CCGGCCTGGG AGGGGCGTGA CGGCGCCGCG
GACGCCTTCG CGAAGAAGTA CGGCCCGCAG ACCACGGCCC CCGCGCCGGA CGGCCCGCCG
CTGCGGCTCG CGCCGATGCG CCCCCCGCCG CCCGCGCCCA CTCCGCACGC CGGCGGCCGG
GCCGCCGCCG GCCCGGCGGC TCCTGGCCCA GCGGGCCCTG GGCCAACGGG CCCGCTGCCG
TCGCCCCGAT CGGTGCCGTG A
 
Protein sequence
MPDPDRLQFA VLGPLEVTAG GVPVVLGGTQ RKVLLAALLL EAGQVLSTQR LIEVIWSEPA 
PERALATLRT HVSELRRRLE SPSSAQVLIR RGAGYLLDVG PAQIDAERAR RLLEQGRRAI
EDGDPVSAIA PLQEAQALWR GQPLVDLVDY PFTTGYVEKL TELHLDIQKT RISADLSLGR
HREVIGDLRM LVTRHPHDDG LRRELVLAFY RDGRTEDAAR ACREGLEALH DLGLESPLLQ
RLQEDVLRAD PGLAWTPPRA LGRPVSAPPT GQGVYQLPPD IEEFTGRDTI RGQILATLTD
PVAGAKGTVV AAIAGKAGAG KTALAVYVAH RVRAQFADAL YVDLRGTTEP LDPHRVLGRF
VQAMGVSRAA VPSDPDHLVE VYRSLLVNRR VLILLDNAAN EAQIRPLLPT SPGSVVIVTS
RSRMHGLTPS YWMVDVLHPG DAVELLSKVV DERRVSAEPE AARDIVGLCG YLPLAIQIAG
RKLAAHPHWR LTRLASRLAN ERDRLSWLEA GDLEVRASFG LSYEGRPADE QRAFRLLALP
ELADFAPWAA AAVLDVGLDE AEDVVDRLVD AQLLERRGVD RAGTERYRFH DLLRVFAREK
NEASAGAAHP TVPPGPGFTR TATTDVPRAA PTGTDTGAAP AAQVPRLAPE HAQALDRLLY
AYLAVLRGAV DEFAPGTART IEPAAHTPAA GGSGGPGHAV DPDVVAALTA QPLLWFGGER
VNLLALVDQA HRHGLDEPTW LLAIEAAEFC AFGAHWSDWE RIHTLALAAA RRRGNRLAEA
VLLCGLGERD ITLAFEHAFW RLDAPGADND GEPAAVLGAR AADHLDLATE RLERARSIFV
EYGDALGEAR TLHGLADAAR GRGDAEGAVE HFEQCLALTR RGGARRAEAE ALICLAMAHG
DRDELGEGIT CLSISLSIAR ELTSRSLEAL ALRRLGDLHR HRPERALAHY NESLPLLSAL
PDILWEPRIL VRRGDILARL DDHLAARRSW QQAVTLLRQH GSTELAAAES RLARPASTAP
TQFASGRLLG DFDPAYFIGR VATSRRSVRL LNTWTDLLAP AHVEAFAEAL LTAVDAGAMV
QILLLDPGSP PAAGRAEDLL HSIDVPNVIR ANLRVLDTVR ARLVPALRPR LSVRIYGDQP
LTTYHRWDSG ALISTFPFGH SSAATTQHEA AISSTLVQFV EQHFEKIWHP DRSVSLDDYL
RVPLRILRTD GPDRDARTIQ ARYARLGDTV YLADEELTRL VHDGGADGLV TEIGGTGRHP
LTELHRCRVV PAWEGRDGAA DAFAKKYGPQ TTAPAPDGPP LRLAPMRPPP PAPTPHAGGR
AAAGPAAPGP AGPGPTGPLP SPRSVP