Gene Franean1_5465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5465 
Symbol 
ID5673796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6609507 
End bp6611834 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content76% 
IMG OID641244320 
ProductSARP family transcriptional regulator 
Protein accessionYP_001509726 
Protein GI158317218 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGG GCAAGGTGCT GGAACCTCTC GGAGCTGAGC CGGTGGCGGC GCATTCTTTG 
TTTGTCGACG CGGTGACCGG CGGCGTCACC GTGTTTGTCG CGCCACCCGG CACGCTGCTC
ACCGACGGGC TCGCGGCCGC GCTGGAGGCG GTCGGCCGGA CACCGGTCTG GCTCCGGCTG
GGCGTCGAGG ACCGCGATCC GGCGACCTTC CTGGCAACGA TGATCACCGC GGTGCGGCGC
AGGGCCCCCG GGTTCGGCGC CACCCTGCTC ACGCGGGTGC GGGCGTGGCC GGGCCCGGTG
CACGGGTGGG ACGACATCTT CCGGCGGCTC GGCGCGGAGC TGGCCGAGCG GCTCCCGCCC
GACTCGGCGC TGGTCCTGGA GCATGCAGAC CGGATCGATC CGCACGGGCC GTCGTTACGC
CTCGCGTGCG GCCATGTTCT GGGCGCTCTG CCCGAGGCGA CGCCCCGCGT CCTGATCGGC
CACGACGCGC TCCCGAGCAC CGAGGCGCTC GGGGACGCCC GCCGGCCGCC GGCCGAGCAC
GCCCGGCTGC ACCCGTCGGA GGTGGAGCTC CCCGGCAGGC TCGGCCTCTG GCGGCGCCAC
GCGGGGGTGC TCCTCTGCCG GTCCGCCGCC CTCCGGCACG CTGTCGACTT CGTGTGCGCC
ACGATGCCGG AAGGTGATCT GGCGCGGGCC CTACGCGGCC GGACACTCGT CGGTGTGCTC
ACCGCGCTGG CCCGAGCAGT CCTCGCGGGA GCCACCGGTG CGGGCCGTCG CATGCTCGCG
CGGCTGCTGC ACGTCGAGTA CATCGTCCCC GAGGCGGACC GGCATCAACT CGCCGAGGTG
GCGGGACCAT GGTTCCAGCC GTTGATCGGC GACTGGTCAC GACTGCGCAC CGTCTGGCGT
GCGCCGCTGC TCGCCGCGCT GGCCGAGGAC GCGTCGCTGG ACCGCGAGAC CCTGCGCGGC
ACCGCGCTCG AGCTTCTGCG GGCAGGTGCT CCGGAACGGG CGATCCCCAT GCTGCTCGAG
TTCGACGATC GCGCGGTGGC GGCGCGGGCG CTGGCCAGGG TCGCCGCCGA CATGATCGCC
GCCGGCCAGT GGGTGACAGT CGGCGGATGG CTGGACCAAC TGCCGCCCGA GGCTGTCGAG
GCGGAACCGG ATCTGCTCGA CGCGGCCGCG CACGTCGCCT CGGCCCGGGG CCAGCAGGAC
CCCGCCCGTC ATCGGTCCCA CGCGGCGGCC GCCGCGGTCT CCGAGACGGA CCGGCTGGCC
CGGCGGCTGG CGGACATCCG GCGGGACCGG CGCCTGCATG CCGAGGCCGA GGCCGCGCTC
GCCCGGTCTG AGGACGAGAC GACGGCGCTC CTGCACGCGC ACATCGCGAG CGTGTCCGAC
GGCCCCGGCC CCAGGGCGCC GAGGGCCGAC GGCGCCACGC CGGGCTCCGT GTCGTTCGGC
CTGGTGCCGG GGCCCAGGCA CCCGGGTCCG GAGGCGATCC GCCGGGTCGA CTCGCCCACG
GGTCCGCCCA CGGGTCCGCC CGCGGGCCGG CTGAACGTGG GCGGACCCGT CCCCCCGGAC
GCGGCGGGGC GGGTGGAGAT GGCCGTGCAC GTGCTGGGGC CACTGTCGGT GATGGTGGAC
GGGCGGCCTG TGCGGGGGTG GGGGGCCCGG CCCCGGTCCC TGCTGGCTTA CCTGGTGATC
CACCGGGCGG ACCTGCCCCC GCGAGAGGTC GTCACCGAGG CGCTGTGGCC CGGCGCGGAC
CTGCCCGCAG CGCGGAACAA CATTCAGGTG GCCGTGTACG GCGCGCGGCG GGCCCTGCGT
GAGGCCGTCG ACCGGCAGGT CATCGTGTTC GAGCGCGGCG TGTACCGGCT GGCCTCCGAC
ATCGCGGTGG CCGTCGACCT CGACGAGTTC GACGGCCATG TCCAGGCCGG GCAGCGGCTC
GCCGGCGCGG GGCACGTCGA GCACGCCATC GCCGAGCTGG AGGCGGCGAC CGCGCTGTAC
CGGGGCGACT TCCTCGCCGA CGCGCGTTCC GAAGACTGGG CCGTGCTGCG CCGCGAACAG
CTCAGGCTGG CCTACCTGGA GGCGCTGGAC CGGCTCAGCT CGCTGTACCT GGAGACCAGG
CAGTACTCGG TCTGCGCGCT GCTGTGCCGG CAGATCCTGG AACGCGACCC GTGCCGGGAG
GACGCCCACC GCCGCCTCAT GCGCTCCTAC GCCCGGCAGG GGCAGCCGCA CCTGGCGCTG
TTGCAGTTCC GGACCTGCGC CGACGTCCTC GCCCGCGAGC TGCGCGTCGC GCCGGGACCG
GCGACAGCGC GCCTGCACGA GCGGATCCGC CGGCACGAGC CCGTCTGA
 
Protein sequence
MLAGKVLEPL GAEPVAAHSL FVDAVTGGVT VFVAPPGTLL TDGLAAALEA VGRTPVWLRL 
GVEDRDPATF LATMITAVRR RAPGFGATLL TRVRAWPGPV HGWDDIFRRL GAELAERLPP
DSALVLEHAD RIDPHGPSLR LACGHVLGAL PEATPRVLIG HDALPSTEAL GDARRPPAEH
ARLHPSEVEL PGRLGLWRRH AGVLLCRSAA LRHAVDFVCA TMPEGDLARA LRGRTLVGVL
TALARAVLAG ATGAGRRMLA RLLHVEYIVP EADRHQLAEV AGPWFQPLIG DWSRLRTVWR
APLLAALAED ASLDRETLRG TALELLRAGA PERAIPMLLE FDDRAVAARA LARVAADMIA
AGQWVTVGGW LDQLPPEAVE AEPDLLDAAA HVASARGQQD PARHRSHAAA AAVSETDRLA
RRLADIRRDR RLHAEAEAAL ARSEDETTAL LHAHIASVSD GPGPRAPRAD GATPGSVSFG
LVPGPRHPGP EAIRRVDSPT GPPTGPPAGR LNVGGPVPPD AAGRVEMAVH VLGPLSVMVD
GRPVRGWGAR PRSLLAYLVI HRADLPPREV VTEALWPGAD LPAARNNIQV AVYGARRALR
EAVDRQVIVF ERGVYRLASD IAVAVDLDEF DGHVQAGQRL AGAGHVEHAI AELEAATALY
RGDFLADARS EDWAVLRREQ LRLAYLEALD RLSSLYLETR QYSVCALLCR QILERDPCRE
DAHRRLMRSY ARQGQPHLAL LQFRTCADVL ARELRVAPGP ATARLHERIR RHEPV