Gene Franean1_6672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6672 
Symbol 
ID5674987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8102830 
End bp8105535 
Gene Length2706 bp 
Protein Length901 aa 
Translation table11 
GC content74% 
IMG OID641245523 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001510915 
Protein GI158318407 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.514873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.542159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGTCT CGGAGACGGT GTCTGTCCAG GCGCGGATCG CCGCGGAGCT GGGCGTGCGC 
GAGGGGCAGG TGGCCTCGGC GATCGACCTG CTCGACGGCG GTGCGACAGT GCCGTTCATC
GCCCGGTACC GCAAGGAGGT GACCGGCGCC CTCGACGACG CGCAGCTGCG AACCCTGGAG
GAGCGGCTGC GGTACCTGCG GGAGCTGGCG GAGCGGCGGG CGGCGATCCT GGAGTCCATC
CGGAGCCAGG GCAAGCTGGA CGACGCCCTC GAGGCGCAGA TCATGGCCGC CGACACCAAG
GCCCGCCTCG AGGACATCTA CCTCCCCTAC AAGCCGAAGC GCCGGACGAA GGCGCAGATC
GCGCGTGAGG CCGGGCTGGA GCCGCTCGCG GACGCGCTGC TGGCCGACCG CTCGCTCGAC
CCCCGGGCGG AGGCGGAGCG CTACGTCGAC GCCGAGAAGG GCGTCGCGGA CGCCACGGCC
GCGCTCGAGG GCGCCCGCGC CATCCTGGTG GAGCGCTTCG CGGAGGACGC CGACCTGATC
GGTGCCCTGC GCGAGCAGAT GTGGTCCCGT GGTTATCTCG TCAGCCGGGT CCGCGAGGGC
AAGGAGTCCG ACGGCGCGAA GTTCGCCGAC TACTTCGACT TCGCCGAACC GTTCACGAAG
CTGCCGTCCC ACCGGATCCT CGCGATGTTC CGCGGTGAGA AGGAGGAGAT CCTCGACCTC
ACCCTGGAGC CGGACGCGCC CGCCGACCCG GCCGAGCCCC CGGCCCCCGG CCCCACCGAC
TACGAGCGGC GCATCGCCGA GCGTTTCGAC ATCGCCGACG CCGGCCGGCC GGCCGACCGC
TGGCTGCTCG ACGCGGTGCG CTGGGCCTGG CGGACGCGGG TCCTGGTCCA TCTCGGCGTC
GACCTCCGCA TGCGGCTGTG GACGTCCGCC GAGGACACCG CGGTGCGCGT GTTCGCGGCG
AACCTGCGTG ACCTGCTGCT CGCCGCCCCC GCCGGGCAGC GGCGCACCAT GGGCCTCGAC
CCGGGGTTCC GTACCGGCGT CAAGGTGGCG GTCGTCGACG CGACCGGGAA GGTCGTCGCC
ACCGACACGA TCTACCCGCA CGTCCCGGCC CGCCGCTGGG ACGACTCGGT GGCCTCGCTG
GCGCGGCTGT CCGCCGAGCA CGGCGTCGAG CTGATCGCCA TCGGCAACGG GACGGCCTCC
CGGGAGACCG ACAAGCTGGC CGACGACCTG ATCCGCCGGC ATCCCGAGCT GAAGCTGACC
AAGGTGATGG TGTCGGAGGC CGGTGCCTCC GTGTACTCCG CCTCCGCCTA CGCGTCCCAG
GAACTGCCTT CGATGGACGT CTCCCTGCGC GGCGCGGTGT CCATCGCGCG CCGCCTGCAG
GACCCGCTCG CCGAGCTCGT CAAGATCGAC CCCAAGTCGA TCGGGGTCGG GCAGTACCAG
CACGACCTGG CCGAGGCCAG GATGTCGTCC TCCCTGGACG CCGTCGTCGA GGACTGTGTG
AACGCGGTCG GGGTGGACGT CAACACCGCC TCGGCGCCGT TGCTCTCCCG GGTCTCCGGC
ATCAGCGGCG GGCTGGCCGA CAACATCGTC CGCCACCGTG ACAGCAACGG CCCGTTCCGG
TCGCGGACGG GGCTGCTGGA CGTGGCCCGG CTGGGCCCGA AGGCGTTCGA GCAGTGCGCC
GGCTTCCTGC GCATCACCGG CGGCGACGAC CCGCTGGACA GCTCCAGCGT GCACCCGGAG
TCCTACCCGG TAGTCCGGCG GATCCTGACG GTGACCGGTG GCGACCTGCG CGCGCTGATC
GGCGACACGA AGACACTGCG CTCGCTCAAG CCCACCGAGT TCGTCGACGA CACGGTCGGC
CTGCCGACGG TGACCGACAT CCTCGCCGAG CTGGAGAAGC CCGGCCGCGA CCCGCGGCCG
GCGTTCCGGA CGGCCGAGTT CACCGAGGGC GTCGAAACCC TCGCCGACCT GGTGCCGGGC
ATGATCCTCG AGGGCGTGGT GACCAACGTC GCCGCGTTCG GCGCCTTCGT CGACATCGGC
GTGCACCAGG ACGGCCTCGT CCACGTCTCG GCGATGTCGA AGAACTTCGT CAGCGACCCC
CGTGAGGTCG CCAAGCCCGG GGACATCGTG CGGGTGAAGG TCCTCGACGT CGACATCCCG
CGCAAGCGGA TCTCGCTCAC GCTGCGCCTC GACGACGAGC CGGGCGCGGC GGACGCCGGC
GGCGGGCAGA GCGGTCAGGG TGGCCAGGGC GGTGAGCGGC GCGCGGGCCG CGGCGGCCGC
GGCGGGCAGA ACCGCGGTGG CGCCGGTGGC GCCGGTGGCG CCGGCCGGAG CCAGGAGCGC
GACGGCGGGC AGCCCGCCGA CCAGCAGGGC GGCCGGCAGG CGGCAGCGGC CGGCGGCGGG
CAGCCGGCCG GCGGCCAGCC CGGAGGCGGG CGTGGCGGTC CGGGGCAGCG GGGCGGCCCC
GGCCAGCGCG GTGGTCCGGG CCAGCGCGGT GGCGGTGGCG GTGCGGGTCA GCGCGGCGGT
GCCGGTGGCG CGGGTGGCGG TCAGCGCGGT GGCGGCGGTG GCCAGCGCGG CGCCGGTCAC
CGCGGCGGGC GGACCGACGG GGCCATCGCC GACGCGCTGC GCCGCGCCGG TCTGGTGACC
GGCGACGAGG TCACCCTCGG CGGAAAGCAG GACGACAGCC GGGACAGCCG CCGCGGGCGC
CGATAG
 
Protein sequence
MSVSETVSVQ ARIAAELGVR EGQVASAIDL LDGGATVPFI ARYRKEVTGA LDDAQLRTLE 
ERLRYLRELA ERRAAILESI RSQGKLDDAL EAQIMAADTK ARLEDIYLPY KPKRRTKAQI
AREAGLEPLA DALLADRSLD PRAEAERYVD AEKGVADATA ALEGARAILV ERFAEDADLI
GALREQMWSR GYLVSRVREG KESDGAKFAD YFDFAEPFTK LPSHRILAMF RGEKEEILDL
TLEPDAPADP AEPPAPGPTD YERRIAERFD IADAGRPADR WLLDAVRWAW RTRVLVHLGV
DLRMRLWTSA EDTAVRVFAA NLRDLLLAAP AGQRRTMGLD PGFRTGVKVA VVDATGKVVA
TDTIYPHVPA RRWDDSVASL ARLSAEHGVE LIAIGNGTAS RETDKLADDL IRRHPELKLT
KVMVSEAGAS VYSASAYASQ ELPSMDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ
HDLAEARMSS SLDAVVEDCV NAVGVDVNTA SAPLLSRVSG ISGGLADNIV RHRDSNGPFR
SRTGLLDVAR LGPKAFEQCA GFLRITGGDD PLDSSSVHPE SYPVVRRILT VTGGDLRALI
GDTKTLRSLK PTEFVDDTVG LPTVTDILAE LEKPGRDPRP AFRTAEFTEG VETLADLVPG
MILEGVVTNV AAFGAFVDIG VHQDGLVHVS AMSKNFVSDP REVAKPGDIV RVKVLDVDIP
RKRISLTLRL DDEPGAADAG GGQSGQGGQG GERRAGRGGR GGQNRGGAGG AGGAGRSQER
DGGQPADQQG GRQAAAAGGG QPAGGQPGGG RGGPGQRGGP GQRGGPGQRG GGGGAGQRGG
AGGAGGGQRG GGGGQRGAGH RGGRTDGAIA DALRRAGLVT GDEVTLGGKQ DDSRDSRRGR
R