Gene Franean1_7004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7004 
Symbol 
ID5675315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8536545 
End bp8539700 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content72% 
IMG OID641245850 
Producthypothetical protein 
Protein accessionYP_001511241 
Protein GI158318733 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGGC AGATGACGGT GATTCGGACC GCGGCTGTGC CCTGCGCGGA GGTCGACGAG 
GCGGTCGCCG AGGACGAGCG GGCGCTCGGA GCCGCGACCG ACCTCGTGGC GAGGATCGCG
GCGGCCCAGA ACCTCACCAT CTCCCTACAG TCGCGCTTCC ACTGCCATCG GGATCTCGCG
GACCTCGACC GGGCCATCGA GGTGATCAAC GAAATCGGCC GGGATGTCGA CACGGCTCAT
CCCGGCCGGC TCATGCTCGA CACCCAGATG ATGTCCTGTC TGATCACATC CACCCGGCTG
GAGGATGCCA GGCGGGCGCG AGAGCTCGGA GCGCGGCTGT ACCGTCTGAC GGGCCAGCTG
GGCGAGATAA GGGCGGGTGT CGCCGTCATG CTGGGCATGG CCGCCATGGT CTGCTTCCGG
CACACGTCCT CCGTCGAGGA CGCCGAGGAG TTCGTCGTCC TCCTCGAGGA GGCGGTCGGC
GGCGTTCCGG CCAACTCGCC GTCCAAGCTG ATCGTCCAGG GCGCACTGAT CGCCGCGCTG
TCCGTGCGAT ACCGGCTCAG CCCGAACCGC GAACGGGATC TGGACCGTGC GGTCGAGATC
TTCGACGAGC TCGACCCGGC AGCGAGCGGC GGAGACAAAG AGGGAGTCGT CCGCTTCGCC
TGCGGCGCGC TGGTCGACGG CCTCGTCGAC CGGCACGAAC GGCACCACGA CCCGGCCGAC
CTGCGGACCG CGCGGCGAAT CCAGGACTAC ATGCCGGTCG GTGAGGCGAG TGGTGGCCTG
ACGGAGAGCC CCGCGGCGTG GCATCCGCAG GCCTTCCGGC ACTACTCCTC GTTTCGGCAG
ACGCAGGATC CCCAGTTCCT CGATCAGGCT ATCGAGGAGC AGGACCGCGC GATGAGCGAC
CCGAGCCGGT CCGGCAAGCA ACGCGCGAGT TACGCCTACA TGCTCGCGAT CTGCTGTTTC
ACGCGTTTTC TCCTCTACCG GGGCCGCCGG CTCGGGGATC TGAACCGGGC CATCGACCTC
GTCCGCGAGG CGCGCCAGGA CTGGCGCGAC TTCACCCTCG TCCACAGTTC CGCGCAGCTG
CTCGCCCGCT TCCTCATCCA GCGCCACATG AGCCTCGGAC GCCCGGCCGA TCTCCGCGAG
GCCCGCGTCG TGCTGGAGTC GGCGCTGCGG ACCGGAGAGG TGGAGAACCG GACGAAGTCC
TCGATCCTGA TGCTGCTGGC CTACTGCGAG GCGGTCCGGG CCGAGCGCGA CGGCGACGAC
GAGGCCGCGG ACCGGTCGAT GGACCGCTTC GAGGAGCTCA CCCGCCTGAT GCCGGAAGGG
TCGATGGAGG CGATGGCGGT GCAGACGACA CTCGCCAACG CGTTGCAGTC CAAAGCGTCC
GCGAGCCAGC TTCCCGAGGA CGCCACCCGG GCGTACCGGG CGTCGCGCGC GGTGTGCTCG
CACGCGGGCG CGAGTCCCCT GGCGACGGTG CCGTGCGCCG AGGGCTGGGG CCATCTCGGC
TGGCGGCGGC AGGACTACGC GGAGGTGGCG GAGGCGTACG GACACGCGGT CGCGGGCCTG
CAGGCCATTT CCGTTCTTTA CCCGCGGCGC GAGGACAAGC AGGACTGGCT GGGCCGCGCC
GGGTCGATGG CCGCGCGCGC GGCGTTCGCG CTGGCCCGCC TCAACCGGGC CGAGGAGGCG
GTCGAGGTGC TCGAGGCGGG ACGGGCCGTC CTGCTTTCGG AGGCCCTGGA CCGCCAGCGG
GTGGATCTGG ACCGGCTGAG CGCGGTCGGT CACGGCGAGC TGCGGGAGCG GTACGAGCGC
GCCGCGGCCC GCGGCGCCGA GGCCGAGCGC AGCTTCGAGC CCGACTTCGA CATGGAGTAC
GTCCCGGTGT TCGATCCGAC CCTGCGGGAC CCCGAAGCGC CGCTGCGCGA GCTGGCCACC
GAGCTCGGTG ACGCCCGCGA GGAGCTGCGA CGGGTGGTGG GCGAGATCAG GTCCGTGCCG
GGCTACGAGT CCTTCCTCCT CCCTCCCTCG TTCGCCGACG TCCAGAGGGT CACGCGGGCC
GACCGGGTCC CGTTGGTCTA CCTGGCGGCC ACGCCACGGG GCGGCCTCGC CCTCGTCGTC
ACCGCATCGA AGGTGACCCC CGTGGAGGTG GTGTGGCTGC CCGAGCTGAC CCTGGCCAAC
CAGCGCAGGT GGGTAGGGCG GTGCGTCCAG GCCGCCGAGG AGGACGACCT CGACGCTTAC
GAGAAGGCGT CCCGATGGCT CGGCCGGGCG GTGATGGACC CGGTACTCAG GCTGCTCGCG
CCGTACCGGC GAGCCGTTCT CATGCCCGGC GGCCTGCTGG GCTCCGCGCC GCTGCACGCG
GCGAGGCTGG CGGAGACCGT CGGCGGCGCG GGCACCTACG CGCTCGACCG GCTCACCCTC
GTCTACGCGC CCAGTGTCCG CGCGCTCGCC GCCACCCGGA TCGCGCCGGG CCCGGCCCGC
TCGGGTGATC TGCTGCTCGC CGTGGCGGAC CCGGCCTCCT CGGGTGCCGA CCCGCTGCCG
GGTGCCCAGG TCGAGACCGA CATCGTCGCG GCCCATTTCG GCCCCGAGGC CCAGGTCCTG
CGCGCCGGAC AGGCAACCCG GGCGAGAGTG CTCGCGGCGA TGTCCGGATC CGACGTGCTG
CACTTCGCCT GCCACGGGGT GGCCGAGCCG ATGGATCCGC TCAGCAACCG GCTTCTGCTC
GTCGGCGACC AGTCGATGAC GTTGCGGGAC ATCGGCGGGC TCGACCGGAT GAGCCCCCGG
CTGACCGTGC TGTCGGCCTG CCAGACGGCC GTCATCGGTG ACGATCTGCC GGACGAGTTC
GTAAGCCTCG CCACCGGCCT GCTGCAGGCC GGATCGCCGG GCGTGGTGGC CACGCAGTGG
TCGGTGGACG ACGTCGCCAG CGCCGCCCTG ATGGCCCGCT TCTACCAGCT CTGGCGGGAT
GACGGAGTGG AAACGGCGGA GGCCCTGCGG CGGGCACAGC TCTGGGTGCG CCGGACGACG
AACGGCGAGA AGGTCCGCGA GTTCCCGGAT CTCCTCCGCA TCCCCGACTA CGGTCCGCCG
ACCGACGCCT CGCCGGCCGT TCGCCATGCC TGGGAGACCC GACGGGCCCA TGAGCATCCC
GTCTACTGGG CCGCGTTCAC CTTCCTCGGC AGATAG
 
Protein sequence
MAGQMTVIRT AAVPCAEVDE AVAEDERALG AATDLVARIA AAQNLTISLQ SRFHCHRDLA 
DLDRAIEVIN EIGRDVDTAH PGRLMLDTQM MSCLITSTRL EDARRARELG ARLYRLTGQL
GEIRAGVAVM LGMAAMVCFR HTSSVEDAEE FVVLLEEAVG GVPANSPSKL IVQGALIAAL
SVRYRLSPNR ERDLDRAVEI FDELDPAASG GDKEGVVRFA CGALVDGLVD RHERHHDPAD
LRTARRIQDY MPVGEASGGL TESPAAWHPQ AFRHYSSFRQ TQDPQFLDQA IEEQDRAMSD
PSRSGKQRAS YAYMLAICCF TRFLLYRGRR LGDLNRAIDL VREARQDWRD FTLVHSSAQL
LARFLIQRHM SLGRPADLRE ARVVLESALR TGEVENRTKS SILMLLAYCE AVRAERDGDD
EAADRSMDRF EELTRLMPEG SMEAMAVQTT LANALQSKAS ASQLPEDATR AYRASRAVCS
HAGASPLATV PCAEGWGHLG WRRQDYAEVA EAYGHAVAGL QAISVLYPRR EDKQDWLGRA
GSMAARAAFA LARLNRAEEA VEVLEAGRAV LLSEALDRQR VDLDRLSAVG HGELRERYER
AAARGAEAER SFEPDFDMEY VPVFDPTLRD PEAPLRELAT ELGDAREELR RVVGEIRSVP
GYESFLLPPS FADVQRVTRA DRVPLVYLAA TPRGGLALVV TASKVTPVEV VWLPELTLAN
QRRWVGRCVQ AAEEDDLDAY EKASRWLGRA VMDPVLRLLA PYRRAVLMPG GLLGSAPLHA
ARLAETVGGA GTYALDRLTL VYAPSVRALA ATRIAPGPAR SGDLLLAVAD PASSGADPLP
GAQVETDIVA AHFGPEAQVL RAGQATRARV LAAMSGSDVL HFACHGVAEP MDPLSNRLLL
VGDQSMTLRD IGGLDRMSPR LTVLSACQTA VIGDDLPDEF VSLATGLLQA GSPGVVATQW
SVDDVASAAL MARFYQLWRD DGVETAEALR RAQLWVRRTT NGEKVREFPD LLRIPDYGPP
TDASPAVRHA WETRRAHEHP VYWAAFTFLG R