Gene Franean1_7323 details


Gene Information

Locus tag: Franean1_7323
Symbol:
ID: 5675624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8952575
End bp: 8955334
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 74%
IMG OID: 641246160
Product: hypothetical protein
Protein accession: YP_001511548
Protein GI: 158319040
COG category:
COG ID:
TIGRFAM ID:
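
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(8952575, 8955334)
print(length_bp)                  # 2760
print(protein_length(length_bp))  # 919
```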


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.815907
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCC GTAACCCGTT CCGCCGGCAC CGGCTGGCGG CCGCGGCGCT GCTCGGGACC 
GTCGCGTTGG TCTGGCCCGG GGCCGATGTG GGCCTGGCGG CGGCTCCGCC CGACGGCGCA
CGAGCGCCCG CCGCCACGAG CATCACGAGC ACCACGCTCG ATTCGCACAA TCCACAGGCT
TCATCCACAG TTTCCACAGG GCTGTCCACA GAGTTGTCCA CAGGATCGGC CTCATCTGTG
GGAATCTCGC CGGACAGCGC GGCGATCATC CAGCCGATAA CCCGGACGAA CCCCAGCACA
AGCGGTACTT CGGGTACTCA ACAGACCACA CCAAGCACCG GTGGGACCAC GAACGGGACC
ACGGGCCTTC CCATCGAGGT CCGCCTCACC AGCCTCGTCA CCCCGAGCGC CGCCGCCCCC
CAGCTCCGGG TGACCGGCAC GGTCGACGTG GGCGGGACGG TGCTGCCGAC CGACCTCGGT
GTCGTCCTCG AGGTGGGCGG CCCGCTGCGG TCCCGGGGCG AGCTCGCCCA GCTGACCAGC
GGCGGCCCGC CCCCCCAGAT CCGTTACTCA CTGCTCGGCA GCGGGCCACT GACCATCAGC
CCCACCGCGA CGACCCCCGG CCAGTTCGCC GCGCAGGCGG ACGTCCCCGC GATGTTCCGC
TCGTCGGCCG TGAGCGTGCG GCCGGTCCAG ATCAGGATCG TCGGCCGGGT CGGCGAGCGC
GCGCGGAGCA CCGTCGCGTC GCTGACGACG TTCATAGTCG TCTCGCCCCC GCAGGCGGCG
GCGCGGACGG CGGTCGTCAC CGTCCTGCCC GTCTCCAGCA AGCCCCGGCT GCGCTCGGAC
GGCCTGCTCA CCGACGACAC CCTCACCGAC GAGATCCGCG ACGGCGGCCG GCTCGACGCG
CTGCTCGACC CGCTCGAGTC CACCGCCCCC GGGACACCGC CGCCCCAGGT CGCCCTCGCG
GTTGACCCCA CGCTGGTACA GGCGTTGCAG CGGATGGAGA CCCCCTACCA GTACGCCACA
CCCGGCGGAA GGAGGAATGC CGACCCCGAC CCCGACGCGG GCGCCTTCCT CGACCGGATC
AAGGACTTCG CGCGGCGGGG CGGCACCGTC ATCGCCCTGC CCTACGGGGA CACCGACCTG
CCGGCCCTCG TCCGCGCCGA CCAGCTCGAC GCGCTGCAGT ACTCGGTGAA CACCGGCCAG
CTCGTGATCG CCGGGCTGCT GGGCGGCAAG ATCCCGAGAA GCGGCACGAT CGCCTACCCC
GCCGACGGCA TCGCCGACCC ACACACCGTC GACGTGCTGA GTGCCAACGG CGCCGGCACG
GTGATCGTCG ACGACCGGCT GCTGCCCGCG GCCCAGTCCG TGCGGTACAC CCCGTCCGCG
GCGGTCACCC TGCCGACGTC CACCGGCCAG GTCCGCGTGC TCGCGGCGGA CCACCGGCTG
GCCGACGCGG TCGCCGGCTA CGACGGCCAG CAGCTCCTGG AGCAGGCCCT CGCCCGGTTC
CGCGCCGAGC TGGCGATGAT CACCGCTGAG CCGGCGAACG AGCGGGCCGC GGTGCTGGCC
CTCCCGCGCG ACTTCACGCC GCCCCCCGGC TGGCTCAACG CGATCCTGCG CAGCCTCGAC
AGCGCCTACT CCAGGCCGGT CGGCATCGAG GAGCCGTCCC CGGACGCCAC CCGGCAGCGC
GCCGGCCTGA CCTACAACGC CGACGCGCAG AGCCGGGAGC TGCCGGTCGA CTACGTGAAG
GGCGTCGGGC AGATCCGCGG CGAGGTGGCC GTCCTCAGCG CGGCCTTCTG CCCGCCGACC
ACCCTGACCG GCGACCTGCT ACAGCAGTGC CGTCTGGGCA AGGTCGACCC GATGCGCAAC
ACGCTGATCA CGGCGCTGTC GGCGTGGTGG CGGACGGACC GGCTCGGCGG GTTCTCCCTG
GCCCAGCAGG CGGACGGGCA GGTCGGTGAC TACCGGTCGA AGATCCGGGT CGTCGCGTCC
CGGATCGTGA ACCTCACCAG CAGCCGCGGA CGGGTCCCGG TGACGCTGGA GAACGGTTCG
GACTGGAATG TGACCGTGGT CCTGAAGCTC TCGTCCACCG ACCGGGGGCG GCTCGTGTCG
GCGACCGAGG TCACCCGGGT TCTCGAACCG CTCCAGAAGG ACCAGTTCGA GATCGAGGTG
GACGCCGAGA GCGCCGGCAC CTTCCCGGTG GACATCCGGC TCGAGACCGT CGACGGGCAG
GCCCTGGGCC CGGATGCCGC CGCCCGGGTG CTGGTGCGCT CGACGGTCTA CGGGGCCATC
GCCACGGCGA TCACGATCGG CGCGATCGGC GTGCTGATGC TGGCCGTGCT GATCCGCCTG
CTACGCAAGC TCCGCGCGCG CTCCCGCGGC GGATCGGCGG CGGATGCGGC GGCCGCGGCC
CCGCCCGACG GCTCCGGCCC GCCAGCCGGC GCCGGTCTCC TCGGCCCGGC CGGGGCGAAC
GGGGCCTCGG GCGCCGGCGA TCCCGACCAC CCCGGCACGG GCGACGGGCC GTTCGATCCC
CCCGGGACCG CCGGCCCCGG CTGGCCGGAG CGGCCCGTCG ACGATCCCGC GCCGGCCGGC
CTCGGGCACG ATCCCTACTT CGACGGACAA CCCGCGCCCG CCGCGTACCG GGAGCCCACC
GGCTCGCCGT ACGGCCCGTC CACCGGTGGT CCCGGCGGCA GGACGGCCGA GCCGTGGTCC
GGGCGTCCCG CGACCCCGGC CGGATCCGCG TCCCAGCCGC GGCGACGGGA CGGCCGGTGA
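
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as displayed here, in space-separated blocks of ten bases. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example only processes the first displayed line, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, as displayed on the page.
first_line = "GTGACCGGCC GTAACCCGTT CCGCCGGCAC CGGCTGGCGG CCGCGGCGCT GCTCGGGACC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the whole gene sequence through the same two functions gives a value that can be compared with the GC content field.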
 
Protein sequence
MTGRNPFRRH RLAAAALLGT VALVWPGADV GLAAAPPDGA RAPAATSITS TTLDSHNPQA 
SSTVSTGLST ELSTGSASSV GISPDSAAII QPITRTNPST SGTSGTQQTT PSTGGTTNGT
TGLPIEVRLT SLVTPSAAAP QLRVTGTVDV GGTVLPTDLG VVLEVGGPLR SRGELAQLTS
GGPPPQIRYS LLGSGPLTIS PTATTPGQFA AQADVPAMFR SSAVSVRPVQ IRIVGRVGER
ARSTVASLTT FIVVSPPQAA ARTAVVTVLP VSSKPRLRSD GLLTDDTLTD EIRDGGRLDA
LLDPLESTAP GTPPPQVALA VDPTLVQALQ RMETPYQYAT PGGRRNADPD PDAGAFLDRI
KDFARRGGTV IALPYGDTDL PALVRADQLD ALQYSVNTGQ LVIAGLLGGK IPRSGTIAYP
ADGIADPHTV DVLSANGAGT VIVDDRLLPA AQSVRYTPSA AVTLPTSTGQ VRVLAADHRL
ADAVAGYDGQ QLLEQALARF RAELAMITAE PANERAAVLA LPRDFTPPPG WLNAILRSLD
SAYSRPVGIE EPSPDATRQR AGLTYNADAQ SRELPVDYVK GVGQIRGEVA VLSAAFCPPT
TLTGDLLQQC RLGKVDPMRN TLITALSAWW RTDRLGGFSL AQQADGQVGD YRSKIRVVAS
RIVNLTSSRG RVPVTLENGS DWNVTVVLKL SSTDRGRLVS ATEVTRVLEP LQKDQFEIEV
DAESAGTFPV DIRLETVDGQ ALGPDAAARV LVRSTVYGAI ATAITIGAIG VLMLAVLIRL
LRKLRARSRG GSAADAAAAA PPDGSGPPAG AGLLGPAGAN GASGAGDPDH PGTGDGPFDP
PGTAGPGWPE RPVDDPAPAG LGHDPYFDGQ PAPAAYREPT GSPYGPSTGG PGGRTAEPWS
GRPATPAGSA SQPRRRDGR
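
The protein sequence follows from the gene sequence under translation table 11. The sketch below is illustrative and is not the database's own pipeline. It relies on two facts about table 11: its codon assignments match the standard code, and it allows alternative start codons such as GTG, which are read as methionine at the first position. The function name `translate_cds` is hypothetical, and the code assumes the input is a complete CDS whose first codon is a valid start.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; the standard code and
# translation table 11 share these assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as Met (assumed start)."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    if residues:
        residues[0] = "M"  # initiator codon (here GTG) is read as Met
    return "".join(residues)


# First ten codons of the gene sequence above plus its final stop codon (TGA).
fragment = "GTGACCGGCC GTAACCCGTT CCGCCGGCAC TGA"
print(translate_cds(fragment))  # MTGRNPFRRH
```

The output matches the first ten residues of the protein sequence above, where the leading GTG codon appears as M.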