Gene Franean1_6956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6956 
Symbol 
ID5675269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8474523 
End bp8479025 
Gene Length4503 bp 
Protein Length1500 aa 
Translation table11 
GC content67% 
IMG OID641245805 
ProductNB-ARC domain-containing protein 
Protein accessionYP_001511196 
Protein GI158318688 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.270488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCG AGCCGGTGTC AGCGCAGGTT CGCGCGTCTC CTCGCGAGAT CGCGGATGCG 
GTCTGGCTTT CCGCGTTCCT GGAGCCGGAG CCGAGCGTCC CGGAGAGCAC CGTGGACGCA
CACGGCGACG TCCAGCCGAC GCGGCCCGGC CCGGCTGGAC CGGCCGCCAC CGAGGTCATC
GCCGATGACT ACCGGAGCCA TCCGGAGGCG GGTGAGCGAC CGGCGGAAGC GCCGGGCACC
GGCGCCCTCC CCCGCGCCTC TGACGGGATG GTTCCAGCCA GATCACCACG GAACGTGAAT
ATGCAGCCCT CGGTGGCCAC GTTCGCAGAA GTTTCCATTC CCGCCCGGCA CGAGGTCGCC
GGCGTCCAGA CCACCACGGC CGGAGCCTAT ACACCAGTCA CGCTGAGTGA GACGGCTCGA
GGTATGGCCC GCGCCCTGGC AAGCCTCCGG CAGACTGTGT CGTCGGCGAT CAACGTAGAT
CTCGACGAGG AGGCCACCGC TGAGCGGCTC GCAGGTGAGG CGCTCGCCAT GCCTGTTCAC
CGCGCACGGC TCGAACGGCG ATGGGACATG ACCCTCGTCA TCGACACGAG CACCTCGATG
CACGCGTGGC GCGACGAGGC GGCGCGGCTG GTCGCAGCGC TCGAGCGCTG CGGGGTTTTT
CGTGACGTGA ACCATTGTTA TGTCGACACG GATGTCGAGA CGGCCTCTGA GCTGCGGCTG
CGTGGATCGC GCACTTCCGC CGGCACGCGG TCGCCCGACC TTCTGGTACG GCCCGGCCTC
GACCACGTTG TCTGGATCTA CTCGGACACC CTGGGCAGGG CATGGCGTTC GCACGCCGTG
TTCTCGCTTC TGTGGCGATG GGCCGGGAAG GCGAACACCG CCGTGCTGAC GCCGATCAAA
CGGCGCATGT GGCACAACAC GAACATCCGT TCATATCCCA TGCTTGGTTC GGCGAGGCTT
GGCCCGGCGC CCGCCTCGGG AATGTCGTGG TCCTTCCGCC AGTCCTGGGA CAAGAGGCTG
TTCGCGCTGG ACGTCGATAT AGAAAACGCA CGTCCGATCC CTGTTCTGGA ACAGAGTCGC
CATGCTGTGG AAAAGTGGGC GCGGGCGCTG GCGGGCCGGG CTGACGGCCG AACCGAGCTG
CCGGTCATGC TCGTTCCACC GGTCACCCGC ACGGTTGTCC CGGCGGAGAC CGCGGATACC
GACGCGACAC GCCGGAACTC CACCGACGAT GCCGGCTGGC AGCAGGTGGC GGCATTCCAC
AATGCCGCCA CCTCACCAGC ATTCGACCTG GCGACGCATC TGGCCGCCGC ACCGCTCACC
TGGTCAATGA TCGATCAGGT GATCGCGATG ACCCCTGGTG CCGACCGGCG AGAACTCTCC
GAGCTGTTCA TGCACGGGCT CCTCACCCGG ACGGGCCCGT CCGCCGCGGC GGCCACCGGG
CACGTGCCGG GAGCCGAGGC GGAGATAGTC CTCGACTTTC TCCCCGGAGT TCGTGCCAAC
CTGCTCGCGT TCGGCCGTCA GCGCGACACC ATCCGCGTAC TGAAGGCTGT CTGTGACCAT
CTGGGACCCA ATATCGCGAT GGTTCGACAT CTTCGGCAGG CGATAGATTC GCCCGGATCG
GCGCCGATTC CTGAGGTGAC GCCCGCCAGT GCGCCTTTCC TCGCGGTGGA GGAGACAGCG
TTGTCCGCCT TGTCCGGTCC GTTCCTGGCG CGAGCACGCC GTATTCAGGA ACATTTTGTG
GCAGCGGCGG ACAAGCAGCA CATCCTCACA TCCACCCAGT CACCGACAGT AGTTAGTAAC
ATGCCCTTGG TGACAGTTGA TATTAGCGCT GCGTCAGACA CCGACAGCGG CGACGAGATC
TCCGTTCCGC CGCCACCCCG CGCGCCGACG GACGGCGACG GCCCAGACCG CCTAGGGGAT
AGCCAGCTAG GGGGTAACCG GACACATCTG AAAGGAGATG TTTTGACCTC GACCACCGAG
ACCTCGACGG CGAAGCAGCA GCAGGCGCGC CAGCACCCGG TGGTGTTCGG AAACGTCCCC
CAGCGAAACC CGTATTTCAC CGGGCGCAAC GGACTTCTCC GCGAGCTTCA CACGAGGCTC
GGCCACGGCA CCACCGCGGT ACTGCCGGAA GCTCTGCACG GAATGGGCGG GGTGGGTAAA
TCCCAGCTCG CGGTGGAGTA CGTATACCGG CACCAGGCCG ACTACGACAT CGTGTGGTGG
ATACCGGCCG AACATTCCAC ACAGATCGGC AAGGCGTTGG CGGAACTCGC CCAACGCCTT
GGCCTGTCGG TCGGCGGGGA GGCGAACACG GCCGTTCCGG CGGTGCGGGA GGCCCTGCGG
ATCGGCGTCC CCTACGGCAA CTGGCTCCTC GTTTTCGACA ACGCGGAGGA TCCCCGCGTC
GTTCGAGAAT ACTTCCCACA GGGCGGAAAC GGAAAGATCC TGGTGACGTC GCGAAATGCG
CAGTGGTCCA GTATCGCACG CCCTCTGGAG GTGGACGTCT TCAGCCGGGA GGAGAGCGTG
GAGCTCCTTC AGAAGCGCGA CACGGACCTC ACTGACCACG ACGCCGGCCG CCTCGCCCGG
GCGCTCGGAG ATCTTCCTCT CGCGGTGGAG CAGGCGGCCA CCTGGCGCGC CGAGACGGGA
ATGAGCGCCG ACGAGTACCT CACCCTCTTC CAGGAGAAAA GGGACGAGCT GCTCGGCACC
TCCCCTCCAA TGGACTACGA GGTCCCCGTC CAGGCCGCGT GGAACCTGTC CCTGGATCGA
CTCGCAGACC GCAACCCGGC GGCGCTGAGG CTTCTCCAGG TGTGTTCCTT CTTCGCGCCG
GAACCGATTC CGCGGCAGGT CTTCCGGCGC GGCCGCAACA TTATGATCAT GCCGGAGCTG
GACGCGGCCC TGCGCGACCC CTTCAAGCTG AACATGGCCA TCCGGGAGAT CACCCGATAC
GCGCTCGCCC GGGTCGACCA CCGGACGAAC TCGATCCAGA TGCACCGCCT GGTGCAGACC
GTGCTGCGCG GTCGGATGAC CCCCGACGAG CGGGAGACGA TGCGGCACGG CGCCCATCTG
CTGCTCGCCG CCAACGATCC GGACGAGCCC AGCAACCCGG AGAACTGGGA GCAGTACTCC
GAGCTCTATC CGCACGTCAT CGCGTCCGAG GCCATCGGCA GCCGTGACCC CTTCGTCCGC
GACCTGCTCG TCCACGAGGT GGAATACCTG TTCCGGTGGG GTGACCACGA GGGAAGCCTC
ACGCTCGCGC AGCAGACGTA CGACGCCTGG ACCGGCAATC CGGATCTTGG CGAGGAGGAT
CCGCACAGCA TCACCATGGC CGGCTGGGTC GGCTGGGTGA GCTACATAAC CGGCCGCTTC
GCGGATGCCG CCCGGGTCAA CAAGCGGCTG CTGGAACTGT GCGAGCAGGT CCACGGCGAC
AACCACACCG AGACCCTCGA GGCGCTGGGA AACGTCGGCG CGGACCGGAT AGTCGCCGGC
GATTTCGAGG ACTCGCTGAG ATTCGCGCGC GAGCGACACC GGCGCGCCCT GCGCGCCTAC
GGGCCGGGTG ACGCGGTCAC TCTCGACGCG GCCCACAACG TGGGGCTTGG CCTGCGGCTC
CTCGGCAGAT TCCAGGAGGC CAAGGAACTC GACCAGGAGA CCTGGGAACG CCGCGTACAG
CTGGTCGGCG AGGACAACAT CGAAAGCCTC CGCACCTACA GCAACCTTCT CGTGGACGAA
CGTGAACTCG GCGACTACCA GGGGGTACGC ATCCGGCTCG AAGACATGGT GGAACGCGTC
CGCCGGCTGG TGAAGAACAT CGAGGACCAT CACGAGCTGC TCCGGGTGTC GGGGCTCCTC
GCGATCGCGC GCAGGAAATC CGGTGACCAC GACAGCGCCC TCGAGCTCTC CCGAGACGTG
GAACGTCGCT CCCTGCGGCG CTACGGCAAG GACACTCCCC GCACGATCGG TGCCGCGCTG
GCCCTGTCCA TCGATCTCCG GCATGCCGGG GAGCTGGCCG AGGCACGCGA GCTGTGTGAG
GCCACCAGAA GGCGCTTCGA CCGCGCGTTC GGTGCCACCC ATCCACACAC TCTCGCCGCG
ACGGTGGACC TAGGTGTCAT CTCACGGCTC GCCGGCGATC TGGAGACCGC GTCCGAGCTG
AGCCGAACCG GCCTGGATGG GCTTCGCAGC AGGCTCGGCG AGGACCACGC ACACACGATG
ATCGCGGCGA CGAATCTCGC CAGCGACCGC TACGCCCTCG GCGAGTTCCA GACCGCGCAC
GACATGGACG TGGCGACGCT GGAACGCAGC CGCCGCGTCC TCGGCGAGAA CCATCCGTCC
ACCCTCGCCT GTGCCAGCAA CCTGGCGCTG GACCTGCGAG CCCTTGGCGA GGACGGCCCG
GCGGAGAACC TCCTCGCCGA CACCGTCGTG CAGCTCGACC GGGCTCTGGG CAAGGGCCAT
CCGGCGACCC GCGCCGCCGC CAGTTTCGTC CGGGCCGACT GCGACATTGA TCCGTTCGTG
TGA
 
Protein sequence
MAPEPVSAQV RASPREIADA VWLSAFLEPE PSVPESTVDA HGDVQPTRPG PAGPAATEVI 
ADDYRSHPEA GERPAEAPGT GALPRASDGM VPARSPRNVN MQPSVATFAE VSIPARHEVA
GVQTTTAGAY TPVTLSETAR GMARALASLR QTVSSAINVD LDEEATAERL AGEALAMPVH
RARLERRWDM TLVIDTSTSM HAWRDEAARL VAALERCGVF RDVNHCYVDT DVETASELRL
RGSRTSAGTR SPDLLVRPGL DHVVWIYSDT LGRAWRSHAV FSLLWRWAGK ANTAVLTPIK
RRMWHNTNIR SYPMLGSARL GPAPASGMSW SFRQSWDKRL FALDVDIENA RPIPVLEQSR
HAVEKWARAL AGRADGRTEL PVMLVPPVTR TVVPAETADT DATRRNSTDD AGWQQVAAFH
NAATSPAFDL ATHLAAAPLT WSMIDQVIAM TPGADRRELS ELFMHGLLTR TGPSAAAATG
HVPGAEAEIV LDFLPGVRAN LLAFGRQRDT IRVLKAVCDH LGPNIAMVRH LRQAIDSPGS
APIPEVTPAS APFLAVEETA LSALSGPFLA RARRIQEHFV AAADKQHILT STQSPTVVSN
MPLVTVDISA ASDTDSGDEI SVPPPPRAPT DGDGPDRLGD SQLGGNRTHL KGDVLTSTTE
TSTAKQQQAR QHPVVFGNVP QRNPYFTGRN GLLRELHTRL GHGTTAVLPE ALHGMGGVGK
SQLAVEYVYR HQADYDIVWW IPAEHSTQIG KALAELAQRL GLSVGGEANT AVPAVREALR
IGVPYGNWLL VFDNAEDPRV VREYFPQGGN GKILVTSRNA QWSSIARPLE VDVFSREESV
ELLQKRDTDL TDHDAGRLAR ALGDLPLAVE QAATWRAETG MSADEYLTLF QEKRDELLGT
SPPMDYEVPV QAAWNLSLDR LADRNPAALR LLQVCSFFAP EPIPRQVFRR GRNIMIMPEL
DAALRDPFKL NMAIREITRY ALARVDHRTN SIQMHRLVQT VLRGRMTPDE RETMRHGAHL
LLAANDPDEP SNPENWEQYS ELYPHVIASE AIGSRDPFVR DLLVHEVEYL FRWGDHEGSL
TLAQQTYDAW TGNPDLGEED PHSITMAGWV GWVSYITGRF ADAARVNKRL LELCEQVHGD
NHTETLEALG NVGADRIVAG DFEDSLRFAR ERHRRALRAY GPGDAVTLDA AHNVGLGLRL
LGRFQEAKEL DQETWERRVQ LVGEDNIESL RTYSNLLVDE RELGDYQGVR IRLEDMVERV
RRLVKNIEDH HELLRVSGLL AIARRKSGDH DSALELSRDV ERRSLRRYGK DTPRTIGAAL
ALSIDLRHAG ELAEARELCE ATRRRFDRAF GATHPHTLAA TVDLGVISRL AGDLETASEL
SRTGLDGLRS RLGEDHAHTM IAATNLASDR YALGEFQTAH DMDVATLERS RRVLGENHPS
TLACASNLAL DLRALGEDGP AENLLADTVV QLDRALGKGH PATRAAASFV RADCDIDPFV