Gene Franean1_4259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4259 
Symbol 
ID5672614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5081796 
End bp5084339 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content72% 
IMG OID641243132 
Producthypothetical protein 
Protein accessionYP_001508549 
Protein GI158316041 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.177218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGC GGGCCATGGT GGAGCCGGTT GCCTTCCTGA GCTATGCGCA CACGGACGAC 
ACCGGTCGCG GTCGTCTCAC CAGGCTCAGG ACGGATCTGG AACACGAGCT CTGCGAGCGC
TCCGGCCGGG ATATCAAGAT CTTCCAGGAC CGCAGCGACG TCGTCGCGGG TGACCTCTGG
AGATCCACGC TCGAGACCGG GCTGGGCAGG GTCCTGTTCC TGCTGCCGGT GGTGTCTCCC
GCGTTTGTGG CCAGTGACGA GTGCCTGCGC GAGTTCACCC AGTTCCTGGC CCTGGAACGC
GCCGAGCAGT CCCGGCGCGG CGTGCGCCGC CGGATAATCC CGATCTACTG GTCCGAGGTG
GACCTGGCCG GGCTGCTCGC CGCCGCCGAG GAGAACGGCG ACGAGGAGCG CGCCGCCATG
CTCCGGGTGC TGCGCGATCG CCAGCACCAC GGTTCCCGGC CGCTGCGCTC CCTGCGCCCG
AGCAGCGGGA CGTACGTGCG GGAGGTCGGT CGCCTGGCGC AGGTGATGGC GGACCGGTTG
CCCTCGCCAC CGAGCCTCGG GCGCCGGCTG ACCGATTCGG TGTCCCTCCT CGCCCGCCGC
CGCCGTAGCC GTAGCCGGTT GTGGGCCACG CTCGTCGGCG CGGCCACCCT CCTGGCGGTG
CTGGTCACGT TGGTGCTCTG GAACATCATC GGAACCGGCG ACGGGGGCGG CACCGCCACC
GGCCCGGGCA CCGGCGGCCC GTGCGGCAGA GCGGCGTTGG AGGCGATGCG GTCGCGCCAG
GGCGCGGCCA CCCCGAGGAA GTTGCAGTGC ATCGGTGTCC GGACGGGCGC GGACGGGGCC
TTCGCCGTCG CCGACCCGAA CGCCGGTGCC GGCGCGGCGG CGGGCGGGGG CGCGTTGGCG
GGCGGGGGCG GCCCGGCGGT AGCGGCGGTG CCCAGTGCCA CCGAGCGGAG CGTCGAGGCG
GCCTGGCAGC TGATCGTGGC CGAGAACGAG GCGACCGAGC GGGCGGACCG CGAGCGCGGC
GCGCCGACCG ACCATCTGAC AGTCGTGGTG GCCGGCCACC TGAGCTCCGG GGAGGAGACC
GATGGGCTTG ACAACGTCGA GCTGCTCACC GCGCTGACCG CGATGCGGCG GTTCAACGCC
GCGCACCGCG ACGGCCCGCT GATGCGGGTG TGGGTCGCGA ACCTCGGCGG GGACGCGGAG
TTCGCAGCCC AGGCCGGACG CAAGATCAGC GATGCGGCCA GGGGTACCGG GCCGGTGGTT
GTCGTGGGGC TCGGCCAGAC CCGGGACGAC TCGGAGAAGA TGATCAATGA GATCGGCCGG
ACAGGGCTCG CCGGCACCAC CGGCGCCTTC TCGGGCGTGC CGATGATCGC CGCCACCCAG
TCCGGGACGG ACCTCACCGG GGAGCCCGGC TACTTCCGGA TCGGCGGATC CGACGACCGG
CAGGCCGAGA TGGGCCTGTC CTGGGCGCCG CGGGGCCAGC TCGACCAAGT CGTGCCGTTC
ATCGTCTACG ACGAGGCGGA CCGCTGGGGC CGGAGCATGT ACGACGCCTA CACCAGCAAG
CTGACCAGCC CCGAGTACGA GTCCCGGTTC CGCGGCACCG CGTTCGAGAC CCTGCTCGAC
AATCGCGTGA TCGCCTACTC GACGAATCCC GACACCCCGA GACCGCTGCC CGAGGCGCTG
CCGGACCCGC AGGCCAAAAT CTGCGGGGAC AACCCCGAGA TAGCCGGGGT GGAGGTGGAG
AACGCACCCC GGCTCATCAT CTATGCCGGG CGGACGGCCG AGCTCCCCGA GCTACTCACC
CTTCTCATGA CGCTGCCCTC GAAGTGCCGG AGCAACGTCC ACGTACTGGG TGGGGACGCG
CTTTCCGACC TGCGCGACGA GGATGCCTGG GACACCGTCC AGGAGATTCT CGAGAAGGAC
CGCGCCGACA GGCCGAAGCT CCTGTATTCG GCTTTTGGCC CGGAGGAGCC CCGTTCGGCG
CGAACGCTGC TTGAGAACGG CCTCATCGCC GACTTCGGCC CGGCCGAGAG CGAGTTCGGC
AGGGTCCGGG CTCTGCTCCC GGAGGAGGAC CAGCCCAGGT TCGCGGAGTT CCCTGACTCG
GCCGCCTGGT CCCTGTACGA CGCCGTCACG CTCGCCGGGC AGGTCGGACT GCGTGCCGGC
TCGTGCCGGC GGGATCCGGC CCTGGCCGGG TGCCGCGAGC TGCTGGAGCG GGCGGACGCC
GTCCGCGAGT GGTCGCGCCG TCGGCCCGCC GCCGGCTTCG ACGGCTTCGA CCTGTACGTC
GCGCTCCGCG AGGTCTACCA GGGCAGGAGC GCCTTCGAGG GAGTCACCGG GACGTTGCAC
GACGCCGAGC CCGACCGCGC GCTCGACGTC CCCAACCAGG GCGGCGGTCT GGACCCGACC
GGGAAACTGA TGGCGGTCTA CGAAGTTTCC GCGGAGCCGT CGCGCGCGGT CGTCGAGCTG
CACTGCGCCA TCCCGGAACC GCGGCTCCCG ACCAAGCCGA GCACGCGCTG CGATGACGAG
ACGTTCGGCC CAGAAGCACC CTGA
 
Protein sequence
MAPRAMVEPV AFLSYAHTDD TGRGRLTRLR TDLEHELCER SGRDIKIFQD RSDVVAGDLW 
RSTLETGLGR VLFLLPVVSP AFVASDECLR EFTQFLALER AEQSRRGVRR RIIPIYWSEV
DLAGLLAAAE ENGDEERAAM LRVLRDRQHH GSRPLRSLRP SSGTYVREVG RLAQVMADRL
PSPPSLGRRL TDSVSLLARR RRSRSRLWAT LVGAATLLAV LVTLVLWNII GTGDGGGTAT
GPGTGGPCGR AALEAMRSRQ GAATPRKLQC IGVRTGADGA FAVADPNAGA GAAAGGGALA
GGGGPAVAAV PSATERSVEA AWQLIVAENE ATERADRERG APTDHLTVVV AGHLSSGEET
DGLDNVELLT ALTAMRRFNA AHRDGPLMRV WVANLGGDAE FAAQAGRKIS DAARGTGPVV
VVGLGQTRDD SEKMINEIGR TGLAGTTGAF SGVPMIAATQ SGTDLTGEPG YFRIGGSDDR
QAEMGLSWAP RGQLDQVVPF IVYDEADRWG RSMYDAYTSK LTSPEYESRF RGTAFETLLD
NRVIAYSTNP DTPRPLPEAL PDPQAKICGD NPEIAGVEVE NAPRLIIYAG RTAELPELLT
LLMTLPSKCR SNVHVLGGDA LSDLRDEDAW DTVQEILEKD RADRPKLLYS AFGPEEPRSA
RTLLENGLIA DFGPAESEFG RVRALLPEED QPRFAEFPDS AAWSLYDAVT LAGQVGLRAG
SCRRDPALAG CRELLERADA VREWSRRRPA AGFDGFDLYV ALREVYQGRS AFEGVTGTLH
DAEPDRALDV PNQGGGLDPT GKLMAVYEVS AEPSRAVVEL HCAIPEPRLP TKPSTRCDDE
TFGPEAP