Gene Franean1_2653 details

Gene Information

Locus tag: Franean1_2653
Symbol:
ID: 5671046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3138172
End bp: 3141513
Gene Length: 3342 bp
Protein Length: 1113 aa
Translation table: 11
GC content: 61%
IMG OID: 641241568
Product: NERD domain-containing protein
Protein accession: YP_001506988
Protein GI: 158314480
COG category:
COG ID:
TIGRFAM ID:
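
The coordinate and length fields in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 3138172
end_bp = 3141513
gene_length_bp = 3342
protein_length_aa = 1113

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 3342 0 1113
```

A span that is divisible by 3 and gives one codon more than the protein length is what one would expect for an unspliced CDS annotated with its stop codon.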


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCAG TGATCGATAT CCTTTGGGGC TCGGAGCCGG TAGAAGCCTC CGAGCAGCAC 
TTCCTCGGTC GGCTCCAGGC TGATCTGCAG GCGCATGGCG TCACGGCTAC CGTCTTTGCC
AACTTCCACA CCGCCGGCTC GCGGCAGGTC GACTTCCTCG TGATCACTCC CGGCCATGCC
TGCCACGTGG AGCTGAAGGC ATACCGCGGC CCCATCCACG GTGGCCGGAA TGGCCAGTGG
TCGAGCACCC GGCCGGATGG AACCACCCAG GTCATCGAGC GCTCACCGAA CCCGTACGAG
CAAACCGTCA GAGCCCAGCA GTCCATCAGT GACGACATGC GGGCCTTTGC TTCGGGACGG
GCTGGCACGC CGCGACCATC GGATGGCCGT CAGTTCTACA CGTGGTTCGA CAGCGTCCTG
TGCGTCTATC CCAGACTTGC CGTCGGGTCG GAGGTCCCGA GCGACTTCAA GGTTAAGACG
TTCGGCTACC CCGAGCTCCT CGACTTCTTG CTGAAGCCCG GCAAAAATCC CCCGTGGGAT
GCCGGGATCT GGCGCGAGTT CGGCATGCAT CTGAACCTTG TTCGCCCTGA GCGGCCGGAT
GGTCCGTCCT TGGCTGCCAC GCGGGCACAG CAGGCGCTGG ACGACTACAC GCGACGCTTC
AGCCAGTTCC ATCGCCAGGG CCTGCATGAG CTGGTTCCCA GCACCCTTGA AGCTGGTGAG
ACTGCGCATC CCTCTTCCGA GCTACCGACG CTACTGGCTG GCACACCGTA TGCCCAGATA
GTGGCGCCGT CCGGCTACGG CAAGAGTCAT CTCGCCCATC ACACGGCGCT GACTCTGGCG
TCTGAGGGCA CCGTGCCCAT CTTCCTCAGC GCGGCCCGCT ACGAGGGCCG GCTTTCCGTG
CTCATGGATC GTGCGGTTGC CCCGTATTCG CCGCAGCCGG CGCTGAATCT CTTGCGGGCG
GCGACCTCGG TCGGGCGTGA CGTACTGCTT GTAGTGGACG GCTTCAACGA GTGCCCGGAG
CGGGAGCGGG ATGCCTTAGT CCAGGACATC AGTGCGCTTT GCCTGCGGTC TCCGTGCCGT
GTATTGGTGA CGGCCCAGCA CTCGGTGTCG TTTGCAACGA TTACCGGCTG GCAGGAGTTT
CGGCTCCGGA AACTGGACGA GGACGAGCGC CGGGCGGTGC TCGCATCGTA TGACGCTTCC
GTACCACTAG ACCTCTGTGG GCCCTTCGAG ACGGCGTACG AACTCTCGAT CGCCGCGGAG
TGTTCCAGCG AATTGAGGGA AGGGGCTACA CGCGCTACCC TCCTCGACGC CTTTGTCCGT
CACCGGTTGC AGTCGGGGCG TTCCCCGGCC CTAACCCGAA GCTTGCTTCG TCGGCTGGCC
GTGGTTATGG ACGAGCGATT GGTAACCGCC CTGCCGATCG GTGACGTCTG GCGCATCGGT
GAGGCGGCGC TGCGGGAGCA GCAGACACCG TCGGACATCC TGGACGAGGT CTTCCGCGCC
TCCGTAGTCA GCGTGGAACA GGGCGTGCTT ACTTTCTCCC ATGAACTGAT CGGGCGGTAC
TTGGCCGCCG AGGAGCTGCT GCTCGCGGCC GGCACCAACA TGGACGACCT GACATCTGAG
CTGCAACGGC CGCGGCACGA GGATCTACCA GCGTTGGTGA TTCCACTTGA GACCTCAGAG
GACTGTCTGC GCCAACTGTT CAACTGCCTG GCCAGCAAGC ACCTACTGGT TGAGGCTCTG
GGGAACAAAC TGGGGCCGCG AGCCCGAGCA GTAGCTGTCA TGGAGGCCGA GCGGGTCTTG
GCGGAACTCT GTGAAATGAC AGCCGGGCTC AAATTAGTGT TCGGAAGCAC CTTTGAGACC
ACCGTCACCG GTGGTCGAGA CGTGACCGGC TATGAAGCTG CGGTTCTCGC CGCTGTCGGA
GATGGTCTAG CTGACGGAGA TTTCCTGGGG CCCGCAATGC GGTTGCTGGA TGCCACCGAC
GAGGCATGCC GGACATCGAC GGCTGCGCAG GCGGCTTCCG GCCATCGGCC CACTCCTTCC
GACATCGCCG CAGCGGTAGT GTTGAATTTT GCGGAACCAG GCTCGCGGAG AAAGATCGCC
TCCCATATTC TGTTAGATTC GGCGCGCCTC GGCTGGCCCT GGCGTTACAG GCATCGGAAG
GGTGTCATCT CCTCCGGCGG CCTTGAAGAG GTATCTATGG GCGGTGGCGA GGACAACTAC
TGTCGGCTAT TCTTGATGGG CCTGCTACTG GACAGAGTGC CGCTGGATGT CGGCCAAGAG
ATCGTGATGT CCTGGCTTCG GATGTGCTGG CAGTCGGGTG CTTACCACGT CCAACTTCAA
GCCCTTGAGT CTATTCGTGC GTATTGCCAG CTTGAAGCAG GGCCTTTGCG TTCCTCGCTC
GTTGACTATC TTTCTGACCT GCAAACCCAG AACTTGGGCT TGTCGACGAC GATCGTTGAG
GCGCTGTACT CGTTCGGAGA AATTGAGCCG CAGATTAGTA GAAAGGTGGC CGACGATGAG
ATTCGAGAAG TTTTGGAGAA GCCCGCCGAC GATAGTGAAG CCTGTGGGCG TGCTTATGGC
ATAGTTTCAA ACAGTTTCGA AGACGTTCTG GGGCCGCACT ACTTCGAAGC TATCGAAGCG
CTCGGGAAAA ATGATCGAGT TCGCTTGCTC ACCAAGGCGG CTCTCGGAGT CAATCATGGA
TTCTGGTTGG ATTGGGTTCT CGGAGAGCTA TTAAAACTCC GTGATCCTCA GGCGATCCCC
GCCTTCGAGA GATATGCGAC GGCCTTTGAT GTCAGGTCGC CCTCTCCGCA GGAAGCGGTT
AGTTGCTATA TTCTTGCCAT GCAGGGTTGC GGAAATTTCC TCGACACTCC GCCAAGGTTT
CATCAGTCAA TGACCGTCGA CCTTGAAGCA TGGCAATGCT ACGGAGCAAT CACCTTCTGG
CTGGCGCATC CGATGCCAGC GATCGAGCGC GCAGACCGCT GCGCACCGAT CTGGGCGCGT
TTGGCAACAG AACTGGTCGA AGCTGCGGCG GATCCGCTAT ACCAGTTGGC ACAAACCTAT
GTTAAGGACC AAGATGCAGG CAAGCACCTC GGCCATAATC TTGTGCTAGA CACATTTCCG
GACGAGGTTC GCGTAATTCT TGAGGCCGCT GCCTCAAATT TTGACCGGCA AACAGCGATA
TTTCGAGCAT TCGATCCCAT CGAGCGATCA CAATATGTTC TGCGAACCCT CGGGTTGGTC
GGCAATGAGG GTTCGCTGCG CCTGATCGAG CCATACATCG AAGACCCGTC GCTCGGATCG
ACGGCGATAG AGACGCTCAA AGCACTTCGG GCGCGCTTTT GA
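
The gene sequence is printed in groups of 10 bases, 60 per line, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as a small sample.
first_line = "ATGACCTCAG TGATCGATAT CCTTTGGGGC TCGGAGCCGG TAGAAGCCTC CGAGCAGCAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block to the same two functions would give the gene-wide value to compare with the reported figure.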
 
Protein sequence
MTSVIDILWG SEPVEASEQH FLGRLQADLQ AHGVTATVFA NFHTAGSRQV DFLVITPGHA 
CHVELKAYRG PIHGGRNGQW SSTRPDGTTQ VIERSPNPYE QTVRAQQSIS DDMRAFASGR
AGTPRPSDGR QFYTWFDSVL CVYPRLAVGS EVPSDFKVKT FGYPELLDFL LKPGKNPPWD
AGIWREFGMH LNLVRPERPD GPSLAATRAQ QALDDYTRRF SQFHRQGLHE LVPSTLEAGE
TAHPSSELPT LLAGTPYAQI VAPSGYGKSH LAHHTALTLA SEGTVPIFLS AARYEGRLSV
LMDRAVAPYS PQPALNLLRA ATSVGRDVLL VVDGFNECPE RERDALVQDI SALCLRSPCR
VLVTAQHSVS FATITGWQEF RLRKLDEDER RAVLASYDAS VPLDLCGPFE TAYELSIAAE
CSSELREGAT RATLLDAFVR HRLQSGRSPA LTRSLLRRLA VVMDERLVTA LPIGDVWRIG
EAALREQQTP SDILDEVFRA SVVSVEQGVL TFSHELIGRY LAAEELLLAA GTNMDDLTSE
LQRPRHEDLP ALVIPLETSE DCLRQLFNCL ASKHLLVEAL GNKLGPRARA VAVMEAERVL
AELCEMTAGL KLVFGSTFET TVTGGRDVTG YEAAVLAAVG DGLADGDFLG PAMRLLDATD
EACRTSTAAQ AASGHRPTPS DIAAAVVLNF AEPGSRRKIA SHILLDSARL GWPWRYRHRK
GVISSGGLEE VSMGGGEDNY CRLFLMGLLL DRVPLDVGQE IVMSWLRMCW QSGAYHVQLQ
ALESIRAYCQ LEAGPLRSSL VDYLSDLQTQ NLGLSTTIVE ALYSFGEIEP QISRKVADDE
IREVLEKPAD DSEACGRAYG IVSNSFEDVL GPHYFEAIEA LGKNDRVRLL TKAALGVNHG
FWLDWVLGEL LKLRDPQAIP AFERYATAFD VRSPSPQEAV SCYILAMQGC GNFLDTPPRF
HQSMTVDLEA WQCYGAITFW LAHPMPAIER ADRCAPIWAR LATELVEAAA DPLYQLAQTY
VKDQDAGKHL GHNLVLDTFP DEVRVILEAA ASNFDRQTAI FRAFDPIERS QYVLRTLGLV
GNEGSLRLIE PYIEDPSLGS TAIETLKALR ARF
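
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table from the amino-acid string used for the standard genetic code, which translation table 11 shares for all codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the two sample fragments, taken from the first and last lines of the gene sequence, are illustrative choices.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGACCTCAG TGATCGATAT CCTTTGGGGC"
last_codons = "ACGGCGATAG AGACGCTCAA AGCACTTCGG GCGCGCTTTT GA"
print(translate(first_codons))  # MTSVIDILWG
print(translate(last_codons))   # TAIETLKALRARF
```

Both results match the start and end of the protein sequence listed here, and the final TGA codon is the stop that is left out of the 1113 aa count.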