Gene Franean1_0745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0745 
Symbol 
ID5669161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp870145 
End bp871782 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content69% 
IMG OID641239672 
Productcarbohydrate-binding CenC domain-containing protein 
Protein accessionYP_001505109 
Protein GI158312601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.556771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00012998 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATCAGA GCATCATCGA GGACGAGGCA AGGCCGGACG GCCCCGCACA GAATTCGCGT 
GGCGCGGGCC GCCGGGTGGG GCGGCGTGAC CGGCGCAGGC GTGACCGCGT GGTCCGAACC
GCCAGACGCA AGATCCTGCT ATCCGCCCTC TGTCTCCTTG TCGTGGTCGG CGGGACGACC
GGCGTGATCC TGTACTCCGC GCTCGGGGGC GACGATGCGG GTGGGAAGCC CATCCAGCTG
GCCGACACCG AGCCGCGATA CCAGACCGGT GTGAACTTCC TTCCGAACCC TGGCTTCGAG
AACACGGTCG CGGGCTGGTC GGCCGGGCCA TCCGAGAGCG CTCGCCTGTC CCCGACGGTG
CCCGGCCGCA GCGGGTCCGG CGCGGCGATC GTCACGGCCA GGGCCCGGGT ACCCGCCATC
ACCCTGACCG ACACCCCGGA CGCCGTGACC TGGACCGGCG GCAGGCTGAC CTATCTCGGC
GCCGTCTGGG TGCGCACGAC CGCCCCGGGC ACCCGCGTCC AGCTGAAGCT GACCGAACTC
GACGGTGCCC GCACGGCCGC ACGCGCCCTG ACCAACGTCA CCCTGAACGA TGCCACCTGG
CGAAAGGTCG AGGTCTCCCT GCGGGCGACT GCCACGGGCC ACCGGCTGGA CTTCGCGGTC
GTCGCCGACC GGCTTCCCGC CGGCCAGTTC CTCTACGCGG ACGATGCGTC CCTTCTCATC
GGGAGGCCGT CATCGCCGGC CCCGACGGGG CCGGCGGCGA CCACGGGCGG CCAGCCGAGC
GCGTCCCCGC CGGTGAGCGG CAGCCCGCGG CCTGGCACGA GCGCGACTCC CAGCGCGCCG
ACGAGTGAGC GACCGCGCCC GAGCACGGCA CCGCCCGCGC CGGGTGGGTC GGGTATCACG
CCGATCCCGG CCGGCATGCC CAACTCGAGC ACCACTGGCG TCCGGGCGGG GACGTCGCTG
ACCCTCCTCT CCGGTGACCA GCGCATCACC CGCGCGGGAA CGGTGATCGA GAACAGGGAT
GTGCGCGGGT GCATCCGAGT GGAGGCCGAC AACGTCGTCA TCCGAAACTC CCGGGTCTCC
TGCCGGTCGT CGGGTTCCGC GGTGATCAAG AACCTCGGCC AGAACCTGCT GGTCGAGGAC
GTCACCATCG ACGGGCAGGG CGCCTCGAAC TCCGGCATGT CGACCGCCGA CTTCACCGCC
CGCCGGGTGA ACATCTCGAA CACGATCGAC GGCTTCTTCA TTAACGACAA CATCCTCATC
GAGAACTCGT ACGTTCATCA CCTCGCCCAG ACCCCGTCCA GCCACAACGA CGCGGTACAG
ACCACCGGCG GCAGCAACAT CGTGCTGCGT CACAACACCC TGATTCCCAT CAACGAGACG
ACCGGCGAGT CGAGTAACGC CGGCTACATG TTCGGGCAGA ATATCGGGCC GATCTCGCGG
GTTGTCTTCG ACGGCAACTT CATCAACGGC GGCGGCTTCA CCCTCAACGG TGCCGGTGAC
GTGACCTTCG TGAACAACCG GTTCGGCCGC AACTGCCTGT ACGGGATCAA GGCGTTCAAG
GGCAGCCAGG TGAAGTTCGA CCAGTCCAAC ATCTGGGCCG ACACCGGCCG TCCGGTGAAC
GACGACCCCA AATGCTGA
 
Protein sequence
MHQSIIEDEA RPDGPAQNSR GAGRRVGRRD RRRRDRVVRT ARRKILLSAL CLLVVVGGTT 
GVILYSALGG DDAGGKPIQL ADTEPRYQTG VNFLPNPGFE NTVAGWSAGP SESARLSPTV
PGRSGSGAAI VTARARVPAI TLTDTPDAVT WTGGRLTYLG AVWVRTTAPG TRVQLKLTEL
DGARTAARAL TNVTLNDATW RKVEVSLRAT ATGHRLDFAV VADRLPAGQF LYADDASLLI
GRPSSPAPTG PAATTGGQPS ASPPVSGSPR PGTSATPSAP TSERPRPSTA PPAPGGSGIT
PIPAGMPNSS TTGVRAGTSL TLLSGDQRIT RAGTVIENRD VRGCIRVEAD NVVIRNSRVS
CRSSGSAVIK NLGQNLLVED VTIDGQGASN SGMSTADFTA RRVNISNTID GFFINDNILI
ENSYVHHLAQ TPSSHNDAVQ TTGGSNIVLR HNTLIPINET TGESSNAGYM FGQNIGPISR
VVFDGNFING GGFTLNGAGD VTFVNNRFGR NCLYGIKAFK GSQVKFDQSN IWADTGRPVN
DDPKC