Gene Franean1_4956 details

Gene Information

Locus tag: Franean1_4956
Symbol:
ID: 5673295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5949811
End bp: 5951919
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 75%
IMG OID: 641243810
Product: putative integral membrane protein
Protein accession: YP_001509226
Protein GI: 158316718
COG category:
COG ID:
TIGRFAM ID:
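
The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon, as the trailing TGA in the listed gene sequence suggests. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the gene record.
START_BP = 5949811
END_BP = 5951919
GENE_LENGTH_BP = 2109
PROTEIN_LENGTH_AA = 702


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the coordinate span matches the stated gene length, and the stated protein length is one codon short of the gene length divided by three.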


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGTCGG CGGTTTTCGT CCCGGCGGGC GTCGCGCTGC TGACTGTGCA CGAGACAGCC 
CAGCAGGCCC GTTTCTACCG CGGCGCGCTG GACGAGGCGC GGGTGCACGA GCGGATCAGC
GACGAGGTCC TGACCGACCC GGTGCTCACG GACGTGACCG GTGACCTGCT CGCCGACCTG
CCCGTCGACC CGGACCTGAT GGTCGACAAC CTGCAGCTCG TCGTCCCGCC GTCGGCGCTG
CGCGGGATGA CCGACGGCGT CGCCGACAAC GTCGCCGCCT ACCTCGACGG CAGTCGCGCC
GAGTTCGTCC TCGCCGTCGA TCTGCGGCCG ATCCTCGACA ACATCGGGCG GCTCGCGTCC
GTTTACCTCG CCGGGCAGGT GTCGGGCGCG CCGCGGTACC GCACCGAGGA CGTCGCCGCC
GCCCTGCGCG ACGTCCTCGA CGGTGTGGAC CGGGTCAGCC AGGGCCGCCC GCCCGCCTCC
GTGCCCGAGA TCGATCTCAC CGACGATCAG GTCGCCTGGG CGACCGACCT GCTGGTGAGC
CGGGTGGACG GGGCCGACCG GCCGGTCGCG CGGGAACAGG TGCTGGTCGC GCTGCGCTCC
GGTGATCTCG GCGCCGCGCT GGCGGTCGTC GGCCCGCTGA TCTTCGCCGG GGACGTGTCC
GCCGTCGCCG ACCTGCGGTC GCGCCTGGCC GGTGGCACCG TCCTCGACCT CGGCCGCCCG
CTCGCGGACG CGCCCGGCGG CCCGGCCGGC TTCGTGCTCC GCACGATCCA CACGATCGGC
GGCACGGGCA TGCTCGCCCT CGCCGCGCTC TGCTTCGCCC TGCCCGCCGG CGCGCTCGGT
CTCGCCGTCC GGCGGCGCGG ACCGTCGGTG CGGCTGGTGG GCGCCGCGCT CGTCGCGGGC
GGGCTGTCCG CGCTGGCTGC GGGGGTCCTC GTGACCGGCC TGGTCGGCGA CCCGCTCGCG
CCGCTGCGCG GGCCGGACTC GCCCCTTCCC CCGGCGGGGC GGGTCCTCGC CGGAGACGTC
GGCCGCGTGC TGGTCGCCAA CGTGCGCGCG ACCTGGAGCG AGATCGCGGC GATACCGCTC
CTGGCCGGGC TCCTCCTCGG CACCGTGACG GTTGTGGTGC GCCGGCTGTC GCGCGCCGAG
TGGCGCGTGC GGCGCCTGGT GGCGGTGGGC ACGACCTGCT CGGTGTTCGT CGCCGTGTCC
TGGGTGCTGA TCCCCGGCGA GGCGGGCACC GGCACCGCGT TCTGCAACGG CGGGGCGGAC
CTGTGCAACC GGCGCTACTC CGACGTGGTC TACCCGACGA CCCACAACGG GATGGCGTCC
GTGCAGGCCG GGTTCCTCGG CGCGGTGCAG GACCCCGACC TGGTGGGCCA GCTCGACAGC
GGCATCCGCG CCCTGATGCT CGACGTCCAC CACTGGACGA CACCCGCGGA GGTCGAGTCG
TTCCTGGCCG AGCTGCGCCC GCGGGCCCGC GAGGCGCTCG CCCCGTTCGC CACCGGTGCC
CGTTCGAGCC GCCCCGGGCT CTGGCTCTGC CACGGCATCT GCCAGCTCGG CGCGACCCGC
CTGGACGACG CGCTGGCCGG CGTCGCGGGC TGGCTGGCGC GCAACCCGGC CGAGGTCATC
ACCATCATCG TCCAGGACGG CGTCGCACCC GAACCGATCA TGGCCGCGTT CCGGGCGGCG
GCCCTCGGTC AGTACCTGGT CCGCCCGCCC GCGCCGGGCC GGCCGTGGCC GACCCTCGGC
CAGCTGATCG ACCGTGGCCG GCGCCTGGTC GTCTTCGCCG AGAACGGGGA CGTGCCCGGC
ACCTGGTACC GCAACTTCTA CCGCTCCAAC GCGGACACCC CGTTCGACGT CCGGATCCCC
GGGGGCTTCA GCTGCCGGAT CGGCCGCGGG GCCAGCCGGC CCACCATGCT CCTCATCAAC
CACTGGCTCA CCGACCACGC CGCCACCCGC GCCGACGCGG CGCTGGTGAA CACCAGCTCG
TCGCTGACGG CGCACGCCGA GCAGTGCGCC GCCCGCGGGC TGCGCCCGAC CTTCCTCGCG
GTCAACTTCG CGACGGTCGG TGATCTTGTC TCCACCGTCG CCGCCTACAA CCGGCATTCG
CCCGACTGA
 
Protein sequence
MVSAVFVPAG VALLTVHETA QQARFYRGAL DEARVHERIS DEVLTDPVLT DVTGDLLADL 
PVDPDLMVDN LQLVVPPSAL RGMTDGVADN VAAYLDGSRA EFVLAVDLRP ILDNIGRLAS
VYLAGQVSGA PRYRTEDVAA ALRDVLDGVD RVSQGRPPAS VPEIDLTDDQ VAWATDLLVS
RVDGADRPVA REQVLVALRS GDLGAALAVV GPLIFAGDVS AVADLRSRLA GGTVLDLGRP
LADAPGGPAG FVLRTIHTIG GTGMLALAAL CFALPAGALG LAVRRRGPSV RLVGAALVAG
GLSALAAGVL VTGLVGDPLA PLRGPDSPLP PAGRVLAGDV GRVLVANVRA TWSEIAAIPL
LAGLLLGTVT VVVRRLSRAE WRVRRLVAVG TTCSVFVAVS WVLIPGEAGT GTAFCNGGAD
LCNRRYSDVV YPTTHNGMAS VQAGFLGAVQ DPDLVGQLDS GIRALMLDVH HWTTPAEVES
FLAELRPRAR EALAPFATGA RSSRPGLWLC HGICQLGATR LDDALAGVAG WLARNPAEVI
TIIVQDGVAP EPIMAAFRAA ALGQYLVRPP APGRPWPTLG QLIDRGRRLV VFAENGDVPG
TWYRNFYRSN ADTPFDVRIP GGFSCRIGRG ASRPTMLLIN HWLTDHAATR ADAALVNTSS
SLTAHAEQCA ARGLRPTFLA VNFATVGDLV STVAAYNRHS PD