Gene Francci3_1165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1165 
Symbol 
ID3905276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1388110 
End bp1392078 
Gene Length3969 bp 
Protein Length1322 aa 
Translation table11 
GC content74% 
IMG OID637878497 
Producthypothetical protein 
Protein accessionYP_480273 
Protein GI86739873 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0368185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG GCTTTATGGT GCTCTTTCTG CTCCAGGCCC CCGGCAACCT GACCGCCGAC 
ACCAAGCTGG ACGTCCCCCT TGACCCGTGG GGCTTCATGG GGCGGGCCAC CCACCTGTGG
AACTCCCTCG CCGAGTTCGG CTACCTGCCC AACCAGTACG TCGGCTACCT GTTCCCGATG
GGACCGTTCT TCGGTGCCGG TGACCTGCTC GGGATACCCG CCTGGGTCAC CCAGCGGCTG
TGGATCGCGC TGCTCCTCAC CGTCTCCGCC TGGGGGGTCG TGCGGCTGGC CGAGGCCCTG
CGGCTCGGCG TACCGCGCAC CCGGGTGCTG GCCGGACTCG CCTACACCCT GTCCCCGATG
TTCCTCGGCA AGATCGGGGC GACTTCCGTG GCGCTCGCGG GCGCCGCCAT GCTCCCGTGG
ATCACCCTGC CGCTCATCCT CGCGCTGCGT CCGGACGGCG CCTTCGGCGC CGACACCGGT
ACGGCCGCCA CCACCAGTAC CACGGCCGTC GATGATGTCG ATGCCGCTGA CGATCCCGCC
CTGGACCCGG GCCGGCGGCT GTCCCCGCGA CGGGCGGCTG CCCTGTCCGG ACTCGCCATC
CTGGGCACCG GCGGAATCAA CGCGACGGTC ACGCTGTGCG TCCTGCTCTG TCCGGCGGCG
GTCCTTGTGT TCGCCGGCGG CACCCGCCGG GCCTGGGCGC TGCGGGCCTG GTGGGTCGTC
GCCGTCATCC TCGCCTCGGC CTGGTGGCTG CTGGCGCTGA TGGTGCAGGG CCGCTACGGG
CTGAACTTCC TCGCGTTCAC CGAGACCGCC CAGACGACCA CGGCGACGAC GTCGGTGGCG
GAAGCCCTGC GCGGCGCGAC GGACTGGCTG GCGTATGTCC AGTTACCCAG GCCCTGGCTG
CCGGCCGCCA CCGAGTACGT GTCCCGGCCG GTGCCTATCA TCGGATCGGC AGCCGTGGCG
GCGATCGGGC TGTGGGGGCT GTCCCGGGCC GATCTGCCCG CCCGCCGGTT CCTGCTGGTG
ACGTTGGCCG CCGGCACGGT CTCGGTGGCT GCGGCCTATC CGGGGCATCC GGGCAGCCCG
CTGGCGGGCG TGGTGCGGGA CCTGCTCGGT GACCAGCTGG GCTTTCTGCG CAACGTCTAC
AAGTTCCAGC CGGTGGTGCG CCTGCCGATA GCCCTGGGGA TCGCGCACGC CCTGACGGTC
CGACCTGTCA GTCTCACCGC CCGTGTGCGG AGATGGACGA CCCGCGGGGC TTCCCGCCGG
CACGGCAGGA CCGCGGACAT GATCGTGACC GTCGCGGTGA ACGTCCCCGT CGTGCTGACC
GTGGCCGCGC TGGTCTGCGG CATGACGCCA GCGCTCGCCG GCAGGGCGCT GCAGCCCCGG
CCGTTCGCCC GGGTGCCCGA CTACTGGGTG GCCGCCGCGG ACTGGCTGGC CGCCCATCCC
CAGGACGGCC GGGCACTGGT GCTGCCCGGC TCGCCGTTCG CCGAGTACGA GTGGGGCCGG
CCGCTCGACG AGCCGCTGCA GTGGCTGGCC CGCACGCCGT GGGGGGTCCG CGGCCTGATC
CCGCTCGGTG GCACGGGCGT GACCCGGCTG ATGGACGGCA TCGAGCACCA GCTCGCCACC
GGGTCCGCCG CGGGCCTCGC CCCGGCGCTG GCCCGGGCCG GGATCGGCCA GATCCTGCTC
CGCAACGACC TGGAACAGAA GAACTGGGAC GTCCCCCCGT CCACCGACGA GCTCCACCGG
GCGTTGCGTT CCTCCGGTCT CACCCTGACC GCCGCGTTCG GCCCGCAGGT GCCGGCGCGC
GCCTCGGCGA AGGAGAGGCT CGTCCCGGAG CTGCGTAACC CGACCGCGCG GGTACCCGCG
CTGGAGGTCT GGACCGTCCC GGGCGGCGCG AAGCTTGTCG GCTCCTACCC GGCGGACACC
GCCGTCGTCG TCTCCGGCGG CCCCGAGGCG ACGGTGCAGC TCGCCGGGCA GGGCCTGCTG
AGCAGCGACC GGGCCGCCGT GCTCGCCGCG GACCTCGCCG CGGATCGCAC CGATCCGTCC
GCCGCCGTGG CGTCTTCGAC GAGCCCGGCG GCGATCAAGG TTGCGCCGGG GGAGGTGATC
GGCCCGACGA CCGCCTGGGT CGACACGGAC ACCCTGACCC GGCGCGACAG CACCTTCGGA
CTCGTCCACG ACGCCGCTTC GTACCTCCTC GGGCCCACCG GCACCGCGGT CGGCAAGACG
GGGGAGCCCC ACCAGTGGGC CGAGGCCGAC GTCGCCGGCC ACCAGACGGT CGCGGGCTAC
GTCGGCGGCA TGTCCGTCAC CGCGTCCTCC TACGGCTACG ACCTACTCGC CGCGCCGGAC
CTCGCGCCAC CGGCGGCCGT CGACGGCTTC GCCGCGACCG CGTGGACGGC ACAGCGCACG
AAGGGCACGA CGTCGGCCGG GCAGTGGATC CAGCTCGATG CCGGCCGGCG GATGACCGTG
CCCTACCTCG ACGTGCGGCT GCTGTCCGAG GGTTCGTGGC GGCCGGCCGT CGAGGCGCTG
CGGGTGTCCA GCGAGGCGGG CTCGGTGGTC ACCGAGGTGC GTCCGGTGGA GGACATCCAG
CGCCTCGCCG TCCCAGCGGG GCCGAGCCGC TGGTACCGCA TCACGTTCGC GCGGGTCGGC
CAGGAGACCG ACGACACCCT CGGCGCCGGT ATTCGGGAGA TCACCATCCC CGGGGTCACC
TTCCGCCACT ACGCGCAGCT CCCCACCGAC GTGGCCCGGC GCTTCGCCGC GCCGGACGGG
GGGCTGGTCG CCTTCTCGAT GGCCCGGGAG CGGGTCGACC CGGCGCAGCC GTTCGGCGGC
TCGGAGGAGC TGGCGCTGTC CCGGCGCTTC GAGGTGCCCC GGGACATGAC CTTCACGATG
AGCGGCTCGG CGAGCTTCAT CCCCCCGCCG CGGGGGACAC CCCCAGCGGG TGACACTCCG
TTGCTGGTCG ACTGCGGCCA GGGCCCCACC CTGGTCGTCG ACGGGACCCG TTACCCCCTG
CGGATCTCCG GCCGGGACAG CGACGTCACC TCGGTGCGGC CGGTGCGGGT GTCGCTGTGC
ACCGACGGCG GCACGCTGCG GCTCTCGGCG GGGCCGCACC TGCTCGGCGT CGACCAGGGA
GCGACCTCCA TCCTCGTCGA CGCGATCGCG CTGGTCGGAT CGGGCGCCCA GGTGAGCGCG
GCCACGCCGC GGGCGACGAC GGTGCACGAG TGGACAGCGG AGCACCGGAC GGTCCAGATC
GGCGCCGGTG ACCGCGCCTT CCTCGCCGTC CGGGAGAACG CCAACTCCTC GTGGACGGCG
AAGCTGAACG GCGTGGCGTT GACCCCGCTG CGCCTCGACG GCTGGCAGCA GGGCTGGATC
GTCCCGGCCG GCACCGGCGG CACGATCGTC ATCGACAACG CGCCCGGTGC GGAATACCGC
CGCAACCTCG TGGTGGGGCT GTTGCTGGTC GTGGTGCTGG TCGTGCTCGC CGTGCTGCCC
GCCCGGTTCC GGCTACGGCC ACGATCCGAC GAGAACGGCT ATCCGGCGCT GCTGAGGCCG
CGCCGGCTGG GCCGGCTGGG CCGGGCGACG GTGTCCCCGC CGGTGGCCGG TGTGGGGTGG
ACGGTGCTCG CCATGCTGGC GGTGGCGCTG GTGGCGGGCC CGGTGGCGCT GGCCGTACCG
GTGTTCGTCC TGGTCGGTCG GCGCTGGCCC GCGGCGGGGG GCCGCTGCGC CGCGGTCGCC
ATGATTGGCG CGGGAATCGG GGTGGCGGCG CAGCCCGGCA GCCAGCCCGG ATCCGGGCAG
GGGGCGTTCG GGCCGCTGGT CCAGGTGTTC GGCGCCTTGG CCTTGGCGGC GGTGCTGGCC
GCGTTGGCCG GGCGGGCCTG GGACCGTCCC GCCGGCTCGG GCGCCGGCGC CCACTTCGGC
GCCCGAGCCG GGTCCGATGT CGGGCATAGG ACTCGCAATC CCTATACCGA CAGCAAGGGA
CTCGGTTAG
 
Protein sequence
MAAGFMVLFL LQAPGNLTAD TKLDVPLDPW GFMGRATHLW NSLAEFGYLP NQYVGYLFPM 
GPFFGAGDLL GIPAWVTQRL WIALLLTVSA WGVVRLAEAL RLGVPRTRVL AGLAYTLSPM
FLGKIGATSV ALAGAAMLPW ITLPLILALR PDGAFGADTG TAATTSTTAV DDVDAADDPA
LDPGRRLSPR RAAALSGLAI LGTGGINATV TLCVLLCPAA VLVFAGGTRR AWALRAWWVV
AVILASAWWL LALMVQGRYG LNFLAFTETA QTTTATTSVA EALRGATDWL AYVQLPRPWL
PAATEYVSRP VPIIGSAAVA AIGLWGLSRA DLPARRFLLV TLAAGTVSVA AAYPGHPGSP
LAGVVRDLLG DQLGFLRNVY KFQPVVRLPI ALGIAHALTV RPVSLTARVR RWTTRGASRR
HGRTADMIVT VAVNVPVVLT VAALVCGMTP ALAGRALQPR PFARVPDYWV AAADWLAAHP
QDGRALVLPG SPFAEYEWGR PLDEPLQWLA RTPWGVRGLI PLGGTGVTRL MDGIEHQLAT
GSAAGLAPAL ARAGIGQILL RNDLEQKNWD VPPSTDELHR ALRSSGLTLT AAFGPQVPAR
ASAKERLVPE LRNPTARVPA LEVWTVPGGA KLVGSYPADT AVVVSGGPEA TVQLAGQGLL
SSDRAAVLAA DLAADRTDPS AAVASSTSPA AIKVAPGEVI GPTTAWVDTD TLTRRDSTFG
LVHDAASYLL GPTGTAVGKT GEPHQWAEAD VAGHQTVAGY VGGMSVTASS YGYDLLAAPD
LAPPAAVDGF AATAWTAQRT KGTTSAGQWI QLDAGRRMTV PYLDVRLLSE GSWRPAVEAL
RVSSEAGSVV TEVRPVEDIQ RLAVPAGPSR WYRITFARVG QETDDTLGAG IREITIPGVT
FRHYAQLPTD VARRFAAPDG GLVAFSMARE RVDPAQPFGG SEELALSRRF EVPRDMTFTM
SGSASFIPPP RGTPPAGDTP LLVDCGQGPT LVVDGTRYPL RISGRDSDVT SVRPVRVSLC
TDGGTLRLSA GPHLLGVDQG ATSILVDAIA LVGSGAQVSA ATPRATTVHE WTAEHRTVQI
GAGDRAFLAV RENANSSWTA KLNGVALTPL RLDGWQQGWI VPAGTGGTIV IDNAPGAEYR
RNLVVGLLLV VVLVVLAVLP ARFRLRPRSD ENGYPALLRP RRLGRLGRAT VSPPVAGVGW
TVLAMLAVAL VAGPVALAVP VFVLVGRRWP AAGGRCAAVA MIGAGIGVAA QPGSQPGSGQ
GAFGPLVQVF GALALAAVLA ALAGRAWDRP AGSGAGAHFG ARAGSDVGHR TRNPYTDSKG
LG