Gene Francci3_4164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4164 
Symbol 
ID3907129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4965578 
End bp4967083 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content64% 
IMG OID637881492 
Producthypothetical protein 
Protein accessionYP_483241 
Protein GI86742841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTTC CTTCGTTGCT CCCCTTGCCC GAATGGCCCG ATGCCGAGCC GGCCCTGTGG 
TACGGCCGGA TGTCCGATGT CGACAATGAC GCACGTATCC CGGACCAGTT CGCCCGAGGT
CAGCGTTACG CGCGGCTGAC GGGTGAGTAC TGGATAGCCC GGGCGTGGGC CGATGACGGG
ATTTCCGCCT GGCGTGAGGA TGTGGTCCGG CCGGAGTTCG AAGGATTCCT TACGACGCTA
CGAACGGGGA AGCACCGGGT CGTGGTGGCG TGGGAGGAGA GCCGGATTAC CCGTGACCCA
GTGGTCGGCG CGGAGTTCGG CAAGATCATG CAGCGGGTCA GCGGCCGACT AATCGTCACT
GATGGTGAGA AGGCCACGAC CTACGACTTC CGCAGGCAGC GGGACCGCGA TGCCTGGCAT
GGCGCGGTCG GGAAGTCGGT CAGCGATTCC GGGCTGAAGT CGGAGCTGGT AAAGCGGAAG
TTGGATGCCA AGCGGGAGGC CCGGGAGTTC CTTGGTGGCC CGGTCGGCTT CGGCTGGTCT
CAGACGATCA GCCGGAAGGG CAAGAAGATC GTGACCGAGT GGTCGGTCAA CGAGGAGCAG
GCCCGTTGGC TACGCGAGGC ATCGCAGCGG ATCCGGGAAG GTGAAGCCGT ACTCAAGGTC
GCGGACGACT TCTATGACCG TGGTCTACGC ATTCCGCACC GGCGGACCAG GCCCGACGAC
ACCATGAAGA CCGGCACCCT GACACGGGCC AGTCTGACAG CGATGCTGCG TAACCCGCGC
ATCGCTGGAC TGTTCGCAAC GGGAAACGTC CACCGCGGGT GGACCGTGGT CGGTCCGATG
GCGAACTTCC CTGCCATCCT CACGGAGGAG GAGTGGCGGG AGACCTGTGC GGCGTTGGAA
GCCGTGAAAA CCCGCAAAGG CACCGGAACG GCCGTCAAAC ACGTGTTCGC CGGCTACTAT
GTGTGCCACA AGTGCAAGCG ATCATTGATC AGGAACTCTC CCCGCGCGTA CGCGCTGTGG
CGGCATCGTC TCGGTAAGAG CCGTGAGCAT GTCGAGTGTG ACCAGTCGTT CCACATCAAC
GCCGCCGATG CCGACGACCT GATGACCCGC CTGGTTGACG CCTACCTTGT GCGCCGGGAC
TGGGAGAAGG CGGGTGAGGT CGCCGACGCC GAGGAGTTGA AGGCCGAACG GACCGAGAAG
GAACAGGAGC TTGCCGACCT CCCCCGCGCC ATCACGGCCA AAGAGATCAG TCTGCGGTTG
GGTGGCCAGG TCGAAGCCCA GATCGAGGCC CGGCTACGGG AGATCGACGC CGAGTTGGCC
CGCCGGGCGC GTCTCGTGAC CGTCCTGGAC GGCCAGGAGG CACTACGGCT ATGGCGTAAC
GGCACCCTGA CGGAGAAACG CCGCGTCCTA TCGACGATCA TTGAACGGAT CATCGCGATC
CCCGGGAAGG ATCTTCCGTT GCGTGACCGG TTGGACCCCC AGTGGCGCAA TCCCAGCCCT
GCCTAG
 
Protein sequence
MEFPSLLPLP EWPDAEPALW YGRMSDVDND ARIPDQFARG QRYARLTGEY WIARAWADDG 
ISAWREDVVR PEFEGFLTTL RTGKHRVVVA WEESRITRDP VVGAEFGKIM QRVSGRLIVT
DGEKATTYDF RRQRDRDAWH GAVGKSVSDS GLKSELVKRK LDAKREAREF LGGPVGFGWS
QTISRKGKKI VTEWSVNEEQ ARWLREASQR IREGEAVLKV ADDFYDRGLR IPHRRTRPDD
TMKTGTLTRA SLTAMLRNPR IAGLFATGNV HRGWTVVGPM ANFPAILTEE EWRETCAALE
AVKTRKGTGT AVKHVFAGYY VCHKCKRSLI RNSPRAYALW RHRLGKSREH VECDQSFHIN
AADADDLMTR LVDAYLVRRD WEKAGEVADA EELKAERTEK EQELADLPRA ITAKEISLRL
GGQVEAQIEA RLREIDAELA RRARLVTVLD GQEALRLWRN GTLTEKRRVL STIIERIIAI
PGKDLPLRDR LDPQWRNPSP A