Gene Francci3_4532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4532 
Symbol 
ID3907509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5410599 
End bp5413253 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content70% 
IMG OID637881865 
Producthypothetical protein 
Protein accessionYP_483607 
Protein GI86743207 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.796948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGA AGGGACCCGG GCGGCGCCGT CGTGGACCTC GTTGGGCGGC GATGCTCGCG 
GCGCTGGCCC TGTTCGCCGT GGCCGTGCCC GGGGTCTGCG CGGCCTCGGC TTCGGCTTCG
GCCCCCGCGT CGGCCGGGGG AGTCCTCGTC GAACCGGTTA CCCACCACGG TTCATCTACA
AGAACACCAC GATCTTCCGC ATCTTCATCC ACCGATAAAT CGAATAATTT GCCATCTATC
ACTCCGGATA ACACGGCAGC CAGGCCCGGT GGTTCGCCGA ACCCGGTGAC GATTGATCTG
GATGAGGTAA GTCCCCCCAC CCCGGGGGGC TCGACCATGC TCTCCGTCAG CGGCCAGCTC
ATCACCACGG TCGGGGAGCC GCTGGCCCAC CTCTCCATGA GCCTGTGGAT AGGCGCCAGG
GTGACCTCAC GCGGGCAGCT GGCCACCCTG ACCGATGACC CGGCCACGGC GAGAACACGC
AGCTACCGCC AGGTGGTCCT CGCGAACGGC GACGGCACCC AGCCGCTGGC CGCCCCGCCC
ACCCAGGGTG CCGCGCCCGT GCCCTTCCGG ATCGCCACCG ATCGCGCCGC GCTGGACCTG
CCGCCGAGCC TGGGGGTGTA CCCGATCCAG CTACGGATTA CCGGCTCCGT CGGGGGCCGC
GCCGTGGCAC CGATCGGCGC GGCCTACACC TTCCTGGTCT GGGCGCCGAA TCCGCAGCAG
CCGAGGACGC CGATCGCAGC GGTCCTGCCG ATCGCAGATC AGCCGCGCCT GCGCTCGGAC
GGCAGGCTGA CCGACAACGC GCTCGCGGAC GAGGTGAAGC CGGGCGGCCG GCTGTCCCGG
CTGCTGACCG CCGCCGCGCC GCCGGTCACG CTCGCCGTCG ATCCGACCCT GGTGCAGGCG
CTGACCATCA TGGCGCAGCC GGGCGGCTAC GACTACGCCA CCCCGGGCGG GCCCGTTCAT
GCGCAGGCGA ACGCCGATGC GGCCCTTTTT CTCTCCGACA TCGTGCGCTT CGCCGAGCAC
GGAGGCATGG TGTTCGCGCT GCCCTACGGG GATGCCGATC TGACCGCGCT CGTCCACGCC
CGGAAGCTGG ACACGATGAA GTACGCCGTC AAGACTGGCG AAGTTGTGCT GGCCCAACTG
TTGGGGCGTG CCCCTCTACA GAAGGGCACC ATCGCATACC CTGCCGACGG GTCGGCGGAC
GCCACCACCG TGGACGTACT GCGCCAGCTG GAGGTCGGGA CCGTGATCCT CGATGACCAG
CTGCTACCCC CCGCGACGAA GATCACCTAT ACTCCGTCAG CGACCGTCAG CCTCGCCACC
TCGGCCGGAC CGATCCAGGC GCTCGCGGCC GACCGCCGGC TCACCAGGAT CGCAACCGCC
TACACCGGGC TGCCCGACGG TCCCGACTTC GGGACGGCCC TGGCACGCCT GCGGGCCGAG
CTCGGCATGA TCACCGCCGA ACGACCCGAG CAACGGTTCC AGGTCCTGGC GCTGCCCCGG
GACTGGGATC CGCCGAGCGA CTGGGCACGG TCGGTACTCG ACACCCTCAC CAGCGGCTAC
TCCATCTCGG TGGGACTCGA CGCAGGTGCC GGGGGCCAGG GCCGGCCGGC CGACCGGCCG
GGCCGGCTCG TCTACCCGGC CGAGGCACAG GCCCGCGAAC TGCCGGCCAA CTACCTCACC
GCCGTCGAGG ACGTCCTCGA CGAGGTGCAG GCCCTGAGCC CCGTGCTGTG CCCACCCAGG
CGCGGTCCCA GGTGCCAGCT GCAGCTCATC AACCCGATGA AGAACGCGCT TGTCACGGCG
GCCTCCGCCG CGTGGCGGGG AGACCGCAGG GTTGACGGTG TGTCGTTGAG CCAGCAGGTG
GACGGGGAGG CGTCCGCGAT CCGCAACGGA ATCCAGGTGG TGGCCTCCCG CAGCGTCAAC
CTGACGAGCA AACGTGGGCT GGTCCCGATC ACCTTGGAGA ACAACACCCA ATACGAGGTC
ACTGTGGTGC TCGCCTTCTC TTCGACCAAC CGGTCCCGGC TGCGCTCGCC AGCCCGGCAG
ACACTCCGCC TTCCCCCAGG GCAGAAGGCT CAGGTCGAGA TCGGGATGGA GGCCGAAGGC
GCGGGAACGT TTCCCCTCGA GATCCGCAAA CTCAACCTCG ACGGGCAGGC GCTGTCCAAG
GAACCACCGA ACCGGGTTCT CGTCCGATCC ACCGTCTACG GCGCCATCGC GACCGCGATC
ACCATCGGTG CCATCGGCGT GCTGCTCCTG GCGGTCATCA TCCGCCTGGC GCGACGGCTA
CGGGCCCGCC TGCGTGGGGC CGAGGACGAC CTGGGTAGAC CGGCGGGTCC GCACGAGGCT
ATGACCGGGC CGACTGCCCC GCTCTACCCG GTGAACCAGG TGAACCAGGT GAACCAGGTC
GATCCGGTCG CCCCCGCCGG TCCGCTCACC CCCGCCGGTC CGCTCACCCC CGCCGCCTCG
GTCTCGACCC GGCATAGTGG CGACGACGAG CGCGAAGCCG CCACCTCGAC TACGACGACG
CCGGTGGTAC GGACGGCCGT CCGGCGCTAT GACGACCAGG ACCGAGACAC CCTTCGCCAC
GGCCTCGACA GCCAGGAGGA CACCGGCGAC TCGGCGGCCA CAGCTCCCTT CCGACGGCGA
CGGAGCGGCC CATGA
 
Protein sequence
MREKGPGRRR RGPRWAAMLA ALALFAVAVP GVCAASASAS APASAGGVLV EPVTHHGSST 
RTPRSSASSS TDKSNNLPSI TPDNTAARPG GSPNPVTIDL DEVSPPTPGG STMLSVSGQL
ITTVGEPLAH LSMSLWIGAR VTSRGQLATL TDDPATARTR SYRQVVLANG DGTQPLAAPP
TQGAAPVPFR IATDRAALDL PPSLGVYPIQ LRITGSVGGR AVAPIGAAYT FLVWAPNPQQ
PRTPIAAVLP IADQPRLRSD GRLTDNALAD EVKPGGRLSR LLTAAAPPVT LAVDPTLVQA
LTIMAQPGGY DYATPGGPVH AQANADAALF LSDIVRFAEH GGMVFALPYG DADLTALVHA
RKLDTMKYAV KTGEVVLAQL LGRAPLQKGT IAYPADGSAD ATTVDVLRQL EVGTVILDDQ
LLPPATKITY TPSATVSLAT SAGPIQALAA DRRLTRIATA YTGLPDGPDF GTALARLRAE
LGMITAERPE QRFQVLALPR DWDPPSDWAR SVLDTLTSGY SISVGLDAGA GGQGRPADRP
GRLVYPAEAQ ARELPANYLT AVEDVLDEVQ ALSPVLCPPR RGPRCQLQLI NPMKNALVTA
ASAAWRGDRR VDGVSLSQQV DGEASAIRNG IQVVASRSVN LTSKRGLVPI TLENNTQYEV
TVVLAFSSTN RSRLRSPARQ TLRLPPGQKA QVEIGMEAEG AGTFPLEIRK LNLDGQALSK
EPPNRVLVRS TVYGAIATAI TIGAIGVLLL AVIIRLARRL RARLRGAEDD LGRPAGPHEA
MTGPTAPLYP VNQVNQVNQV DPVAPAGPLT PAGPLTPAAS VSTRHSGDDE REAATSTTTT
PVVRTAVRRY DDQDRDTLRH GLDSQEDTGD SAATAPFRRR RSGP