Gene Francci3_3427 details

Gene Information

Locus tag: Francci3_3427
Symbol:
ID: 3905667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4075126
End bp: 4077648
Gene Length: 2523 bp
Protein Length: 840 aa
Translation table: 11
GC content: 73%
IMG OID: 637880750
Product: hypothetical protein
Protein accession: YP_482510
Protein GI: 86742110
COG category: [K] Transcription
COG ID: [COG2378] Predicted transcriptional regulator
TIGRFAM ID:
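
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("a complete CDS should be a multiple of 3 bp long")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4075126, 4077648)
print(length, protein_length_aa(length))  # 2523 840
```

Both values agree with the Gene Length and Protein Length fields of this record.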


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCTA CCGGCAGACC GGGCGACCGT GCCCGTTCGG CGCGCATAAG AATGGAGGGG 
ATGACACCAC GATCGTTCGT TGACCATCTT GCGGATCTGG ACTTGCCGGC GCTGGCCGCT
CTGCTCCGCT CGCGTCCGGA CGTGCTGATC GAGCCGGTAC CCCGCGGTTT CGGGCAGCTC
GCGCAGCGGC TGGAAAGCCC GGATTCGCTG GCGGCGGCGC TGTGGGAGCT GAACCGCGAC
ACGCTCCGGG TCGGCCAGGC GGCGGCGCTG CTGGGGGAGT CCGCGACGGT GCCCCGGATG
GCCGAGCTGT TGGGCGCGTT ACCGGACGCG GTGCGCGCCG GGGTGGACGA TCTGTGCGCC
CGCGGCCTCG CCTGGACCGA CACGGTGGTC CGACTCCCGG AGCCGCTGCA CAACCACTGG
GTCGAGGAGA TCAGCTCCGC CCGCCCGGTC GCAAAAATCG CCCGCAACGT CCTGGTCGAG
GATCTGCGTA AGGCGGTCAC GGCGCTCGGG GCTCCGGCCG AAAACCTGCG CAAGACCGAG
CTGGTCGCGC GGCTCACCGA ACTCATGTCC GATGCCCGCC GCATCGCGAC GATCATCGCT
TCCCTGCCCG CCCCGGCGCG GCAGCAACTC GATGACCTGC GCCTGGCCAC GGCCGGCGGC
TACTTCCCCT ACATGTTCCC CTACACGCCC TTCGAGGAGA TCCATGACCG CAGGGCAGCC
ACGCGGTCCA TCAGGCAGCT GATCACCGCG GGGCTGGTGC TGGGCGTCGG CAACCGGCCG
GAGCTCCCCC GTGAGGTCGC GGTCGCCGCC TGGCTGGCCA GGCAGGACCT GGGGCTCACC
GGGCCGCCGG ACATTCCCCG GCCGGAGGTG AACGAGGCCG CCGTCCGGTC GAGCTCCCAG
GCGGCCGCCC AGGGCGCTAT CCGCGCCGTG ACCGCCCTGC TCGACGAGGC CGCCGCCGCC
CCCGTCGTGG CGCTCAAGCG AGGAGGCGTC GGTGGGCGGG AACGGGCCCG GCTGGCGAAG
CGGCTGTCGA CACCGTTGGC CGAGCTGCCG CTGTGGATCG ACCTGGTCGC GGCCGCCGGC
CTGCTGGCCT TCACCGAAGG CGGCTACGCA CCGAGCCCGC GGTTCCAGGA GTGGCGGGAG
GAGTCGCCAA GCCATCAGTG GGCGACGCTG GCGGCGGCGT GGCACGCTCT GGAGCACGCG
CCGACGAGCC GCGAGGGTGC CGACGAGAAG GACGTCCCGC CGCCCCTGCC CATCGGGTCG
GGGGCGGGCC AGATTCGCCG GGTGCTGATC CGCGCGGCGG CCGGCGGGCG GTCGGTGCGG
CTCACGGCTG AGAACCTCGA CTGGTTCTTC CCCCTGCACG GCTACGACGC CGCCACCTGC
AAGATCAAGG TGGCGGCGTC GATCCGCGAG GCCGAGCTGC TCGGGGTGTT CGCCCTCGAC
GTCCTGTCCG AGCCCGGCCA GGCCCTGATC GAGGCCGCGC CGAGCACCAC CGACGCGGCG
GCTGCCACCG TCGGGCTTAT GATCGTCTCG ACCGCGGACG AGCTCGCCGC GCAGGTCGCG
AGGACGGTGC GGGATCTGGC GGACCGCTGC GCCGAGCTGC TGCCCGCGAC GCCGTCCAGC
CTGATCCTGC AGTCCGATCT CACCGCGGTC GTGTCCGGCC AGCCCAGCGC GGCGATATCC
CAGCTCCTGC GGGCCGCGGC GGTGCCGGAG TCCCGGGGCG CGGCCGGGAC CTGGCGGTTC
ACCTCCGCCA GCGTCCGCGC CGCGATGGAC GCCGGATGGT CCGCCACCGA GCTGCTCGAA
CAGCTACGCG CCATCGCCGA CCACGGTCTC CCGCAGCCGC TCGAATACCT CATCGCCGAC
GTCGCCCGCC GGCACGGACA CGTGCGGGTC CGCGGGATGC GCAGCGCCGT CCTCGCCGAC
GAGCCCACGA CCGCCGAGAT CCTCCACACC CGGACACTGG CGAAGCTGCA CCTCGCCCGG
CTCGCCCCCA CCGTGCTGGC CAGCCCGGTG GAACTGGACA CGGTCCTCGC CGAGCTGCGC
CGGACGGGCT TCTACCCCGT CGCCGAGGAC GCGACGGGCA CCGTCATCGT GACGAGCGGA
CCGGGGCGGA AGGCGCCGCG GGGGGCCCCG GCACAGCCGT CCCGGGAGTC CCGGGCGTCC
CGACCGCACG GAGGGTCCCC CGCCGGTCCG CCGGCGACCC GACGGCGGCT GACGGCCGAC
GAACTCGCCC AGCGCCTGCG CAGCGGCAAT GACGACGGTG TCACCCTGCT GACCGACCTG
GCCCAGCGGC TCGGCGAGAT GAACGGACGG CTAAACGACG CCGAGCTCGC CGTGCTGGCG
GACGCGGTGG AACGCCGCAG CGATGTGGTC ATCGCCTACC GGGACAAGCA GGGCACCCGG
ACGGTCCGCC GCATCCAGCC GAACCAGCTC TTCGGCCGGT GGCTGGACTC CTGGTGCCAC
CTGCGCAACG CCGAGCGCGA GTTCGCCATC GCCAACATCG AGTCCGTCAG TCCCGTCGGC
TGA
 
Protein sequence
MSATGRPGDR ARSARIRMEG MTPRSFVDHL ADLDLPALAA LLRSRPDVLI EPVPRGFGQL 
AQRLESPDSL AAALWELNRD TLRVGQAAAL LGESATVPRM AELLGALPDA VRAGVDDLCA
RGLAWTDTVV RLPEPLHNHW VEEISSARPV AKIARNVLVE DLRKAVTALG APAENLRKTE
LVARLTELMS DARRIATIIA SLPAPARQQL DDLRLATAGG YFPYMFPYTP FEEIHDRRAA
TRSIRQLITA GLVLGVGNRP ELPREVAVAA WLARQDLGLT GPPDIPRPEV NEAAVRSSSQ
AAAQGAIRAV TALLDEAAAA PVVALKRGGV GGRERARLAK RLSTPLAELP LWIDLVAAAG
LLAFTEGGYA PSPRFQEWRE ESPSHQWATL AAAWHALEHA PTSREGADEK DVPPPLPIGS
GAGQIRRVLI RAAAGGRSVR LTAENLDWFF PLHGYDAATC KIKVAASIRE AELLGVFALD
VLSEPGQALI EAAPSTTDAA AATVGLMIVS TADELAAQVA RTVRDLADRC AELLPATPSS
LILQSDLTAV VSGQPSAAIS QLLRAAAVPE SRGAAGTWRF TSASVRAAMD AGWSATELLE
QLRAIADHGL PQPLEYLIAD VARRHGHVRV RGMRSAVLAD EPTTAEILHT RTLAKLHLAR
LAPTVLASPV ELDTVLAELR RTGFYPVAED ATGTVIVTSG PGRKAPRGAP AQPSRESRAS
RPHGGSPAGP PATRRRLTAD ELAQRLRSGN DDGVTLLTDL AQRLGEMNGR LNDAELAVLA
DAVERRSDVV IAYRDKQGTR TVRRIQPNQL FGRWLDSWCH LRNAEREFAI ANIESVSPVG
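
The sequence blocks above are printed in groups of ten characters separated by spaces and line breaks, so they need to be joined before any analysis. The following is a minimal sketch of how a gene sequence block could be cleaned and then checked against the GC content and CDS fields of the record. The helper names are hypothetical; the stop codon set is the standard one for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on a short fragment; paste the full gene sequence block to check the record.
fragment = clean_sequence("ATGTCTGCTA CCGGCAGACC\nTGA")
print(len(fragment), looks_like_complete_cds(fragment))
print(f"GC fraction: {gc_fraction(fragment):.2f}")
```

Run on the full gene sequence, the cleaned length should match the Gene Length field, and the GC fraction can be compared with the GC content field.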