Gene Francci3_2728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2728 
Symbol 
ID3904718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3212637 
End bp3216959 
Gene Length4323 bp 
Protein Length1440 aa 
Translation table11 
GC content70% 
IMG OID637880051 
Producthypothetical protein 
Protein accessionYP_481817 
Protein GI86741417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.935327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACCG TGGGTGCCAC CACGCGGGGC CGGGGCATCC CCGGAGGTCG CGGCAGCGGT 
GGCGGCAGCG GCAGCGGTGG CGGTCGGCGC CGGCCGCCCG CGCCGGACGG CTCCGCGCAG
CACCGCGAAT GGCTGAGCCT CATCGAGGTC AGCGGGCCGT TCCTCAGCCT GCCCGTGCTC
CGGCGGGTCT GGCCGACGCT CGACCCGCTG GAGAAGAAGA CCCGGGAACG CCTGCGCCGG
GAGCACACCG CCTGGCTGGA CGATCCCGTC GGCGGCCAGG CGGCCTGGGT GCGTTTCGTT
CTCGGCGAGC TACTCGGCTG GGGGAACACG CTCGTCCTGC CCGATGGCAC GAACGGCGCG
GGTGGCGCGG ACGGCGCGAA CGGCGCGAAC GGCGCGCTGC GCTCGTTCAC CGTCGACGTC
GGCGAGCACG ACGCGCACCT CGTGCCGTCC TTCATCCTCG TCGATCCGGA CACAGCGGAC
ACAGTGGCCG CGGGGGACAC AGTGGCCGCG GGGGACACAG TGGCCGCGGG GGACACAGTG
GCCGGGGGGT CGTCCGAGGT GAAACCGGAC GCGGTCCGCG TGCTCGGTCT GGTCGTCGCG
CCGTCACAGT CCCCGACGGC GCGGATCGCC GGGCAGGCCT GGGCTGCGAC GCCGGTGGAC
CGGGCGGCGA TCGCCTGCCG GCATCACGGG ATCGAGCTCG CGCTGGTGAC GAACGGCCGG
CAGTGGGCGT TGGTTTGGGC GCCGCGCGGC GGGGTGACGG CGACGGCGGT GTTCGACGCG
GTGTCCTGGC CGGAGGCGGC CGAGCGCGAC GTCGTGCGGG CGTTCGTGTC GTTGCTGTGC
CGGCGGCGCT TCTTCGGCGT ACCCGACGAC GAGACGCTGG TCCCGTTGCT GAAGGCGAGC
CTCGGCAGCC AGGAGGAGAT CACCGAGGCG CTCGGCGTCC AGGTTCGCCA GGCCGTGGAG
CTGCTCGTCG CCGCCTTCGG CCGCTGGCAC GTCGCCGAGC GGGACCGGGG TGGCGCCGGG
CTGCGCGACG TCGGCGCGCA CGACGTCTAC CGGGGCGCGG TCGCGGTGAT GATGCGCATC
GTCTTCCTGC TCTACGCCGA GGAACGCGGC CTGCTGCCCG CCGGCAACGA CCTGTACGCG
AGTACCTACT CCGCCGGCCG GCTGTGCGCC GAGCTGGAAC AGCAGCTTTA CGACGGCGCG
ACCGAGGACG ACCTGGAGCA GTCGACGGCG GCCTGGCACC GGCTGCGCGC CCTGTTCACC
GCCGTGTACG CGGGGGTCGA TCATCCCCGG CTGCGCATGC ACGCCTACGC CGGTTCGCTG
TTCGACCCGG CCGCCCATCC GTGGCTGCCA CTCGGCGTGG ACGACCGCAC GGTGCTGCAC
ATGCTGCGTG CCGTCCAGTA CATCCAGGTC GGCAGCGGGA AGTCCCGGGA GCGGCGGACG
CTGAGCTTCC GCTCGCTGGA CGTCGAGCAG ATCGGTTACG TCTACGAGGG TCTGCTCTCC
TACGATGCCG AACGCGCCCC GGCCGCGGAG GTGGTCGTCG GCCTGGCCGG CCGGGAGGGG
CTGGAGGCCG AGGCGAAGCT GTCCGAGCTG GAGGAGCTCG CCTCGCGTCA CCGGGGCCGC
CCGCTGCTCG CCGCCGCGCT CGCCGAGAGG TTCGCCGCCA GCGGGATCGG CGCGCCGAAG
GCGGTCGAGC GCCGGCTCGT TCCGTTGGAG GCCGGCGAGC GGGAGGAGGC CCGCAAGGCC
CTGCTCGCCG TGACCCGCGG CGACTACCCG ACCGCCGAGC GGCTGCTGCC GTTCCACGGC
CTGCTGCGCA CGGACCTGCG CGGCCAGCCC GTCGTGGTCC TGGAGGACGG GCTGTACGTC
ACCGAGTCGC CGCTGCGGAA GAACACCGGC ACCCACTACA CCCCCAGGTT CCTGGCCGAG
GAGGTCGCGG ACAACACCCT GGAGCCCCTG GTCTACGCCC CCGGGCCGTT GCAGACCGCC
GACCGGGGCG CCTGGCGGCT GCGGTCCAGC GCCGACATCC TGTCGCTGAA GGTCGCCGAC
ATCGCGATGG GCTCGGCCGC GTTCCTCGTC GCCGCCGCCC GCTACCTCGG CGACCGGCTG
GTGGAGGCGT GGATCCGTGA GGGTGACCCG CGGGCCGCCG GGCACGCCGG GTACGCCGGG
TACGCCGGTG ACGCCGACGC GGACGACGTC ACCGTCGAGG CCCGCCGGCT GGTCATCGAG
CACTGCCTGT ACGGGGCGGA CATCAACCCG ATGGCCGTGG AGATGGCGAA GCTGTCGCTC
TGGCTGGTGT CGATGGACCC GCGCCGGCCG TTCACCTTCC TGGACGACCG GCTCGTCTCC
GGCGACTCCC TGCTCGGTAT CACCTCGGTC GAGCAGCTGG AGTACATGCA CCTCGACCCG
GCCAAGGGCC GGAAGATCCA CTCCGACATC TTCGGCTGGA CCGCCGGCGT CCGTGAACTG
GTCCGCGACG CCGCCGGGCG GCGCCGGGAC CTTGTCGAGA CCGACGGCTC GTCTCTGGCG
GGACTTGCGA GGAAGCGCAG GCTCCTGTCG GGGGTGGAAC ACGACACCGA GCGGCTGCGG
CTCTTCGCCG ACCTGACTGC CGGCGCCGCG CTCGTCGGCT ATCAGAAATG GTCGGGGACG
AGTAACCCGC ATGCCCGCGC CGAGATCGAC GAGGAGAAGA AGGCCAGCCG GGAAAGCTTC
TCCCTGACCG CGGCCCGCCT CGCTGATGAG GTCACCGGCA AGCGGGATGA GACCGAGGCC
CGGAAAACGG CACAGTCCTG GTTGGCCACC GATTACGTTC CCGGCGGTTT CCGGCGGAAA
CCCCTGCACT GGCCGCTGGT CTTCCCCGAG GTCTTCGAGG AGGGCGGTTT CGACGCCGTC
CTCGGCAACC CCCCGTTCCT CGGTGGCCAG AAGCTGACCG GTGCACTGGG CGTGGCGTAT
CGCGAGTACC TCGTCGAGGA GATCGGCGGG GGAGCACGGG GGAGCGCCGA TCTGGTCGCC
TACTTCGCGC TGCGTGCGCA CTCTCTGCTC AACGGTGGCG GTCAGACGGG GCTGATCGCG
ACGAACACCC TGGCCCAGGG CGATACCCGT GAGGTCGGCC TTGACCAGAT TGTCGCTGAC
GGGAACGTCA TCCGCCGGTC GATCAAGAGC CGGCCCTGGC CGTCGAAGTC CGCGGTCCTG
GAATTCAGCG CCGTCTGGAC CACAGGGCAG ATCGGGCCGG CCGCCGCCCG GTTCGCCGAC
GGCGTCCAGA CCCCCGTGAT CGCGCCCTCC CTGGAGCCCG AGGCACGGGT GTCCGGAAAT
CCTCACCGAC TGGAAAAAAG CGCAGGAATC TCCTTTATTG GAAGTTATGT CCTGGGTCTT
GGCTTCACGA TGGAACCCGC CGAGGCCCGC GCGTTGATCG CGAAGGATCC GCGTCATACC
GACGTGCTGT TCCCCTATCT GAACGGTCAG GATCTCAACT CGCGCCCCGA CTGCTCGGGC
AGTCGTTGGG TCGTCAATTT CCACGACTGG CCTGAGGACC GCGCAAAGCG GTACCGCGAC
TGTTACGACC AGGTCCGCCT CCTCGTCAAG CCAGAACGTG ACCGAAATAA TCGAAAAGTA
TATCGCGACT ACTGGTGGCA GTACGCTGAG AAGCGTCCCG CCATGCTCGA CGCGATGGGG
GATCTCGAAC GGCTTGTCGT CATCACTCTT GTGAGCCGCA CGGTGATGCC GGTGATGGTG
CCGACCGGGC AGGTGTTCGC GCACAAGCTT GGTGTGTTTG TCACTGACGA CGGGGCAATG
CTGTCACTAT TGTCGAGCGC GCCCCATTTT TGGTGGGCGA TGAGTCGTAG TTCGACGATG
AAGGCGGATC TCAACTACTC GCCGTCGGAC GTGTTCGAGA CCTTGCCGTT GCCTGAACTG
ACGGGGGAGA TGCGGGAATT GGGGGAGGGG TTGGACGGGT TCCGGCGGGA GGCGATGCTG
GCGCGGCAGT CAGGGCTCAC CAAGACGTAC AATCTGGTTT TCGATCCGGG CTGCAACGAT
GCCGATATCG CCGAGCTGCG GGAGATCCAC CGGCTCATCG ACGAGGCGAC CTTTCGGGCG
TACGGGTGGG TCGATATGAT CGACCGGGGC CTGGACCACG GCTTTCATCC GGCCGGGAAG
TACACTCGGT ACACGATCGG GCCGTGGGCG CAGCGCGAGG TGCTCGACCG CCTCCTCGAA
CTGAACCACG CCCGGTACGC CGAGGAGGTC GCCGCCGGCC TGCACGAGAA GGGCGCGAAG
AAGAAAGGCG CCCGCGCCAG GGCGGCCGCA TCGCCGGACC AGGGGTCGCT GCTTGACCTG
TAG
 
Protein sequence
MSTVGATTRG RGIPGGRGSG GGSGSGGGRR RPPAPDGSAQ HREWLSLIEV SGPFLSLPVL 
RRVWPTLDPL EKKTRERLRR EHTAWLDDPV GGQAAWVRFV LGELLGWGNT LVLPDGTNGA
GGADGANGAN GALRSFTVDV GEHDAHLVPS FILVDPDTAD TVAAGDTVAA GDTVAAGDTV
AGGSSEVKPD AVRVLGLVVA PSQSPTARIA GQAWAATPVD RAAIACRHHG IELALVTNGR
QWALVWAPRG GVTATAVFDA VSWPEAAERD VVRAFVSLLC RRRFFGVPDD ETLVPLLKAS
LGSQEEITEA LGVQVRQAVE LLVAAFGRWH VAERDRGGAG LRDVGAHDVY RGAVAVMMRI
VFLLYAEERG LLPAGNDLYA STYSAGRLCA ELEQQLYDGA TEDDLEQSTA AWHRLRALFT
AVYAGVDHPR LRMHAYAGSL FDPAAHPWLP LGVDDRTVLH MLRAVQYIQV GSGKSRERRT
LSFRSLDVEQ IGYVYEGLLS YDAERAPAAE VVVGLAGREG LEAEAKLSEL EELASRHRGR
PLLAAALAER FAASGIGAPK AVERRLVPLE AGEREEARKA LLAVTRGDYP TAERLLPFHG
LLRTDLRGQP VVVLEDGLYV TESPLRKNTG THYTPRFLAE EVADNTLEPL VYAPGPLQTA
DRGAWRLRSS ADILSLKVAD IAMGSAAFLV AAARYLGDRL VEAWIREGDP RAAGHAGYAG
YAGDADADDV TVEARRLVIE HCLYGADINP MAVEMAKLSL WLVSMDPRRP FTFLDDRLVS
GDSLLGITSV EQLEYMHLDP AKGRKIHSDI FGWTAGVREL VRDAAGRRRD LVETDGSSLA
GLARKRRLLS GVEHDTERLR LFADLTAGAA LVGYQKWSGT SNPHARAEID EEKKASRESF
SLTAARLADE VTGKRDETEA RKTAQSWLAT DYVPGGFRRK PLHWPLVFPE VFEEGGFDAV
LGNPPFLGGQ KLTGALGVAY REYLVEEIGG GARGSADLVA YFALRAHSLL NGGGQTGLIA
TNTLAQGDTR EVGLDQIVAD GNVIRRSIKS RPWPSKSAVL EFSAVWTTGQ IGPAAARFAD
GVQTPVIAPS LEPEARVSGN PHRLEKSAGI SFIGSYVLGL GFTMEPAEAR ALIAKDPRHT
DVLFPYLNGQ DLNSRPDCSG SRWVVNFHDW PEDRAKRYRD CYDQVRLLVK PERDRNNRKV
YRDYWWQYAE KRPAMLDAMG DLERLVVITL VSRTVMPVMV PTGQVFAHKL GVFVTDDGAM
LSLLSSAPHF WWAMSRSSTM KADLNYSPSD VFETLPLPEL TGEMRELGEG LDGFRREAML
ARQSGLTKTY NLVFDPGCND ADIAELREIH RLIDEATFRA YGWVDMIDRG LDHGFHPAGK
YTRYTIGPWA QREVLDRLLE LNHARYAEEV AAGLHEKGAK KKGARARAAA SPDQGSLLDL