Gene Francci3_0901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0901 
Symbol 
ID3906276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1042363 
End bp1045176 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content56% 
IMG OID637878234 
Producthypothetical protein 
Protein accessionYP_480014 
Protein GI86739614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTCAG ATAAAGACCA GCTCGCGGAC GAACTCAACG AAGTAGCGCG ACGCTCGATC 
CCACACACTT TCACGCGGCA CTTCCAAAGT CCGCCGCTTG CGATCCCGAT GCTGCGTCGT
ATCCTCGATA GCCGCGATCC TGGCCTTTTT ACACAGAACC CGGGACGCGC GCTGCGGGTC
CTTATTGATC AAGCAGTCGC TCAGCTTGAG GCTGCTCCCA TGGCACGCCT TCAGGACAAA
TTCACCTGGC GTGAGCTCGG TGCAGAAATA TTTGCAGATA GCAACAAGTC GTACAGCGAA
ACGTTCCAGG AGATCCGGAC GGGTTGCGGC GAGCCCCTCC TATCAGAGAA AACCGTTCAG
CGCGCAATTG AAGATCTCCG TGACAGGCTG GCGGCCATCC TGCAGCGGAT GGAAATCTCC
GATCGGCATT TCGTACTTGC CGAAGATCCG GAATATGTTT CGCGACCGGA TCTTGAGAAG
AGGTTTTCTG AGGTGCTTTC CTCTGGAGAG CGTCTGGTTC TGATTCACGG TGAGGCGGGT
ACCGGAAAAA CCACACTGGC GCTCCACCTG ACTCGGAGTT TCCTTCGGTT TGCGGCGCAC
GACTGGATTC CGGTCATACC CTTTCAAGAA GGGGGTAAGG GGCCAGTCGA AGATTCATTC
CGCGAGATGC TTGATCGTCA CGGATTTGCG GTTTCGGATG TCGGCGTAAG TGAGCTAGTT
GTAACCTTCA GAAAACTCGT CACCAGTGAA ACACCGCCAC CGGTCGTTGT ACTCGACAAC
ATAGACTCTC GTACTGAACT GGAGAGATTC GTGCCGCCGG GAGTGCGGTC ACGGATAATT
ATTACCAGTC GAAAGAATCT CGACAGCAAA GTGGTAAACA TCGCGCCAAT CGAAGTCATG
AATATGGCGA ACGATGAGGC TGTTCAGCTA GTTTCCCTGC ACAGTCAAGG GTTGCAGGAG
GATGGTAAAA AATATCTTGC TGAGGCGCTA GACTTCAGGC CGCTCGCGAT AGTGCACGGA
TGCGCCTGTC TGATAAACGC TCCTTACAAT GGTGATCTCA CAGCCTTCCT TGAAAGTCTG
CGCGAGAATG TAGCGGTCGT GCTAGAGAGC TACGGCGACG ACGAGGATAC GACGATCACA
GCGATCTACC GCATGATTGT CGACGCCCTG AAGCGCCAGC CGCAGTATGT GCTTCTTGCA
CTCGACCTGG TGATCCTTGC GTACCCCGTG CGCTCTGTCC AGGAAGCTCT GCGTCTATCG
TGGCCGCCAT CGGACGTGGT CGAAGGCCAA ATCAGCAAAG AGCTAAATCT CGAAGATGAG
GCAATCCTTA GCAAAGGCCT GCGAATCATC GAACGCTGGA ACCTAGTACA CCCAAACGCG
CTACCATTCC TCCATATACA CAGTCTGACA CGTGAACTCG TTGCCTCGAT CCGAGGCGAC
GCACTGGCTG CCGTCGCCGT GCAAGCCTAT AGAGTAGCAT CACATCGACT TAATTTGTCA
TCTTGGATGG GCGGTGCACC GATTCCTTCC TCCTGGCCGC AAGAAGTCGA GTACATGCTC
GCGAGCTTCG ATATTTCGGC TCGCGGCTCG GCGGTCAGAA CTGGCATTGC CTTCTCGCAA
GGGGACGAGA CAGATCCGCT ATGCCAAGTG CCGGCACTTT TTCTCCGCCG TGGAAGGCAG
CGCGAAGAGA TGTCAGCGGA GGCCGTCAAT GGCGCCCGGT CGGTCCTATC TGAACCCTGG
TCCCGGCGCT TCACGGCCCT TGAGACTGAA GCACTCGAAC TCGGAATCCT TGATGAGGTC
GGAGATGTTC TCAATACGGT GCGTGCGCGT CGACCTGATG CGTATGTGTA CGAGGCGCTT
CTATCGAGCG ATCTCAGGCC GCTAACCGAG GCTTACACGA CCGACGTGAT CGAGGATGCT
CGCTGGCACA TCGACCATCC CTTGGCAACT GAAATGCCTC TTACTCCCGC CCAGCACAGC
TATGCCCTTG GCGTGTTCCA TTTTCAACGA TGCGAGTGGA AGGAGGCCGA AGCCTGCTAC
ATGCAGAGTT CGGAATTCTA CAAACTGCTG GCTAGCGAGA AGGCTGAATT CGGCCTGTAT
GCGCTTGAAG CGGGGCGTCG CCTTGCGGAT CTCGACCTTC GCCGAGGCGA TTTGAATTCT
GCAAATGAAC GGATCTATGC ACTTCTATCA GAGGTCTTTG AACTTGGCAA GTCCGGCATT
GTGGATGCGT TCTTGAGTCG ACGTATTACC CAAACCGGCC TGCGGATACA GTCGGAACGG
ATGTTGCGTG GTGCTCCGAG CGGCGGCAAC ATCGACCAAC TCCTGGCCCT TTATCAGGAC
CTGATGCTGA AGTTCGCAAA GACTGGCAGT TCTCTGCCTC TTCTTGAGGT GGAGTTTGCT
CGGGCGGTGA TGACTGCGCT GGTCGACAAC AAGCAGGCGA ACAGGATGCT CACCGATCTG
GGTAGCCGTT GTAGAGACTC TGGCTATAAG GTCGGAACTA CGGTCTGTAT GGCCACTCAG
TTGAAAGTCA TAATCGCAGT ACATGCCGGC ACTGCAAAGC CCGCTCGGTT TGCGCAGCTT
GCAGAATGGG CGCTTTCTCT CGCAGAAGAT TTTGTCGAGG TCAGCCGTTT CTGGCATGCC
GATGTACTAT GTTCCGCCTT GGCTTGCGCT ATTCTCGGAG ATGTACCTGA CAGGCGAGCG
GAGGAAATAC GATCCAGAGC GCAGGAAGCG GCATCGCTGA TCAGCCGTCC TGACAAGATG
ACCGTTGCGG AGAAGGTAGG CGGAGTGCCG CCATATTTTC TCCTCCGAGA ATAG
 
Protein sequence
MKSDKDQLAD ELNEVARRSI PHTFTRHFQS PPLAIPMLRR ILDSRDPGLF TQNPGRALRV 
LIDQAVAQLE AAPMARLQDK FTWRELGAEI FADSNKSYSE TFQEIRTGCG EPLLSEKTVQ
RAIEDLRDRL AAILQRMEIS DRHFVLAEDP EYVSRPDLEK RFSEVLSSGE RLVLIHGEAG
TGKTTLALHL TRSFLRFAAH DWIPVIPFQE GGKGPVEDSF REMLDRHGFA VSDVGVSELV
VTFRKLVTSE TPPPVVVLDN IDSRTELERF VPPGVRSRII ITSRKNLDSK VVNIAPIEVM
NMANDEAVQL VSLHSQGLQE DGKKYLAEAL DFRPLAIVHG CACLINAPYN GDLTAFLESL
RENVAVVLES YGDDEDTTIT AIYRMIVDAL KRQPQYVLLA LDLVILAYPV RSVQEALRLS
WPPSDVVEGQ ISKELNLEDE AILSKGLRII ERWNLVHPNA LPFLHIHSLT RELVASIRGD
ALAAVAVQAY RVASHRLNLS SWMGGAPIPS SWPQEVEYML ASFDISARGS AVRTGIAFSQ
GDETDPLCQV PALFLRRGRQ REEMSAEAVN GARSVLSEPW SRRFTALETE ALELGILDEV
GDVLNTVRAR RPDAYVYEAL LSSDLRPLTE AYTTDVIEDA RWHIDHPLAT EMPLTPAQHS
YALGVFHFQR CEWKEAEACY MQSSEFYKLL ASEKAEFGLY ALEAGRRLAD LDLRRGDLNS
ANERIYALLS EVFELGKSGI VDAFLSRRIT QTGLRIQSER MLRGAPSGGN IDQLLALYQD
LMLKFAKTGS SLPLLEVEFA RAVMTALVDN KQANRMLTDL GSRCRDSGYK VGTTVCMATQ
LKVIIAVHAG TAKPARFAQL AEWALSLAED FVEVSRFWHA DVLCSALACA ILGDVPDRRA
EEIRSRAQEA ASLISRPDKM TVAEKVGGVP PYFLLRE