Gene Francci3_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4056 
Symbol 
ID3907017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4849088 
End bp4851106 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content72% 
IMG OID637881385 
ProductIucA/IucC 
Protein accessionYP_483135 
Protein GI86742735 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4264] Siderophore synthetase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.512402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG ATCGACCGCA GTCCGAGGCC ATCGCGGACG CGACCATCGC GGACGAGATC 
GTCACGGAGA CTCTGCTGCG GTGCTGGGTG CGGGAGACAG GCGTCCGGTT ACCGGCAGAC
GCCCGGACGC TCGGCATCGA CCTACCCGCA TCCGGCCTGT CCGTTCGGGC GACGTTGACG
CACCACTCCC CGGCCGGATG GCACGGGTTC GGACCACCAC GCCTGGTTTG CGCGGCCCGC
GAACCAGCCC CGGTGGACGC GGGCCTGCTC GCCGCGCTGC TCGTCCGAGA GGTCGTCACG
CGGGATGGGC TACCGCCGGC CCTCGGCGCC CAGGCGCTGG GAAGGATCAT CGATTCCGCC
GGCTGGATCG CGAGCTGCCT CGCGGCCCGC GGTCCCGGGC CGGACCCGGC CACCATCCCC
CCGTTCCTCG CCGCGGAGCA GGCCCTGATC ACCGGTCATC CGTTCCATCC GGCGGCCAAG
AGTCGCCAGG GGGCTTCGCC GGAGGCACTG GCCACCTACT CTCCGGAAGT CGGCGGGTCC
TTTGCGCTGC ACTGGTTCGC AGCCCATCCC TCCGTGGTGA CCGTCTCGGG CAGTACGACG
ACCCGCTCCG GAGGGCGGGA CCAGGCCCTC GGCACGAACG GCCGGGGCCT GATCCGGCTG
CTGGCGGATC TGGCCAGGGA AGACGCTGGG GCCGAGGAGC CGAAGTCCGA CCTCTCGGCC
GCGTCCGGTT CCGGACCAGA CCGACATGGC CCACCCGGGC TCACGGGGTC CCCTCGGACC
GGCGGACGCG ATCGCCCGGA CATTCCCGCC GGTTACGTCG TGGTACCCGC GCATCCCTGG
CAGGCGGACG AGGTCAGCAG GCGGGAACCG GTCGCCGAGC TGCTTGCGGA CGGCCGCCTG
ATCGACCTCG GGGTGTCCGG GCACCGGTGG TATCCCACCT CGTCGCTGCG TACCGTCTGG
CGGCCGGAGA TCCCGGTGAT GCTCAAGCTC TCGCTGGGGA TGCGGATCAC CAACTCGCGC
CGGGTGCTGC AGCTCGACGA GCTGCGGCTC GGCGCGATGA CCGCGAGGCT GATCGACGCC
GGTCTCGGCG CGGCGCTCGC CGCCCGACAT CCGAACTTCC GCCTCATCGG GGAGCCCGGT
TGGGTGGCCG TCACCCGACC GGGAGGGGAG GGTCGGCCCC GAACCGGCCC CGCAACACCG
CTGCGGATCG GGTCCGGCGA GCGGCGCGAC GGAGGGCGCG ACGAGGACGG CGTCGGGCTG
GAGACCGCCG TCCGGGTCAA TCCGTTCGGG TCAGTCGACC GGGTGGTGTG CGCCGCGGCA
CTCATCGCGC CCAGGCCCGA GCGGACCACG GACAGCCGTG GCCGGGCCCG GACGGCGATG
CTTCCCCGGA TCCTCACGAC GCTGGCTGGC TCGCTGAGCG ATCCCCTCCC GACGCTGACC
GACGCCTGGT TCGACCGTTA CCTGACCGTG GTAGCCGCCC CGGTGGTGGA CCTCTACCTG
CATTTCGGGA TCGGCGTCGA GGCCCACCTG CAGAACACCC TGGTCACCCT GGACGCGGCG
GGCTGGCCGA TCGCCGGCTG GTACCGCGAC AGCCAGGGCT ACTACGTGGC CACCTCCGCC
ACGGCGGAGG TGCAGCAGCT GCTGCCCGGC TTCGGTACCG GGCTCAACGC GATCTTCGAT
GACGCGCTGG TGGCCGAGAG GACCATCTAC TACCTGTTCG TGAACAACGT CTTCGCCCTC
GCCGGCGCGC TCGGGGTCGC GGGTGTCGCG GACGAGCGCC GCCTGCTCGG GCGGGTGCGT
GGCCTGCTCC ATCGGCTGGA CGAGGATGGC CCACGCGGGC GGGGGCTGCG ACCACCCGCC
GCTCGCCGAG CACTGATCAC CAGGCTGCTC GACTCCCCCA CCCTGCCATG CAAGGCCAAC
CTGCGCACCT GCGTGGACGG GCGGGACGAA CTGGTCGGAC CGGTGGCCAC TCAGTCGGTG
TACGTGCCGA TCCCCAACCC GCTCCTGGAG GCCTCGTGA
 
Protein sequence
MTGDRPQSEA IADATIADEI VTETLLRCWV RETGVRLPAD ARTLGIDLPA SGLSVRATLT 
HHSPAGWHGF GPPRLVCAAR EPAPVDAGLL AALLVREVVT RDGLPPALGA QALGRIIDSA
GWIASCLAAR GPGPDPATIP PFLAAEQALI TGHPFHPAAK SRQGASPEAL ATYSPEVGGS
FALHWFAAHP SVVTVSGSTT TRSGGRDQAL GTNGRGLIRL LADLAREDAG AEEPKSDLSA
ASGSGPDRHG PPGLTGSPRT GGRDRPDIPA GYVVVPAHPW QADEVSRREP VAELLADGRL
IDLGVSGHRW YPTSSLRTVW RPEIPVMLKL SLGMRITNSR RVLQLDELRL GAMTARLIDA
GLGAALAARH PNFRLIGEPG WVAVTRPGGE GRPRTGPATP LRIGSGERRD GGRDEDGVGL
ETAVRVNPFG SVDRVVCAAA LIAPRPERTT DSRGRARTAM LPRILTTLAG SLSDPLPTLT
DAWFDRYLTV VAAPVVDLYL HFGIGVEAHL QNTLVTLDAA GWPIAGWYRD SQGYYVATSA
TAEVQQLLPG FGTGLNAIFD DALVAERTIY YLFVNNVFAL AGALGVAGVA DERRLLGRVR
GLLHRLDEDG PRGRGLRPPA ARRALITRLL DSPTLPCKAN LRTCVDGRDE LVGPVATQSV
YVPIPNPLLE AS