Gene Francci3_0741 details

Gene Information

Locus tag: Francci3_0741
Symbol:
ID: 3905868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 857291
End bp: 858619
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 74%
IMG OID: 637878074
Product: F420-0--gamma-glutamyl ligase
Protein accession: YP_479854
Protein GI: 86739454
COG category: [C] Energy production and conversion; [S] Function unknown
COG ID: [COG0778] Nitroreductase; [COG1478] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01916] F420-0:gamma-glutamyl ligase; [TIGR03553] F420 biosynthesis protein FbiB, C-terminal domain
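
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (a common convention for such records, but not stated in the source) and a CDS that ends with a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of the record.
START_BP, END_BP = 857291, 858619

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1329 442
```

Both results agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields.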


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.918099
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTGAGC GCAGCGCGGA ACGGGCGCTG CGCGTGTTCC CGCTCACCGG CATCGGCGAG 
GTCCGTCCCG GCGACGATCT CGCCGTCCTG GTGGCCTCGG CCGTCCGGAC GCACGGCCCG
ACACTCGCCG ACGGCGATGT CGTCGCGGTC ACCTCAAAGA TTGTTTCGAA GGCCGAGGGA
CGGCTCGTCA CGGTGTCCGG CGACCGCGAG GAAGCCAGAC AGGCCATGAT CGACAGTGAA
TCGGTACGCG AGGTCGCCCG GCGCGGCCCG ACCCGGATCG TCGAGACGCA CCACGGTTTC
GTCCTCGCCA GCGCGGGGGT CGACGCGTCG AACATCGCCA AGGACTCCCT CGCCCTGCTC
CCGGTGGATC CGGACGCCAG CGCCAGGCGG CTGCGGTCGG GCCTGGCCAC CGTGCTCGGC
GTGGATGTCG CCGTCATCGT CACCGATACC GCCGGACGGC CGTGGCGGCG CGGCCTGACC
GACATGGCCG TCGGGGTGGC CGGCATGGCG GCGCTGCGCA GCCACGTCGG CGACCTCGAC
GGCTACGGCA ACGAGCTGGG GATGACCGAG GTGGCCGAGG CCGACGAGCT CGCCGCGGCC
GCCGACCTCG TCAAGGGCAA GCTGGGCGCC ACACCGGTGG CAGTCGTCCG CGGCTACGGC
CGGCTGCCCG ACGACGGCGC CGGCGGGCGG GCACTGCTGC GTCCCGCTGG TGAGGACATG
TTCCGCCTCG GCACCCTCGA GGCGCGCCGG GCGGCGCTGC GCGACCGGCG CACCGTGCGG
GACTTCTCCG ACGCCCCGGT CGACCCGGCC GCGGTCGACC GGGCGATCGC GGCCGCACTC
ACCGCCCCGG CCCCGCACCA CACCACACCA TGGCGTTTCG TGATCGTGAC CGAGCGGCAC
GCCGCGCTGC TCGACGCGAT GGCCGAAGCC TGGGCGGACG ATCTACGACG CGACGGGTTC
GACGAGGCGG CCGTCGAGCG TCGGCTTCGG CGCGGCGAGG TGCTGCGGCG TGCCCCGCTG
CTGATCGTTC CGATCATGGT CCTCGACGGC GCGCATCCCT ATCCGGACGC CCGCCGCGCC
GCCGCCGAGG AGCGGATGTT CACCGTCTCC GTCGGGGCCG GGGTGCAGAA CCTGCTCGTC
GCCCTGGCCA CGGAGGGTCT GGGGTCGTGC TGGGTGTCGT CGACGCTGTT CTGCCCGGAG
GTGGTCACCC GGGTGCTCGA CCTGCCGGCC GATTGGACAC CGATGGGCGC GGTCGGGGTC
GGGCACGCCG CCGCGCCCGC ACCCGCCCGA CCGGACCGCG ATACCGCGGC GTTCGTCCTC
CACCGCTGA
 
Protein sequence
MSERSAERAL RVFPLTGIGE VRPGDDLAVL VASAVRTHGP TLADGDVVAV TSKIVSKAEG 
RLVTVSGDRE EARQAMIDSE SVREVARRGP TRIVETHHGF VLASAGVDAS NIAKDSLALL
PVDPDASARR LRSGLATVLG VDVAVIVTDT AGRPWRRGLT DMAVGVAGMA ALRSHVGDLD
GYGNELGMTE VAEADELAAA ADLVKGKLGA TPVAVVRGYG RLPDDGAGGR ALLRPAGEDM
FRLGTLEARR AALRDRRTVR DFSDAPVDPA AVDRAIAAAL TAPAPHHTTP WRFVIVTERH
AALLDAMAEA WADDLRRDGF DEAAVERRLR RGEVLRRAPL LIVPIMVLDG AHPYPDARRA
AAEERMFTVS VGAGVQNLLV ALATEGLGSC WVSSTLFCPE VVTRVLDLPA DWTPMGAVGV
GHAAAPAPAR PDRDTAAFVL HR
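
As a cross-check of the two sequences, the sketch below translates the first 60 nucleotides of the gene sequence and compares the result with the start of the protein sequence. It assumes the standard codon assignments (translation table 11 uses the same amino acid assignments) and that the initial GTG is read as methionine because it is the annotated start codon. `translate` and `gc_content` are illustrative helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order. Translation table 11 shares these
# assignments and differs only in which codons may serve as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS that begins at its start codon; stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        # Assumption: the first codon is the start codon and is read as Met,
        # even when it is an alternative start such as GTG.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first 20 residues of the protein sequence.
GENE_START = "GTGTCTGAGC GCAGCGCGGA ACGGGCGCTG CGCGTGTTCC CGCTCACCGG CATCGGCGAG"
PROTEIN_START = "MSERSAERALRVFPLTGIGE"

print(translate(GENE_START) == PROTEIN_START)  # True
print(round(gc_content(GENE_START), 1))
```

The GC content field (74%) describes the full 1329 bp gene, so the value printed for this 60-nucleotide window need not match it. Passing the complete gene sequence to the same helpers would check the whole record.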