Gene Information |
Locus tag | Franean1_5869 |
Symbol | |
ID | 5674192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7122275 |
End bp | 7123855 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244719 |
Product | F420-0--gamma-glutamyl ligase |
Protein accession | YP_001510121 |
Protein GI | 158317613 |
COG category | [C] Energy production and conversion; [S] Function unknown |
COG ID | [COG0778] Nitroreductase; [COG1478] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01916] F420-0:gamma-glutamyl ligase; [TIGR03553] F420 biosynthesis protein FbiB, C-terminal domain |
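
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes a terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(7122275, 7123855)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (1581 bp) and Protein Length (526 aa).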
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0435784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.795016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGACC TGTCCCCGGT GGGGACATCC ATGACGAAGG ACGGACCGGA ACCCGCGGAG GCGTCGTTCC CGGGCGTCGC CGCACCCGAG GCGGAAGGTC CGCCCCCGGT CGGTGGCGAA GCCCCGGCCG GGGAACCTCC CGCGGCGGAG GACGCGCTGC CGGCCGAGCA GTCCGAGAAG TTCGAGAACT TCGAGAAGTC CGACGAGCAG GTCGGCACGG GCGCGGCGGA GCAGGTCGGC GAGGAGCCGC TGGCGGAGCG GGAGCTGCGG ATTCTGCCGC TGGGCGGGAT CGGCGAGGTG CGGCCGGGCG ACGACCTCGC CGCGCTGATC GCCGTGGCCG CGGCCACGGG CGCCGGGCCC GGAGGCGGCC CCGGGCTGCG TGACGGTGAC GTCCTCGTCG TGACGTCCAA GATCGTGTCG AAGGCCGAGG ACCGGCTGAT CCGGATCGAA GGCGACAGAG AGGCCACGCG CCAGGCCGCC ATCGACGCCG AGTCCGTGCG TGAGGTCGCG CGCCGCGGCC CGACCCGCAT CGTCGAGACG CACCACGGCC TCGTGCTGGC GAGCGCGGGG GTGGACGCGT CCAACATCGC GAAGGACTCG ATCGCGCTGC TGCCGGTCGA CCCCGATACC AGTGCCCGCA CGCTGCGGGA CGGCCTGCGC GACCGGCTCG GCGTCGACGT CGCGGTGATC ATCAGCGACA CGGCCGGGCG GCCCTGGCGG CGCGGGCTGA CCGACATGGC GATCGGCGTC GCCGGGATGG CGGCGCTGCG CAGCCACATC GGCGACGTCG ACTCCTACGG CAACGAGCTC GGGATGACCG AGATCGCCGA GGCCGACGAG CTGGCCGGCG CGGCGGACCT GGTGAAGGGC AAACTCGGCG CCACGCCGGT GGCGATCGTC CGCGGGTTCG CGCGCCGGCC CGACGACGGG AGCGGGTCAC GCGCGCTGCT GCGGCCCTGG GACGAGGACA TGTTCCAGCT CGGCACGGTC GAGGCACGCC GCTCGGCCCC GTTCGCGCGG CGCACGGTGC GGGAGTTCGC CGACCTCCCG GTCGACCCGG CGGCGGTGCA CCAGGCGATC GCCGCGGCGA TCACCGCGCC CGCCCCGCAC CACACCACAC CATGGCGGTT CGTGCTGGTC GAGGCGCGCC GGCACGCCCT GCTCGACGCG ATGGCGATGG CCTGGACGGA CGACCTGCGC CGCGACGGGT TCGACGAGGA CTCGGTGAGC AGGCGGCTGC GCCGCGGCGA CGTGCTGCGC CGGGCGCCGG TGCTGATCGT CCCGATGATG GTGACCGACG GCGCGCATCC CTACCCGGAC GAGCGTCGGG TGCGGGCCGA GGAGCGGATG TTCACCGTGG CGCTCGGCGC CGCCGTGCAG AACCTGCTGG TCGCCCTCGC GGCCGAGGGC CTGGGCTCGT GCTGGGTGTC GTCCGCGCTG TTCTGCGGGG AGGTCGTGAC CGACACGCTG GACCTGCCGC GCACCTGGAC GCCGATGGGC GTCGTGGGTG TGGGGCACCC GGCCGCACCG GCGCACCCGC GGCCGGCCCG TGACATCGCG GCGTTCGTCG TGGAGCGCTG A
|
Protein sequence | MSDLSPVGTS MTKDGPEPAE ASFPGVAAPE AEGPPPVGGE APAGEPPAAE DALPAEQSEK FENFEKSDEQ VGTGAAEQVG EEPLAERELR ILPLGGIGEV RPGDDLAALI AVAAATGAGP GGGPGLRDGD VLVVTSKIVS KAEDRLIRIE GDREATRQAA IDAESVREVA RRGPTRIVET HHGLVLASAG VDASNIAKDS IALLPVDPDT SARTLRDGLR DRLGVDVAVI ISDTAGRPWR RGLTDMAIGV AGMAALRSHI GDVDSYGNEL GMTEIAEADE LAGAADLVKG KLGATPVAIV RGFARRPDDG SGSRALLRPW DEDMFQLGTV EARRSAPFAR RTVREFADLP VDPAAVHQAI AAAITAPAPH HTTPWRFVLV EARRHALLDA MAMAWTDDLR RDGFDEDSVS RRLRRGDVLR RAPVLIVPMM VTDGAHPYPD ERRVRAEERM FTVALGAAVQ NLLVALAAEG LGSCWVSSAL FCGEVVTDTL DLPRTWTPMG VVGVGHPAAP AHPRPARDIA AFVVER
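
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining the blocks and computing the GC fraction of a nucleotide string follows; the helper names are hypothetical, and the example uses only the first block of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First ten-base block of the gene sequence shown above.
first_block = "GTGTCTGACC"
print(len(clean_sequence(first_block)), gc_fraction(first_block))
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field of the record.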