Gene Information |
Locus tag | Francci3_1458 |
Symbol | engA |
ID | 3903190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1748701 |
End bp | 1750164 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878795 |
Product | GTP-binding protein EngA |
Protein accession | YP_480564 |
Protein GI | 86740164 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
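The reported lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are hypothetical, chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1748701, 1750164)
print(length, protein_length(length))  # 1464 487
```

Under those assumptions the values agree with the Gene Length (1464 bp) and Protein Length (487 aa) rows above.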
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.961762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00757148 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAGCGGAC AAGACCTATC AACGGACCTC GGCGCGGCCG GGTTCGACCG GTTTGCCGAC GGTGGCCTCG GCGATGACGT CGGCGGCCCC GATGGGTTCG CGGGCGAGCT CGACGGCGAG TTCGCGGGCC CCGGGATCGG TCAACCGGTG GTGGCCGTCG TGGGGCGACC CAACGTGGGC AAGTCGACGT TGGTCAACCG CATCCTCGGC CGGCGGGCCG CGGTCGTCGA GGACGTCCCC GGGGTGACCC GTGACCGCGT CGCCTACGAC GCGGTCTGGA GCGGGCGGCG GTTCACCCTC GTCGACACCG GCGGCTGGGA ACCCGACGCG ACGGGGCTGG CCGCCCAGGT CTCCGAGCAG GCCCGCGCGG CGCTCGACAC CGCCGATGCG GTGCTGTTCA TCGTGGACGT CGTCACCGGT GCCACCGACG CCGACGAGGC GGTCGCCCGG GTGCTGCACC GCTGCGGGCT GCCGGTGATC CTGGTGGCGA ACAAGGTCGA CGACGTACGA TTCGAGGCTG ATGCGGCGGC GCTGTGGAGC CTCGGGCTCG GCGAGCCGCA CCCGGTCTCG GCCCTGCACG GCCGGGGCAG TGGAGACCTG CTCGACGCCG TGCTCGCCGC GTTGCCCGAG GCGCCGCGCG AGATCCTCAC CGAGTCCGAC GGCCCACGGC GGGTCGCCCT GATCGGCCGG CCCAACGTCG GCAAGTCGAG TCTGCTGAAC AAGCTCGCCG GCAGCCGGCG GTCGTTGGTG CACGACGTCG CCGGCACCAC CCGGGACCCG GTCGACGAGC TGGTGACGGT CGGCGGCGAG GAGTGGATGT TCATCGACAC CGCCGGGCTG CGCCGTCGCG TCCGGGAGGC CTCCGGCGCC GAGTACTACT CGTCGCTGCG CACCGCCTCG GCCCTGGAAG CCGCCGAGGT CGCCATCGTC CTGCTCGCCG CGGACGAGCC GGTGACCGAG CAGGACCAGC GGGTCATCTC GATGGTCGTC GACGCGGGAC GGGCCCTCGT CCTCGCCTTC AACAAGTGGG ACCTGGTGGA CACCGACCGC CGGCTCACCC TGGAACGGGA GATCGTCCGT GACCTCGGCC GGGTCGCGTG GGCGCCGCGG GTGAACGTCT CGGCCCGTAC CGGGCGTGCG ACCGACCGGC TCGCCCCCGC GCTGCGCACC GCGCTGGAGT CCTGGGGTAC CCGGATCCCG ACCGGCCGGC TGAACACCTG GTTGGGTGAG GTCGTCGCCG CGACCCCGCC GCCAGCACGT GGAGGGCGGG TTCCGAAGGT CCTGTTCGCG ACCCAGGCCG GAGTGCGCCC GCCGCGGTTC GTGGTGTTCA CGACGGGTTT TCTGGAGTCG TCCTACCGTC GATTCCTGGA ACGCCGGCTC CGGGAGGACT TCGGCTTCGC CGGTTCGCCC ATCTCGATCT CGGTGCGGGT GCGTGAACGC GGCGACCGCA GGGGCGGCGG CTGA
Protein sequence | MSGQDLSTDL GAAGFDRFAD GGLGDDVGGP DGFAGELDGE FAGPGIGQPV VAVVGRPNVG KSTLVNRILG RRAAVVEDVP GVTRDRVAYD AVWSGRRFTL VDTGGWEPDA TGLAAQVSEQ ARAALDTADA VLFIVDVVTG ATDADEAVAR VLHRCGLPVI LVANKVDDVR FEADAAALWS LGLGEPHPVS ALHGRGSGDL LDAVLAALPE APREILTESD GPRRVALIGR PNVGKSSLLN KLAGSRRSLV HDVAGTTRDP VDELVTVGGE EWMFIDTAGL RRRVREASGA EYYSSLRTAS ALEAAEVAIV LLAADEPVTE QDQRVISMVV DAGRALVLAF NKWDLVDTDR RLTLEREIVR DLGRVAWAPR VNVSARTGRA TDRLAPALRT ALESWGTRIP TGRLNTWLGE VVAATPPPAR GGRVPKVLFA TQAGVRPPRF VVFTTGFLES SYRRFLERRL REDFGFAGSP ISISVRVRER GDRRGGG
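The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that spacing, compute GC content, and translate the CDS. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it starts with ATG.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGCGGAC AAGACCTATC AACGGACCTC"
print(translate(first_blocks))  # MSGQDLSTDL
```

The output matches the first block of the protein sequence. Passing the full Gene sequence value to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence rows.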