Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5671 |
Symbol | |
ID | 5673998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6884321 |
End bp | 6885430 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244525 |
Product | putative agmatinase |
Protein accession | YP_001509928 |
Protein GI | 158317420 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01227] formimidoylglutamase [TIGR01230] agmatinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCATGG CACAGGCCGG CGATCGGGAC GTGGCACGCG AGCGGGGGAG CGCCGGTCCG GTGCCGCCGT ACGCGGCGCG GGCCGCGGGA GCGGACGATC CCGGCGCGGA CGCCCCGGGA TACTCCGGCC TGGCGACCTT CGCCGGTCTG CCCTGGATGC CGGGACTCGG CGATCTTCGG GCGCGCCGGC CCGATGTCGC TGTGGTCGGG GCTCCGTTCG ACATCGCGAC CACGCACCGG CCGGGCGCGC GTTTCGGCCC GCGGGCGCTG CGCGCTCAGG CGTACAACCC TGGCACATAT CATCTTGATC TTGGTATAGA GATCTTTGAC TGGCTGGACG TCGTTGACGC GGGGGACGCG CACTGCCCGC ACGGGCTGAC CGAGGTCTCA CATCGCAACA TCCGGGCGAA GGTCGGCGAC GTCGCGCGGC TCGGCGTCAT CCCAGTGATC ATCGGGGGGG ACCACTCGAT CACCTGGCCG GCGGCTAGCG GGGTCGCCGA GGCGGTGGGC TGGGGTGAGG TCGGCCTGCT GCACTTCGAC GCCCACGCCG ACACGGCCGA CATCATCGAC GGGAACCTGG CCTCGCACGG GACGCCTATG CGGCGGCTCA TCGAGTCGGG GGCGGTGCGC GGGCGCAACT TCGTCCAGGT GGGGCTGCGC GGGTACTGGC CTCCGCCGGA CGTGTTCGCG TGGATGCGCG AGCAGGGTAT GCGCTGGCAC CTGATGCACG AGATCTGGGA GCGGGGGAGC CGAGAGGTGG TCGCCGAGGC GATCGCGCAG GCGGTGGACG GCTGCCGCGC GCTCTACCTG TCGGTCGACA TCGACGTCCT CGACCCGGGG TTCGCGCCTG GGACGGGCAC CCCCGAGCCG GGCGGCATGA ACCCGGCCGA CCTGTTGCGG GCCGTGCGGC AGATCGCGCT GGACACGCCG ATCGTCGCCG CGGACATCGT CGAGGTCTCG CCTCCGTACG ACCACGCGGA GACGACGGTG AACAGCGCGC ACCGGGTCGC GATGGAGATT TTCGCGGCGT TGGCGCATCG CCGCCGTAGC GCGGCCGGTG GGACGGCGGA CCTTCCCGCG GGGCTCCCGA AGGCGAAAGC TGGGTCTTGA
|
Protein sequence | MCMAQAGDRD VARERGSAGP VPPYAARAAG ADDPGADAPG YSGLATFAGL PWMPGLGDLR ARRPDVAVVG APFDIATTHR PGARFGPRAL RAQAYNPGTY HLDLGIEIFD WLDVVDAGDA HCPHGLTEVS HRNIRAKVGD VARLGVIPVI IGGDHSITWP AASGVAEAVG WGEVGLLHFD AHADTADIID GNLASHGTPM RRLIESGAVR GRNFVQVGLR GYWPPPDVFA WMREQGMRWH LMHEIWERGS REVVAEAIAQ AVDGCRALYL SVDIDVLDPG FAPGTGTPEP GGMNPADLLR AVRQIALDTP IVAADIVEVS PPYDHAETTV NSAHRVAMEI FAALAHRRRS AAGGTADLPA GLPKAKAGS
|
| |