Gene Information |
Locus tag | Franean1_4506 |
Symbol | |
ID | 5672855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5376873 |
End bp | 5378099 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641243371 |
Product | arginine biosynthesis bifunctional protein ArgJ |
Protein accession | YP_001508787 |
Protein GI | 158316279 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
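The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError(f"CDS length {gene_len} is not a multiple of 3")
    # Total codons minus the stop codon gives the number of amino acids.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(5376873, 5378099)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1227 bp) and Protein Length (408 aa) fields.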
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.49831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGAGA TCTCACGGTT CGCCCCGGAT GCGTTTCCGG ATATCAGCCC CGTCGCTGGC GTCACCAGTG CCACGGCATC GTGCGGCCTG AAGTCCGGCG ATGGCCCGGA CCTACTCCTG ATGGCAGTAC GGCCGGGAAC GGTGGTCGCC GGCGTGTTCA CGCAGTCCAC CGTCACATCT CCCGCCGTGC AGCGCTGCCA GCGCAGCATG CGTGAGGGGC GCGCACGGGC GCTGGTGGTG CACGCGGGCA ACGCGAACGC CCTCACCGGC GCCCAGGGCA GGCTGCTCGT GGAACGCATC TGTGAAAGCG TCGCCGTGAG CCTGGACTGC CCAGCCGGGG AAATTGTTAC CGCTGGCACG GGTATCATCG GGCAGCGCAT CAGCGATGAG CAGCTCCTCG AGCCACTGCC CTCTCTTGTG GACGCCCAAG ACGCCGTCGA CTGGCGTCAG GCCGCGGAGG CTATCCGCAC GACCGACACC TTTCCCAAGG CCAGTTCTCG CAGCTTCATG ATAGGCGACC AGCCTATCGT CGTCAGTGGC ATCGCTAAGG GATCCGGGAT GATCGCACCC GATATGGCCA CAATGCTCTG CTTCATGTTC ACCAACGCGA CAGTGGATCC CGCCGAGCTT CACCGCCTGC TGATCTACGC GAACTCGGGC AGTTTCCGGC GCATCACGGT CGATGGGGAT GAGTCGACGA GCGATACGAT GCTTGCCTTC GCGACCAACA ACACCGACCT CGCAGCCGAC GGTGACCGCG ATGCGGCCGT CGTCCAGCTC CGCGATGCCT TCGCCGAGGT CGCCCATGAC CTGGCCATGC AGGTCGTAGG CGACGGCGAA GGGATCTCGA AACTCATCAC GGTGACCGTG CGGGGCGCGC GGACCGACGA CGACGCCCAC GCAATGGCGA AGTCCATCGC GGAGTCCCCA CTGGTGAAGA CCGCGATCGG CGGTGGCCAT CCGAACTGGG GACGTATCGC GATGGCGATC GGGAAATCCC GCCGCCCCGT CCAGCAGGAG CGTCTCACCG TGTGGCTCGG CAAAAACAAG GTCATGATCA ACGGAGGTCA GAACGACGGT CTGGACCAGA CTACACTAGC CGCCTACATG AAATCAGACT CCATCGACAT CACCGTCGAC GTCGGGATGG GCCACGAAAA TAGCACCATG TGGACCTGCG ATCTCACGAA GGAATACATC GACATCAATG CGCACTACAT GACTTAA
|
Protein sequence | MLEISRFAPD AFPDISPVAG VTSATASCGL KSGDGPDLLL MAVRPGTVVA GVFTQSTVTS PAVQRCQRSM REGRARALVV HAGNANALTG AQGRLLVERI CESVAVSLDC PAGEIVTAGT GIIGQRISDE QLLEPLPSLV DAQDAVDWRQ AAEAIRTTDT FPKASSRSFM IGDQPIVVSG IAKGSGMIAP DMATMLCFMF TNATVDPAEL HRLLIYANSG SFRRITVDGD ESTSDTMLAF ATNNTDLAAD GDRDAAVVQL RDAFAEVAHD LAMQVVGDGE GISKLITVTV RGARTDDDAH AMAKSIAESP LVKTAIGGGH PNWGRIAMAI GKSRRPVQQE RLTVWLGKNK VMINGGQNDG LDQTTLAAYM KSDSIDITVD VGMGHENSTM WTCDLTKEYI DINAHYMT
|
| |