Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6289 |
Symbol | |
ID | 5674608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7638331 |
End bp | 7639308 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245141 |
Product | hypothetical protein |
Protein accession | YP_001510537 |
Protein GI | 158318029 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGC CGCTCTCGCC GCGGCGGGTG TTGGTAACCG GGGTCGGTGC GGCGCCGGGC CTTCAGACCG CCCGCGCCCT GCTGAAGCGG GGCTGCGAGG TGGTCGCGGT GGACGCCGAT CCGCTCGCGT TCGGCCTACG GCTGCCCGGC GTGCTCGCAC GGACGATGCC GCCCGTTGCC GATCCCGGCT ACCACGACGC ACTGGTGGCG CTGTGCGACC AGGACAGGCC CGACGCTCTG ATCGGGGGTG TCGAACAGGA GATCCCTACG CTGATCGACC TGGCCGAGGA GCTGGCAAGG CTCGGAGTGG CGACCTGGCT GCCGCCCTTG CCCGCCACGC GGGCGTGCCT GAACAAGGCC CATTTCCACG AGGTGATGAC GGCCGCGGGC CTCCCGGTTC CGGCGACCTG GCTGCCCGAC CGGCTCGCCG AGATCCCGTC CGGTCTGCCG CTGCTGGTCA AGCCGCGCGG CGGCCAGGGT GGGCAGGGCG TGATCCGCTG CTCTACCGCC GCTCAGGCGC GTGTGCTGTG CGAGCTGGTG ACCGGGCCGA TCGTCCAGGA ACGCCTGGCC GGCTGGGAGT TCACCGCGGA CTGCCTCACC GACCCTGCCG GCCGGTCCTC GGTGATCCTG CGGCACCGCC AGATCGTCAA GGGCGGGCTC GCGGTCGTCG CCACCACCTT CCACCACCCG GCCGCCACCG ACCTCGTCAC CCGCGCGCTG GCGGCGCTCG AGATGGAGGG CGTGTGCTGT GTGCAGGGGT TCATCGACGA CGGCGGCCGG GTGGTGCTGA CTGAGGCGAA CGCTCGCCTG GCCGGGGCGT TCCCCGTCAG CGAGGCCGCC GGAGCCGACC TGCTCGGCCA GTACCTCGCG GCCCTCGCAG GCCGCCCTGT CGACCACGGC CGGCTCACCT ACAAGGCCGG GGTCCGGCTC ACCACCGCCC CGGCCACCCT CGCGATCGAG GAGACCGACA CCCCATGA
|
Protein sequence | MSEPLSPRRV LVTGVGAAPG LQTARALLKR GCEVVAVDAD PLAFGLRLPG VLARTMPPVA DPGYHDALVA LCDQDRPDAL IGGVEQEIPT LIDLAEELAR LGVATWLPPL PATRACLNKA HFHEVMTAAG LPVPATWLPD RLAEIPSGLP LLVKPRGGQG GQGVIRCSTA AQARVLCELV TGPIVQERLA GWEFTADCLT DPAGRSSVIL RHRQIVKGGL AVVATTFHHP AATDLVTRAL AALEMEGVCC VQGFIDDGGR VVLTEANARL AGAFPVSEAA GADLLGQYLA ALAGRPVDHG RLTYKAGVRL TTAPATLAIE ETDTP
|
| |