Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4258 |
Symbol | |
ID | 5672613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5080178 |
End bp | 5081509 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243131 |
Product | HNH endonuclease |
Protein accession | YP_001508548 |
Protein GI | 158316040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.137511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.228597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACCG CGCGGAGTGT GCAGGTCGAC GGGGACGGTT CCAGTCCGGT TGACGTGCCG GAGCAGGAGA GAACGTTCGA GGGGCGGGTG CGTGGGCTGC TGGTGCGGAT CGGTGCGGCG GTCCGGTCGA TAGCGGCGGG GAACGCGGAC CTGCTGGGGC TGTTGGCGCA GTTCGCCGAC CTGCGGCCCC CGGCGGCCGG CCGGGAGGTT CTGTCCGACG AGTTCGCTCC GGAGGATGTC GCAGCGGTGC GGGGGGTGTC CCCGCAGGCC GCGGCGAGTC AGATGCTGTT CGCGTGCACG GTGGCGCGCC GGCTGCCCGC CGCGGTGGAG GCGTTGAAGG CCGGGGTGCT GGACGTGCAG CGGCCGCGTT CGTTGGAGAA CGCGGTCCGT CCGCTCGACG GTCCGCTCGC GGCGCAGGTC GAGGCCCGTG TGCTGGCCGG GGGTGCGCGG CCGACCCGGG GGGCGTTCAC GGATGCGTGC CGTCGTGCCG TGCACACGGT GGACCCGGCT GGGGCGGCCG AACGCGCGCG GGCCCGGAAG AAGGAACGGC GGGTGTGGGT CTCGCCGGGG GAGGACGGGA CGAGTTGCCT GTCGGCGGTG CTGCCCGCCG ACGAGGCGAC CGCCTGCTAC CAGCGGGTCG ACCAGATCGC CCAGGGAATC GCCGCGCACC GAGGCGGCGG GGACACCCGC AGCCGTGACC AGATCCGCGC GGACGTCCTC GTCGATCTCC TGTGTGGGCG GGTCGCGCAT GCGGTGCCGC TGCCGTGTGA GGTGCAGGTC GTGGTGCCGG TGACGGTGCT GCTGGGGTTG GCGGAGGATC CCGGGGAGAT TCCCGGGTAC GGGCCGGTTC CCGCCGCGGT GGCCCGGGAG ATGGCCGCAC GGCCGGGGTC GACATGGCGG CGGATCCTCG CCGACCCTCA GGGCACGCTC GTCGAGATCG CGGACCGGCG CCTACCGACC GCGGCCCAGG CCCGGCACGT GCGGGCACGG AACCGTAGCT GTGTCTTCCC GGGCTGCGCC CGTACGTCGC GACGCGCGGA CATCGACCAC ACGGTGGCAC ACGTGAGCGG CGGGCCGACG CTCACCCGGA ACCTCGGGCC GATATGCCGC AAGCACCACC GCATGAAGCA CTCCGGCCGT TGGCGGCTGA CACAACCGCG GGAAGGAACG TTCGTCTGGA CGGGTCCGTT CGGGGCGACG CTCGTCACCC ACCCACATTC ATACATCGAA CCGCAGAACA AGGCCGGTAC GACGGGAGGG GGTGGTGATG AACCGTCGGG CAACACCGCC TCGGGGTGGA AAATACCCCA CGACACACAG CCACCCTTCT AA
|
Protein sequence | MNTARSVQVD GDGSSPVDVP EQERTFEGRV RGLLVRIGAA VRSIAAGNAD LLGLLAQFAD LRPPAAGREV LSDEFAPEDV AAVRGVSPQA AASQMLFACT VARRLPAAVE ALKAGVLDVQ RPRSLENAVR PLDGPLAAQV EARVLAGGAR PTRGAFTDAC RRAVHTVDPA GAAERARARK KERRVWVSPG EDGTSCLSAV LPADEATACY QRVDQIAQGI AAHRGGGDTR SRDQIRADVL VDLLCGRVAH AVPLPCEVQV VVPVTVLLGL AEDPGEIPGY GPVPAAVARE MAARPGSTWR RILADPQGTL VEIADRRLPT AAQARHVRAR NRSCVFPGCA RTSRRADIDH TVAHVSGGPT LTRNLGPICR KHHRMKHSGR WRLTQPREGT FVWTGPFGAT LVTHPHSYIE PQNKAGTTGG GGDEPSGNTA SGWKIPHDTQ PPF
|
| |