Gene Information |
Locus tag | Franean1_2127 |
Symbol | |
ID | 5670527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2553997 |
End bp | 2555337 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241048 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_001506469 |
Protein GI | 158313961 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG; [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
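The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only illustrative: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon, which is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2553997
END_BP = 2555337
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 2553997 to 2555337 gives 1341 bp, and 1341 bp corresponds to 447 codons, i.e. 446 residues plus a stop codon, which matches the listed values.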
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCCAA AGTATCGCCG TTGCGATCGA CGTAGGTACG TGAATCGCGC CGATACGCTC GGCTATGTGG TCAGTGAGTG GTATGACCCT GCCGACCTGG CGCGACGGGT CACAGAGCAG GACAAGGCTA CACCGGGCGA ACGCACCCCG TTCGAGCGGG ATCGTGCCCG GGTGCTCCAT TCGAGCGCCT TGCGGCGACT TGCCGGAAAA ACCCAGGTCG TGGGCCCGCT CGACGACGAC TTCCCGCGCA CCCGGCTGAC CCACTCGCTC GAGACGGCGC AGATCGGGCG CGGTCTGGCC CGCTCGCTGG GCGCCGATCC CGACCTGGTG GACGCCGCCT GCCTCGCCCA CGACATCGGG CATCCCCCGT TCGGTCACAA CGGAGAGGTG GCGCTCGACC AGGCGGCGCA CACCTGCGGC GGGTTCGAGG GCAACGCCCA GAGCCTGCGC GAGCTGACCC GGCTCGAGGT GAAGATCGTC GCGCCCGCCG GGGAGACCGG CGGCGGCGCG GCCGGCGGCG GCGCGGGGCT GAACCTGACC CGGGCGACCC TGGACGCCGC CGTGAAGTAC CCGTGGCTGC GCCGCGCCGG CACGCCGAAG TTCGGGGCCT ACGCCGACGA CGCCGGCATC CTCTCCTGGG TGCGCCGCGA CGCCCCCGGC GCGCGGCGCA GCTTCGAGGC CCAGCTCATG GACTGGGCGG ACGACGTGGC CTACTCCGTG CACGACCTGG AGGACGGCGT CGTCGCCGGG CACATCGACC TCGCCGCGCT GCGTGACCCC GAGCTGCGTG CCGAGCTGGC CGCCCGGACG GCGGCCTGGT ACCCGGACGT CGACGCGCCC GCCGCGGCGG CCGGGCTCGA CCGGCTGCGC GCCCAGCCGT GGTGGATCCG GGAGGAGGTC GGGTCCGTCG CCGGCCTGGC CGCGCTGCGC GCGATGACCA GCGAACTCGT CGCCCGGTTC TCGATGGCCG CGGTGCGGGC CACCCGCGAG CGGCACGGAG ATGAGCCGCT GCGCCGCTAC CGGGCCGACC TGGTCGTGCC GGTGGAGACG CTCGCCGAGT GCGCGGCGCT CAAGGGCGTC ACCGCGTGGT ACGTGATGGG CCGGCCCGGC GCCGCCGAGC GGCGGGCCCG GCAGCGCGAG CTGATCGCCG AGCTGGTCGA CCTGCTCGCG GCCAGCGCCC CGGCGTCGTT GGACGCGCCG CTGGCCGACT CCTACCGGCA CGCGGCCGAC GACGCGGCCC GGCTGCGGGT GGTCATCGAC CAGGTCGCCC GGCTCACCGA CGCCAGCGCC GGGCGCCGGC ACGCGGCACT GACGGGCCGC CCGCTGCCGA GGGCGCTGTG A
|
Protein sequence | MRPKYRRCDR RRYVNRADTL GYVVSEWYDP ADLARRVTEQ DKATPGERTP FERDRARVLH SSALRRLAGK TQVVGPLDDD FPRTRLTHSL ETAQIGRGLA RSLGADPDLV DAACLAHDIG HPPFGHNGEV ALDQAAHTCG GFEGNAQSLR ELTRLEVKIV APAGETGGGA AGGGAGLNLT RATLDAAVKY PWLRRAGTPK FGAYADDAGI LSWVRRDAPG ARRSFEAQLM DWADDVAYSV HDLEDGVVAG HIDLAALRDP ELRAELAART AAWYPDVDAP AAAAGLDRLR AQPWWIREEV GSVAGLAALR AMTSELVARF SMAAVRATRE RHGDEPLRRY RADLVVPVET LAECAALKGV TAWYVMGRPG AAERRARQRE LIAELVDLLA ASAPASLDAP LADSYRHAAD DAARLRVVID QVARLTDASA GRRHAALTGR PLPRAL