Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5674 |
Symbol | |
ID | 5674001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6887281 |
End bp | 6888702 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244528 |
Product | amidohydrolase |
Protein accession | YP_001509931 |
Protein GI | 158317423 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTGGCG CGGAGCTTGT CGCCACGGTC GATGCCGACC GCCGAGAGAT CCCAGGCGGC TGGGTAGCAA TCACCGACGG CCTCGTCAGC TCCCTCGGCG GCCCGGCGGA GACACCCCCG ACCGCGACCC GGACCCTGCG TGCCGACGGC TGTCTCATAA CGCCGGGCCT GGTGAACACA CATCATCACA TCTACCAGAA CCTCACCCGT TCCTTCGCTC CGGCACTCGG CGGCACCCTT TTCACCTGGC TGACCACGCT CTATCCACTC TGGTCACGGC TGGACGAGGA GGCCGTCCAC ACCTCGGCCT ACGTGGGCCT GACCGAACTG GCGCTCGGCG GCTGCACGAC ATCAACAGAC CACCTCTACG TGCACCCGCG CGGTGGCGGC GATCTCATCT CCGCCGAGAT CGCGGCAGCC CGGACCCTGG GCATGCGGTT CCACCCGACC CGCGGCTCGA TGTCGCTTTC GGTCAAGGAC GGCGGGCTCC CTCCTGACTC TGTCGTCCAG GACGCGGATG AGATCCTCGC CGACTCCGCC CGGCTTGTGG CCCAGCATCA CGACCCGTCC CACGGCGCGA TGGTGCGGAT CGCCCTGGCC CCCTGTTCGC CGTTCTCCGT CAGTCCGGAA CTCATGCGGG CCACCGCGGA ACTGGCCGAG TCCTTGGACG TCCGGCTACA CACGCATCTC GCCGAGGACC CCGAGGAGGA CGACTACTGC CTGGCGGTGT TCGGACGGCG TCCGATCGAC CAGTTCGCGG AGGTTGGCTG GGGCGGCGAC CGGGCGTGGG TCGCGCACTG CATCTGCCCC AATGACGAGG AGGTCGAGCA GCTCGGCAGG TGGGGCACAG GGGTGGCCCA CTGCCCGAGC AGCAACATGA TTCTCGGCGG CGGCCTCGCG CCCGTGGCCG AGCTCCGCTC GGCCGGCGCC CCGGTCGGCC TGGGCTGCGA CGGGTCGTCA TCCGCCGACT CCGCCTCGCT GTGGTTGGAG GCTCGTACCG CCATGCTGTT GGGCCGGCTG CGACACGGCG CCGCGGCGAT GTCCGCCCGG GACGCGCTGG AGATCGCCAC CCGGGGCGGC GCAGGCTGTC TCGGCAGGAC CGGTGAGATC GGTGAGCTCT CCGTCGGGTC TGTCGGCGAC CTCGTCGTCT GGCCGTTGGA CGGGGTCGCG TACGCGGGAG CGCTCTCCGA TCCGATCGAC GCCTGGCTGC GTTGCGGGCC CACAGCGGCC CGGCACACGA TCGTGGCCGG CAGGCTGGTG GTGGAGAACG GAGTGCCGGT CCATCCTGAT CTCGACGAGA TGCTCGTCCG GCACCGCCGG ACCGCCGGCG GCATCCAGGC GGCGTTCGAC GATGCGGGCA TCGATCCGAC CGTTCCCATC AATACCGGCG GCAGCAGCGT CGGGGCGGCA AAATCACTTT GA
|
Protein sequence | MIGAELVATV DADRREIPGG WVAITDGLVS SLGGPAETPP TATRTLRADG CLITPGLVNT HHHIYQNLTR SFAPALGGTL FTWLTTLYPL WSRLDEEAVH TSAYVGLTEL ALGGCTTSTD HLYVHPRGGG DLISAEIAAA RTLGMRFHPT RGSMSLSVKD GGLPPDSVVQ DADEILADSA RLVAQHHDPS HGAMVRIALA PCSPFSVSPE LMRATAELAE SLDVRLHTHL AEDPEEDDYC LAVFGRRPID QFAEVGWGGD RAWVAHCICP NDEEVEQLGR WGTGVAHCPS SNMILGGGLA PVAELRSAGA PVGLGCDGSS SADSASLWLE ARTAMLLGRL RHGAAAMSAR DALEIATRGG AGCLGRTGEI GELSVGSVGD LVVWPLDGVA YAGALSDPID AWLRCGPTAA RHTIVAGRLV VENGVPVHPD LDEMLVRHRR TAGGIQAAFD DAGIDPTVPI NTGGSSVGAA KSL
|
| |