Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4742 |
Symbol | |
ID | 5673084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5663504 |
End bp | 5664694 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243599 |
Product | amidohydrolase 2 |
Protein accession | YP_001509015 |
Protein GI | 158316507 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.65443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCA ACGACATCCC GGTGTTCGAC GCCGACAACC ACCTGTACGA GACCAAGGAC GCGCTGACGA AGTTCCTCCC GGCCCGCTAC AAGGGCGCCG TCGACTACGT CGACGTCCAC GGCCGGACGA AGATCGTGGT CCGGGGGCAG ATCAGCGACT ACATCCCGAA TCCGACCTTC GACGTGGTCG CCCGCCCGGG CGCGCAGGAG GACTACTACC GGCACGGCAA CCCCGAGGGC AAGTCCTACC GGGAGATCTT CGGCGCGCCG GTGCGCTGCA TCGACGCCTG GCGGGAGCCG GCCGCCCGGC TGAAGCTCAT GGACGAGCAG GGCCTGGACC GCACCCTGAT GTTCCCGACG CTGGCCAGCC TCATCGAGGA GCGGATGCGG GACGACCCGG ACCTGGTCCA TGCCGTCATC CACTCGCTCA ACGAGTGGCT GTACGAGACC TGGCAGTTCA ACTACGAGGG CCTTGACCGC ATCTTCACCA CCCCGGTGAT CTCCCTGCCG ATCGTGGAGA AGGCGATCGA GGAACTGGAG TGGGTGCTGG AGCGGGGTGC CCGGGTGATC CTCATCCGTC CGGCACCGGT ACCCGGCCTG CGCGGCGCGC GCTCGTTCGG GCTGCCGGAG TTCGACCCGT TCTGGGAGCG GGTGCAGGAG GCCGACATCC TGGTCACGCT GCACTCGTCG GACAGCGGCT ACGACCGGTA CTACAGCGAC TGGACGGGCA CCAGCTCCGA GATGCTGCCG TTCAAGCCCA GCGCGTTCCG GATGCTGCAG GCCTGGCGCC CGGTGGAGGA CGCGGTGGCG GCGCTGGTCT GCCACGGCGC GCTGTCCCGG TTCCCCCGGC TGAAGATCGC CGTGGTCGAG AACGGCAGCA GCTGGGTCGC ACCGCTGCTG CTGTCCATCA GGGACCTCTA CAAGAAGCTG CCGCAGGACT TCGCGGAGGA CCCGATCGAG GTCATCAAGC GGAACATCAA CATCAGCCCG TTCTGGGAGG AGGACCTGGG CGCGCTCGCC GAGCTCATCG GGGAGGACCG GGTGCTGTTC GGCTCCGACT ACCCGCATCC CGAGGGGCTT GCGAGCCCGC TGACCTACCT CGACGAGCTC AAGCACCTGC CCGAGGCGAC CACCCGCAAG ATCATGGGCG GCAACCTCGC TCGGCTGATG AACATCTCGG TCCCGGCCTA G
|
Protein sequence | MPTNDIPVFD ADNHLYETKD ALTKFLPARY KGAVDYVDVH GRTKIVVRGQ ISDYIPNPTF DVVARPGAQE DYYRHGNPEG KSYREIFGAP VRCIDAWREP AARLKLMDEQ GLDRTLMFPT LASLIEERMR DDPDLVHAVI HSLNEWLYET WQFNYEGLDR IFTTPVISLP IVEKAIEELE WVLERGARVI LIRPAPVPGL RGARSFGLPE FDPFWERVQE ADILVTLHSS DSGYDRYYSD WTGTSSEMLP FKPSAFRMLQ AWRPVEDAVA ALVCHGALSR FPRLKIAVVE NGSSWVAPLL LSIRDLYKKL PQDFAEDPIE VIKRNINISP FWEEDLGALA ELIGEDRVLF GSDYPHPEGL ASPLTYLDEL KHLPEATTRK IMGGNLARLM NISVPA
|
| |