Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6919 |
Symbol | |
ID | 5675232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8427155 |
End bp | 8428345 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245768 |
Product | amidohydrolase 2 |
Protein accession | YP_001511159 |
Protein GI | 158318651 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.861609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTCACG CCGGTATCTC GGTGTTCGAC GCGGACAACC ATCTGTATGA GACGAAGGAG GCGCTGACGA AGTACCTGCC CGCCCGGTAC AAGGGGGCGG TCGACTACGT CGAGCTCAAC GGGCGCACGA AGATTATGGT CCGGGGCCAG GTGAGCGAGT ACATCCCGAA CCCCACGTTC GAGGTCGTGG CCCGGCCCGG TGCGCAGGAG GACTACTACC GGAAGGGGAA CCCCGAGGGG CTGTCCCGCC GGGAGATCTT CGGCAAGCCG GTGAAGTGCA TCGACGCGTG GCGCGAGCCC GCCGCCCGGC TTGCGAAAAT GGACGAGCAG GGCCTCGACC GCACACTGAT GTTCCCGACG CTGGCCAGCC TCATCGAGGA GCGGATGCGG GACGACCCGG ACCTGATCCA CGCGGTCATC CACTCCCTCA ACGAGTGGTT GTACGAGACC TGGCAGTTCA ACTACGAGGG GCTGGACCGG ATTTTCACGA CTCCGGTGAT CACCCTGCCG TTCGTGGACA AGGCGATCGA GGAACTGGAG TGGGTCCTCG AGCGGGGCGC CAAGGTCGTG CTGATCCGTC CGGCGCCGGT GCCCGGGCTC CGCGGCCCTC GCTCGTTCGG CCTGCCCGAG TTCGACCCGT TCTGGGCGCG GGTGCAGGAG GCCGGCATCC TGGTCGCGAT GCACTCCTCG GACAGCGGCT ACGCCCGTTA CACGAGTGAG TGGATGGGCG CGACCACCGA GATGCTCCCC TTCCAGCCGA ACACCTTCCG CATGCTGCAG GCCTGGCGCC CGGTCGAGGA CGCCGTCTCG GCGCTGGTGT GCCACGGTGC GCTGTCCCGT TTCCCGGGGC TGAAGATCGC CATCGTCGAG AACGGTATGA GCTGGGTCGA GCCGCTGCTC AAGTCCATGA AGAACCTCTA CAAGAAGATG CCGCACGACT TCCTGGAAAA CCCGGTCGAC GTGCTCAAGC GGAACATCTA CGTGAGCCCG TTCTGGGAGG AGGACCTCGG CGAACTGGCC CAACTCCTCG GCGAGGACCA CGTGCTGTTC GGCTCGGACT ACCCGCACCC GGAGGGCCTG GCCAACCCGG TGAGTTACAT CGACGAGCTG TCGCACCTGC CGGAGGAGCT CGTCCGCAAG ATCATGGGTG GCAACCTCGC CCAGCTCATG GGCATCGGCG TTCCCGCCTA G
|
Protein sequence | MAHAGISVFD ADNHLYETKE ALTKYLPARY KGAVDYVELN GRTKIMVRGQ VSEYIPNPTF EVVARPGAQE DYYRKGNPEG LSRREIFGKP VKCIDAWREP AARLAKMDEQ GLDRTLMFPT LASLIEERMR DDPDLIHAVI HSLNEWLYET WQFNYEGLDR IFTTPVITLP FVDKAIEELE WVLERGAKVV LIRPAPVPGL RGPRSFGLPE FDPFWARVQE AGILVAMHSS DSGYARYTSE WMGATTEMLP FQPNTFRMLQ AWRPVEDAVS ALVCHGALSR FPGLKIAIVE NGMSWVEPLL KSMKNLYKKM PHDFLENPVD VLKRNIYVSP FWEEDLGELA QLLGEDHVLF GSDYPHPEGL ANPVSYIDEL SHLPEELVRK IMGGNLAQLM GIGVPA
|
| |