Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4532 |
Symbol | |
ID | 5672881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5407738 |
End bp | 5408919 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243397 |
Product | hypothetical protein |
Protein accession | YP_001508813 |
Protein GI | 158316305 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2030] Acyl dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.177036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CACACGAGGA AAAGACCGAC GGCATCGCGA GAGAACTGGC GCTGGGCAAA TTCACCGACG AGTTGCTGGA GGAAATGCGG GCGCTGATCG GGACGCAGCT GCGCACCGAC GCCTGCGTCA ACAACGAGTA CGCGACCCGC CTCGCCATCC TGCGCTTCGC CGAGGGGATA GGTGACGACA ACCCGCTGTG GACCGACGCC GACTATGCGG CGGGGACGCG GCACGGCGGC ATCGTCGCTC CCCCCAGCTT CGTCTTCGCC TGCCTCGGCT CCGTCCAGGT GGGGTGGCGC GGCCTGGGCG GGTTCCATGC CGAGACGACC ATGACCTTCG AACGGCCGAT CCGACTCGAC GACAAGATCA CAGCCACCGT GGTGTTCGAC GGGTTCGACG GGCCGACCGA CAGCAACTTC GGCGGCCGCC GCATCAAGGA CTACCTGCGC CAGGAGTACC GCAACCAGCA CGGCGAGCTG GTCGCGACCT TCATCTGCTC CCGGATGCGC TTCGAGCGCG GCGAGATGCA GAAGCGGCGC GACTCCCGCA AGATCGAGCT GCCCCACCCG TGGACGGACA GCGAGCTCAC CGCGATCGAG GCGGACGTCC TCGCCGAACG CCCCCGCGGC GCGGAGCCGC GCTACTGGGA CGACGTTGCC GTCGGTGACG AGATCGACGT CATCACCAAG GGCCCGATCG GGCTGACCGA CTTCATCGCC TACATCGCCG CGGGTGCCGC TCCCATTCCC CGGCTGTCCG CGCACGGGGT CGCGCTGCGG CGCTACCACA AGCACCCGAA ATGGGCGTTC CGCGATCCGA ACACACACGC CCTCGAACCG GTCTACTCGG TGCACTACAA CGACTACGCG GCCCGCCTGC AGGGCGCCCA GATCGCCTAC GACGTCGGCA TCCAGCGCAC CTGCTGGCAG ATCCACTCCC TGACCAGCTG GATGGGCGAC GACGGCACGC TCAAGGCCCT GCACGGCCAG TACCGCAGCC ACGTGTACCT CTCGGACGTG GTGCGACTGG GTGGCCGCGT GGTGGCCAAG GAGATCGACG CCGACGGCGA CCATGTCGTG CGCGTGGAGA CCTGGGCGAC CAACCAACGG GACACGAACG TCATGCCCGG ATCCGCCGTC ATCGCGCTCC CGAGCCGGAA GGAGGCACCG GCCGACCGGT GA
|
Protein sequence | MTTTHEEKTD GIARELALGK FTDELLEEMR ALIGTQLRTD ACVNNEYATR LAILRFAEGI GDDNPLWTDA DYAAGTRHGG IVAPPSFVFA CLGSVQVGWR GLGGFHAETT MTFERPIRLD DKITATVVFD GFDGPTDSNF GGRRIKDYLR QEYRNQHGEL VATFICSRMR FERGEMQKRR DSRKIELPHP WTDSELTAIE ADVLAERPRG AEPRYWDDVA VGDEIDVITK GPIGLTDFIA YIAAGAAPIP RLSAHGVALR RYHKHPKWAF RDPNTHALEP VYSVHYNDYA ARLQGAQIAY DVGIQRTCWQ IHSLTSWMGD DGTLKALHGQ YRSHVYLSDV VRLGGRVVAK EIDADGDHVV RVETWATNQR DTNVMPGSAV IALPSRKEAP ADR
|
| |