Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3401 |
Symbol | |
ID | 5671772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4032563 |
End bp | 4033651 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242289 |
Product | hydrogenase expression/formation protein HypD |
Protein accession | YP_001507709 |
Protein GI | 158315201 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0409] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00075] hydrogenase expression/formation protein HypD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATACC TCGACGAGTA CCGCGACCCC GCCCTCGCGC GGCAGCTCCT CGACGAGATC CACACGACGG CCACCCGGCC CTGGACCCTG ATGGAGGTCT GCGGCGGCCA GACGCACACC ATCGTCCGCC AGGGCATCGA CAACCTGCTG CCGGCGGGCC TGCGGATGAT CCACGGGCCG GGCTGCCCGG TGTGCGTGAC CCCGCTCGAA CTCATCGACA AGGCCCTGGC CATCGCCGCC CGTCCCGAGG TCATCTTCAC CTCCTACGGC GACATGCTGC GGGTGCCCGG AACGGGGACC GACCTGTTGG CCCTGCGCGC CCGCGGCTCC GACGTCCGCG TCGTCTACTC CCCGCTGGAC GCGGTACGCC TGGCCGAACA GCACCCGGAC CGCCAGGTGG TCTTCTTCGC GGTCGGCTTC GAGACGACCG CGCCGGCGAA CGCCATGGCG GTGCTGCGCG CCCACCAGCT CGGCCTGCCC AACTTCAGCA TCCTGGTCAG CCACGTCCTC GTCCCACCGG CTATGACGGC GCTCCTCGAC GCGCCCGACC GCCAGGTCCA GGGGTTCCTC GCCGCCGGCC ACGTCTGCGC CGTCATGGGC TGGACGGAGT ACGAGCCCAT CGCGCACCGT TACCAGGTGC CCGTCGTCGT GACCGGCTTC GAGCCGCTCG ACCTGCTCGA GGGCATCCTG ATGGCCGTCC GCCAGCTCGA GGCGGGCCAC GCGCGGGTGG AGAACCAGTA CGCCCGCGCC GTCCACCGCG ACGGCAACAG CCGGGCGCGG GAGGCCATCC GCCGCGTGTT CCGGGTGCGG GACCGCGCCT GGCGCGGCAT CGGCACCATC CCGGACAGCG GCCTGGCCCT CACCGACGAG TTCGCCCGCT ACGACGCCGA GACCCGCTTC GCCGTCTCCG GGCTGACCGC CCGGGAGCAT CCCGCCTGCA TCGCCGGCGA CATCCTCACC GGCGCCCGCG AGCCGACCGA CTGCACCGCC TATGGGACGG CCTGCACGCC ACGCACCCCG CTCGGCGCGC CGATGGTCTC CACCGAGGGC ACCTGCGCCG CCTACCACTC CGCCGGGAGG GCGTCGTGA
|
Protein sequence | MRYLDEYRDP ALARQLLDEI HTTATRPWTL MEVCGGQTHT IVRQGIDNLL PAGLRMIHGP GCPVCVTPLE LIDKALAIAA RPEVIFTSYG DMLRVPGTGT DLLALRARGS DVRVVYSPLD AVRLAEQHPD RQVVFFAVGF ETTAPANAMA VLRAHQLGLP NFSILVSHVL VPPAMTALLD APDRQVQGFL AAGHVCAVMG WTEYEPIAHR YQVPVVVTGF EPLDLLEGIL MAVRQLEAGH ARVENQYARA VHRDGNSRAR EAIRRVFRVR DRAWRGIGTI PDSGLALTDE FARYDAETRF AVSGLTAREH PACIAGDILT GAREPTDCTA YGTACTPRTP LGAPMVSTEG TCAAYHSAGR AS
|
| |