Gene Information |
Locus tag | Franean1_3403 |
Symbol | |
ID | 5671774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4034026 |
End bp | 4035228 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242291 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_001507711 |
Protein GI | 158315203 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA); [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
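The derived fields in the table above (Gene Length, Protein Length, GC content) follow from the coordinates and the sequence by simple arithmetic. The sketch below shows one way to recompute them. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(4034026, 4035228)
    print(length, protein_length(length))  # prints "1203 400"
```

With the coordinates listed for Franean1_3403 this reproduces the reported 1203 bp and 400 aa. Passing the Gene sequence field to `gc_percent` should give a value that rounds to the reported GC content.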
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.643066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCG AATCCGGCCG GCCTTTGTCC GCGCGGCTGG CGGACGCGGG GGTCGACCGG CGGTCGTTCC TGAAATTCTG CGCGGGTATC ACCGCGACGC TCGCCCTGCC GGCCCAGTTG GCGCCCCGGG TCGCCTATGC GCTGGACAAG GTTGACCGTC CCACCGTGAT CTGGCTGGAA TTCCAGGACT GCGCCGGTGA CAGCGAGGCT TTCCTGCGTT CGCGCAACCC GTCGGCGGCG GACCTGATTC TGGGGATGAT CTCGCTCGAC TACCACGAGA CGGTGATGGC GGCCTCCGGC ACAGCCGCGG AGAAGGCCCG CGACGACGCG GTGGCGAAAG GCGGGCATCT CGTCGTCGTC GAGGGCTCCA TTCCGACCGG AATTCCCGGG GCGTGCACGA TCGGCGGCAA GTCGGCCGAG GACATCCTGC GTGAGGCGGT CCGTGGCGCG GCCGGCATCA TCAACGTCGG CACCTGCTCG GCCTTCGGTG GGATCCCGGC GGCCGGGCCG AACCCGACCG GGGCGGTCCG GGTCGAGGAC GTCGTCGGCG GGGTTCCGGT TGTGAACCTG ACCGGTTGCC CGGTCAACGC GGACAACCTG ACCGCCACCA TCGTCCACCA CCTGACCTTC GGTGAGTTCC CGGCGCGGGA CAGGTTCAGC CGGCCGCTGT TCGCCTACGG GCAGCGCATC CACGACACCT GCCAGCGGCG CGGCCACTTC GACGCCGGCG AGTTCGCCGA GCAGTGGGGC GACGCGGGCC ACCGCAACGG CTGGTGCCTG TACCGGCTGG GCTGCAAGGG GCCGAGCACC TTCCACAACT GTCCCAGCGT CCGGTTCAAC GGGGGGACGT CCTGGCCGGT GGCGGCCGGG CACGGCTGTG TGGGCTGCTC CGAGCCGGGC TTCTGGGACA CCATGTCGCC CTTCTACGAC CGGCTGCCGC ACGTGGCGAC CTCCGGTTTC GACCTCACCG CCGACAGGAT CGGCGTCGGC GTGCTGGCGG CCACGGCGAC CGGCTTCGCG GCGCACGGCG TCGGCAAGGT CGTCCAGCAC CGGGTCGCCG CCCGCCGCGA GAAGCAGGCC GCGCGGGACC TGGTGGAAGA CGACGAAGCG CCCACAGACG AAGCGGCCAC AGACGAAGCG GCCACAGACA AGGCACACGC CGACAAGGCG CACGGCGACG GAGAGACGGG CGAAGCGAGA TGA
|
Protein sequence | MTSESGRPLS ARLADAGVDR RSFLKFCAGI TATLALPAQL APRVAYALDK VDRPTVIWLE FQDCAGDSEA FLRSRNPSAA DLILGMISLD YHETVMAASG TAAEKARDDA VAKGGHLVVV EGSIPTGIPG ACTIGGKSAE DILREAVRGA AGIINVGTCS AFGGIPAAGP NPTGAVRVED VVGGVPVVNL TGCPVNADNL TATIVHHLTF GEFPARDRFS RPLFAYGQRI HDTCQRRGHF DAGEFAEQWG DAGHRNGWCL YRLGCKGPST FHNCPSVRFN GGTSWPVAAG HGCVGCSEPG FWDTMSPFYD RLPHVATSGF DLTADRIGVG VLAATATGFA AHGVGKVVQH RVAARREKQA ARDLVEDDEA PTDEAATDEA ATDKAHADKA HGDGETGEAR
|
| |
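The Protein sequence field can be checked against the Gene sequence field by translating the CDS. The sketch below is a minimal, dependency-free translator. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons (not needed here, since this gene starts with ATG). The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGACGAGCG AATCCGGCCG GCCTTTGTCC"))  # prints "MTSESGRPLS"
```

Applying `translate` to the full Gene sequence and comparing the result with `clean` of the Protein sequence gives a quick consistency check between the two fields.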