Gene Information |
Locus tag | Franean1_3851 |
Symbol | |
ID | 5672214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4573273 |
End bp | 4574370 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242729 |
Product | hemerythrin HHE cation binding domain-containing protein |
Protein accession | YP_001508149 |
Protein GI | 158315641 |
COG category | |
COG ID | |
TIGRFAM ID | |
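The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both are assumptions, although they are consistent with the values in this record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record above.
length = gene_length(4573273, 4574370)
print(length, protein_length(length))
```

With the record's coordinates this gives 1098 bp and 365 aa, matching the Gene Length and Protein Length fields.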
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00703779 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACC TTCATGATCT CCCGGCCCGG GTCTTCAGCC TCATAGAGTC GGGGGTGGTG GCCGAGTTCT CGACGGTGTC CGCGGCCGGC ATTCCGATCG ACACGCCCAC CTACTATTTT CCGGCCGACG ACATGTCGAC GATCGACCTG GCAACGGGCC TGCCCAATCC GGCCAAGGCC GAGCGGGTGC GGCGCACCTC GAAGGTCGGG CTCCTGATCG AGGGCCGCCC CGAGGAACCG GTCGTGGTCA TTCGAGCCCA CGGTGCGGTG CGCGACAGCG ACATCCAGGC CAACGCCATC CGCTATCTCG CCGAGACCGG CTACGAGGGG ATCAGCCACG GCATCACGTG GGAGGAGGCA CGCAAGGCCG TCACCTACTG GTCTCGGATC ATCATCGAGA ACAAGCCCGA ACGGGTGTAC TGGTGGGACA GCCACGCGGC CCTCGACGAC CCACCGCAGG TGTGGTCCGC GGCGCCCGAC ACGGTCTACC CGACATCCGA CCCCTCGCCC GCCCGCAGGA TGAAGCGGTC GGCCTGGCCG GTCCGTCCCT GGCAGGACGT GGCCAGGGAG GCCGTGGAGG GCGGCACTCC GGCGCATCTG TCGGTGCTCG ACGAGGACGG CTTTCCGCTT CCGATGCGGG CGACCTCCTT CGAGCTGACA GCCGAGGGCT TCCGGCTGGC GGTGCCCACG GGCACCCCCT GGCGGCTGCG GGGCAGGGCG TCGCTCACCT TCGCGGGTTT CCGTACCTTC GTCGGCGACG CCGGAGCGGA CGGCGGTGCT GAAGCGAACG GCGGTGCTGA AGCGAACGGC GGTGCTGGAG TGGACGGCGG CGACCGCGTC CTGTTCCGCG TCGAGCGCTC GCTCCCCCAG CACCCGGCGA CACTCGACAC GAAGCAGGTT CTCCAGCCGA GCGAGGACAC CCTCGCCAAG GCGAGGGCAC GGCTCGAGTA CGAGGCTGAG CGGCGCGGCC AGTCCCTCCC GGTCATCCCG GCGGACCCCC CGCCACGGAC CCGCATCGCG CTGATCCGCC GGGCGCGCAT CGCCAGCGAC GCACCGATCA CCGGCATCAC CGAGGAGCAC GGGAACCGAA GGACCTGA
|
Protein sequence | MSDLHDLPAR VFSLIESGVV AEFSTVSAAG IPIDTPTYYF PADDMSTIDL ATGLPNPAKA ERVRRTSKVG LLIEGRPEEP VVVIRAHGAV RDSDIQANAI RYLAETGYEG ISHGITWEEA RKAVTYWSRI IIENKPERVY WWDSHAALDD PPQVWSAAPD TVYPTSDPSP ARRMKRSAWP VRPWQDVARE AVEGGTPAHL SVLDEDGFPL PMRATSFELT AEGFRLAVPT GTPWRLRGRA SLTFAGFRTF VGDAGADGGA EANGGAEANG GAGVDGGDRV LFRVERSLPQ HPATLDTKQV LQPSEDTLAK ARARLEYEAE RRGQSLPVIP ADPPPRTRIA LIRRARIASD APITGITEEH GNRRT
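The sequences above are printed in space-separated blocks of ten characters, and the gene sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the DNA translated. It uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def ungroup(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
excerpt = ungroup("ATGAGTGACC TTCATGATCT CCCGGCCCGG")
print(translate(excerpt), gc_fraction(excerpt))
```

For this excerpt the translation is MSDLHDLPAR, which matches the first block of the protein sequence. Running the same helpers on the full gene sequence would allow the GC content and Protein Length fields to be checked as well.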