Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5226 |
Symbol | |
ID | 5673560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6274819 |
End bp | 6277422 |
Gene Length | 2604 bp |
Protein Length | 867 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244080 |
Product | carbonate dehydratase |
Protein accession | YP_001509490 |
Protein GI | 158316982 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0288] Carbonic anhydrase [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACGCCG AGCACCACGA GCCAAGAAAA CAGCCGTCGT CGGTGACGCC TGCCCCGACC GCGCTGCCTT ACCCGTCCGA GCCCGACACG GACCGCGCGA AGATCCCAGA CCACGCGGAC AGCCCGGGCC ACACCGAGAA CCCGGACCAC CCGGAGAAGC CGGACCGCCT TCCGGGCCGC CGCCCGTGGC GCCGCGCCCG GCGCGCGCCA GCCAGCCCCG CCGCTGACAG CTCCGACAAC CCCGCCTCCG ACAACCCCCC CTCCGGCGGC GCCGCCCGGC ACGACCCCGA CGGCACCGGC CGTCCCGGGT CCGGCGGCTC GGCCGGCCGC ACCCTCCGGG GAGCCTGGCG CCACGACCTG GAGGCGTCCG TCGTCGTCTT CCTGGTGGCG CTTCCGCTCT CGCTGGGCAT CGCGGTCGCC TCGGGCGCCC CGGTGGTCGC CGGCATCATC GCGGCGGTCG TCGGTGGAGT CGTCGCCGGC GCCGTCGGCG GAGTCCCGCT GCAGGTCTCC GGCCCGGCGG CCGGCCTCAC CGCCGTGGTG GCCGAGATCG TCGCCACGCA CGGCTGGCGG GTCGCCTGTT TCGTCACCGC CGCCGCGGGC GTGGTGCAGA TCCTTTTCGG CCTGAGCCGG GTCGCCCGCG CGGCCCTGGC CATCTCACCC GCGGTCGTGC ACGGCATGCT CGCCGGCATC GGCCTGACCA TCGTCATCGG GCAGATCCAC GTCGTGCTCG GCGGTACCGC CGGCTCGGCC GCCTGGGACA ACCTGATCGT CCTGCCCGGC GAGATCGTGT CGCCGGCCGT CCCGGCGGCG GCGCTGCTCG GCATCGCCGC CATCGCGCTG ACCGTGGTGT GGACGCGGCT GCCCCGGCCG TTGTCGTCCA TACCGGCACC GCTGGCCGCG GTGTCGATCG TGACCGCGGC ATCGCTTCCG TTCGACGTCC CGCGGGTCGC CCTGCCCGAC GACCTGCTGG GCGCGATCGC CCTGCCCGAG CTGCCCGCCG GCGGTGAATG GGGCGCGATC ACGCTGGCCG TGCTGACGGT CGCCCTGGTC GCGAGCATCG AGTCGCTGCT GTCGGCGGTC GCCGTCGAGG CGATGCACTC CGGCCCGCGT GGCGACCTCG ACCGCGAACT GCTCGGCCAG GGCGCCGCGA ACACGGTCTC CGGCCTGCTC GGCGGGCTGC CGGTCACCGG CGTCATCGTC CGCAGCTCGA CGAACGTCCG CGCCGGTGCC CGCACCCGCG CCTCGGCGAT CCTGCACGGC CTGTGGATGG CCGGTTTCGC GCTGCTGCTG GCGCCGCTGG TCGGGCGCAT CCCGCTCGCC GTGCTCGCCG GCCTGCTGGT CGTGATCGGC ATCCGCCTCG TCGACCTCGC GCACATCCGC GCGATCGCCC GCCACGGCGA GCTCGCGATC TACCTGACGA CGGTCGTCGG CGTGGTGCTG TTCAACCTGC TTGAGGGCGT CCTGATCGGG ATCGCCACCG CACTGCTGCT CGCCCTGCGC CGGACGCTGG TGGCGCCGGT CCACGTGCAC CCGCCCACCG GCCCTGGCTC GCCGTGGCGG GTCGTCGTCG AGGGCGCGCT GACCTTCCTC TCCCTGCCCA GGCTGTCCCG CCGGCTCGCC GAGGTGCCCG CCGGGGCGTC CGTCCGGCTC GACCTGGCGG TCGACTACCT CGACCACGGC GCGCACAAGA TGCTGGACGA CTGGATCGCC GAGCGGCATC GCGCCGGTGC CACTGTGACC GTCGACGAGG TGGGGGCCGC ACCGCTGGCC ATCCCCACCG TCGCGGCCCG GCCCCGCGAG GGACGACTCT CCAAGAGCCT CTTCGGGGGG CGTTCCACCG GCCGGCACGC CCGGTCGGCG CGGCCGCCGC GCTGGCTCGC CCCCTGGTCG GACTGGCAGC ACGGCCAGCT CCACGACCGC CAGCACCTGC TCAACGGCGT GGACGAGTTC CACCGCCGGA CGGCCCCGAT GATCGAGCCG TTCCTCACCG AGCTGACCTC GGGGCAGCGG CCGTCGACCC TGTTCATCAC ATGCAGCGAC TCGCGGCTCG TGCCGAACGT GATCACGAGC AGCGGCCCGG GGGACCTGTT CACCGTGCGC ACGCCGGGCG CGTTCGTCCC CGGTCCGCAG GCGGTCGGTG ACTCGACCCT GGCCGCGATC GAGTACGCCG TCGAGGTGCT CCGCGTACGG ACCATCGCCG TCTGCGGACA TTCCGGATGC GGTGCGGTCG CCGCCCTGCT CGACCGGGGC ACACCCGGCC ACAGCGGCTC CATCGTGGGC CCGCTGCGCA ACCTGGAGGC CTGGCTGCGC CACGGCGAAC CGGCCCTGGC CCGCGCGGCC CGTGACGCCG GTGGCCTTCC ACCGGAGCCC GACGAGTTGA GCCGGGTCAG CGTCGCGCAG CAGCTCGTCG CGCTGCGCGG GCTGTCCGTG GTGCGCCGCG CCGAGGCCGA AGGCCGGCTG CAGCTGGTCG GCATGTGGTT CGACATCGCC ACGGCGCGCG CGATCGTCCT GAACGAGTCG ACCGACCGGT TCGAGATCCC GACCGTCAGC GTCGTCCCGG CGTTGGAGAG CGGACAGCGC CTGGCCGACG CCACCCGGGG CTGA
|
Protein sequence | MHAEHHEPRK QPSSVTPAPT ALPYPSEPDT DRAKIPDHAD SPGHTENPDH PEKPDRLPGR RPWRRARRAP ASPAADSSDN PASDNPPSGG AARHDPDGTG RPGSGGSAGR TLRGAWRHDL EASVVVFLVA LPLSLGIAVA SGAPVVAGII AAVVGGVVAG AVGGVPLQVS GPAAGLTAVV AEIVATHGWR VACFVTAAAG VVQILFGLSR VARAALAISP AVVHGMLAGI GLTIVIGQIH VVLGGTAGSA AWDNLIVLPG EIVSPAVPAA ALLGIAAIAL TVVWTRLPRP LSSIPAPLAA VSIVTAASLP FDVPRVALPD DLLGAIALPE LPAGGEWGAI TLAVLTVALV ASIESLLSAV AVEAMHSGPR GDLDRELLGQ GAANTVSGLL GGLPVTGVIV RSSTNVRAGA RTRASAILHG LWMAGFALLL APLVGRIPLA VLAGLLVVIG IRLVDLAHIR AIARHGELAI YLTTVVGVVL FNLLEGVLIG IATALLLALR RTLVAPVHVH PPTGPGSPWR VVVEGALTFL SLPRLSRRLA EVPAGASVRL DLAVDYLDHG AHKMLDDWIA ERHRAGATVT VDEVGAAPLA IPTVAARPRE GRLSKSLFGG RSTGRHARSA RPPRWLAPWS DWQHGQLHDR QHLLNGVDEF HRRTAPMIEP FLTELTSGQR PSTLFITCSD SRLVPNVITS SGPGDLFTVR TPGAFVPGPQ AVGDSTLAAI EYAVEVLRVR TIAVCGHSGC GAVAALLDRG TPGHSGSIVG PLRNLEAWLR HGEPALARAA RDAGGLPPEP DELSRVSVAQ QLVALRGLSV VRRAEAEGRL QLVGMWFDIA TARAIVLNES TDRFEIPTVS VVPALESGQR LADATRG
|
| |