Gene Franean1_5226 details


Gene Information

Locus tag: Franean1_5226
Symbol:
ID: 5673560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6274819
End bp: 6277422
Gene Length: 2604 bp
Protein Length: 867 aa
Translation table: 11
GC content: 75%
IMG OID: 641244080
Product: carbonate dehydratase
Protein accession: YP_001509490
Protein GI: 158316982
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0288] Carbonic anhydrase
        [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
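
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the final codon is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(6274819, 6277422)
print(length, protein_length_aa(length))
```

With the values from this record, the two results agree with the listed Gene Length (2604 bp) and Protein Length (867 aa).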


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACGCCG AGCACCACGA GCCAAGAAAA CAGCCGTCGT CGGTGACGCC TGCCCCGACC 
GCGCTGCCTT ACCCGTCCGA GCCCGACACG GACCGCGCGA AGATCCCAGA CCACGCGGAC
AGCCCGGGCC ACACCGAGAA CCCGGACCAC CCGGAGAAGC CGGACCGCCT TCCGGGCCGC
CGCCCGTGGC GCCGCGCCCG GCGCGCGCCA GCCAGCCCCG CCGCTGACAG CTCCGACAAC
CCCGCCTCCG ACAACCCCCC CTCCGGCGGC GCCGCCCGGC ACGACCCCGA CGGCACCGGC
CGTCCCGGGT CCGGCGGCTC GGCCGGCCGC ACCCTCCGGG GAGCCTGGCG CCACGACCTG
GAGGCGTCCG TCGTCGTCTT CCTGGTGGCG CTTCCGCTCT CGCTGGGCAT CGCGGTCGCC
TCGGGCGCCC CGGTGGTCGC CGGCATCATC GCGGCGGTCG TCGGTGGAGT CGTCGCCGGC
GCCGTCGGCG GAGTCCCGCT GCAGGTCTCC GGCCCGGCGG CCGGCCTCAC CGCCGTGGTG
GCCGAGATCG TCGCCACGCA CGGCTGGCGG GTCGCCTGTT TCGTCACCGC CGCCGCGGGC
GTGGTGCAGA TCCTTTTCGG CCTGAGCCGG GTCGCCCGCG CGGCCCTGGC CATCTCACCC
GCGGTCGTGC ACGGCATGCT CGCCGGCATC GGCCTGACCA TCGTCATCGG GCAGATCCAC
GTCGTGCTCG GCGGTACCGC CGGCTCGGCC GCCTGGGACA ACCTGATCGT CCTGCCCGGC
GAGATCGTGT CGCCGGCCGT CCCGGCGGCG GCGCTGCTCG GCATCGCCGC CATCGCGCTG
ACCGTGGTGT GGACGCGGCT GCCCCGGCCG TTGTCGTCCA TACCGGCACC GCTGGCCGCG
GTGTCGATCG TGACCGCGGC ATCGCTTCCG TTCGACGTCC CGCGGGTCGC CCTGCCCGAC
GACCTGCTGG GCGCGATCGC CCTGCCCGAG CTGCCCGCCG GCGGTGAATG GGGCGCGATC
ACGCTGGCCG TGCTGACGGT CGCCCTGGTC GCGAGCATCG AGTCGCTGCT GTCGGCGGTC
GCCGTCGAGG CGATGCACTC CGGCCCGCGT GGCGACCTCG ACCGCGAACT GCTCGGCCAG
GGCGCCGCGA ACACGGTCTC CGGCCTGCTC GGCGGGCTGC CGGTCACCGG CGTCATCGTC
CGCAGCTCGA CGAACGTCCG CGCCGGTGCC CGCACCCGCG CCTCGGCGAT CCTGCACGGC
CTGTGGATGG CCGGTTTCGC GCTGCTGCTG GCGCCGCTGG TCGGGCGCAT CCCGCTCGCC
GTGCTCGCCG GCCTGCTGGT CGTGATCGGC ATCCGCCTCG TCGACCTCGC GCACATCCGC
GCGATCGCCC GCCACGGCGA GCTCGCGATC TACCTGACGA CGGTCGTCGG CGTGGTGCTG
TTCAACCTGC TTGAGGGCGT CCTGATCGGG ATCGCCACCG CACTGCTGCT CGCCCTGCGC
CGGACGCTGG TGGCGCCGGT CCACGTGCAC CCGCCCACCG GCCCTGGCTC GCCGTGGCGG
GTCGTCGTCG AGGGCGCGCT GACCTTCCTC TCCCTGCCCA GGCTGTCCCG CCGGCTCGCC
GAGGTGCCCG CCGGGGCGTC CGTCCGGCTC GACCTGGCGG TCGACTACCT CGACCACGGC
GCGCACAAGA TGCTGGACGA CTGGATCGCC GAGCGGCATC GCGCCGGTGC CACTGTGACC
GTCGACGAGG TGGGGGCCGC ACCGCTGGCC ATCCCCACCG TCGCGGCCCG GCCCCGCGAG
GGACGACTCT CCAAGAGCCT CTTCGGGGGG CGTTCCACCG GCCGGCACGC CCGGTCGGCG
CGGCCGCCGC GCTGGCTCGC CCCCTGGTCG GACTGGCAGC ACGGCCAGCT CCACGACCGC
CAGCACCTGC TCAACGGCGT GGACGAGTTC CACCGCCGGA CGGCCCCGAT GATCGAGCCG
TTCCTCACCG AGCTGACCTC GGGGCAGCGG CCGTCGACCC TGTTCATCAC ATGCAGCGAC
TCGCGGCTCG TGCCGAACGT GATCACGAGC AGCGGCCCGG GGGACCTGTT CACCGTGCGC
ACGCCGGGCG CGTTCGTCCC CGGTCCGCAG GCGGTCGGTG ACTCGACCCT GGCCGCGATC
GAGTACGCCG TCGAGGTGCT CCGCGTACGG ACCATCGCCG TCTGCGGACA TTCCGGATGC
GGTGCGGTCG CCGCCCTGCT CGACCGGGGC ACACCCGGCC ACAGCGGCTC CATCGTGGGC
CCGCTGCGCA ACCTGGAGGC CTGGCTGCGC CACGGCGAAC CGGCCCTGGC CCGCGCGGCC
CGTGACGCCG GTGGCCTTCC ACCGGAGCCC GACGAGTTGA GCCGGGTCAG CGTCGCGCAG
CAGCTCGTCG CGCTGCGCGG GCTGTCCGTG GTGCGCCGCG CCGAGGCCGA AGGCCGGCTG
CAGCTGGTCG GCATGTGGTT CGACATCGCC ACGGCGCGCG CGATCGTCCT GAACGAGTCG
ACCGACCGGT TCGAGATCCC GACCGTCAGC GTCGTCCCGG CGTTGGAGAG CGGACAGCGC
CTGGCCGACG CCACCCGGGG CTGA
 
Protein sequence
MHAEHHEPRK QPSSVTPAPT ALPYPSEPDT DRAKIPDHAD SPGHTENPDH PEKPDRLPGR 
RPWRRARRAP ASPAADSSDN PASDNPPSGG AARHDPDGTG RPGSGGSAGR TLRGAWRHDL
EASVVVFLVA LPLSLGIAVA SGAPVVAGII AAVVGGVVAG AVGGVPLQVS GPAAGLTAVV
AEIVATHGWR VACFVTAAAG VVQILFGLSR VARAALAISP AVVHGMLAGI GLTIVIGQIH
VVLGGTAGSA AWDNLIVLPG EIVSPAVPAA ALLGIAAIAL TVVWTRLPRP LSSIPAPLAA
VSIVTAASLP FDVPRVALPD DLLGAIALPE LPAGGEWGAI TLAVLTVALV ASIESLLSAV
AVEAMHSGPR GDLDRELLGQ GAANTVSGLL GGLPVTGVIV RSSTNVRAGA RTRASAILHG
LWMAGFALLL APLVGRIPLA VLAGLLVVIG IRLVDLAHIR AIARHGELAI YLTTVVGVVL
FNLLEGVLIG IATALLLALR RTLVAPVHVH PPTGPGSPWR VVVEGALTFL SLPRLSRRLA
EVPAGASVRL DLAVDYLDHG AHKMLDDWIA ERHRAGATVT VDEVGAAPLA IPTVAARPRE
GRLSKSLFGG RSTGRHARSA RPPRWLAPWS DWQHGQLHDR QHLLNGVDEF HRRTAPMIEP
FLTELTSGQR PSTLFITCSD SRLVPNVITS SGPGDLFTVR TPGAFVPGPQ AVGDSTLAAI
EYAVEVLRVR TIAVCGHSGC GAVAALLDRG TPGHSGSIVG PLRNLEAWLR HGEPALARAA
RDAGGLPPEP DELSRVSVAQ QLVALRGLSV VRRAEAEGRL QLVGMWFDIA TARAIVLNES
TDRFEIPTVS VVPALESGQR LADATRG
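
The gene begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch, not the database's own pipeline: it strips the 10-base block spacing, computes a GC fraction, and translates a CDS using the standard codon assignments (which table 11 shares) together with the table 11 start codons. All function and constant names are hypothetical.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display line of the gene sequence in this record (60 bases).
FIRST_LINE = "GTGCACGCCG AGCACCACGA GCCAAGAAAA CAGCCGTCGT CGGTGACGCC TGCCCCGACC"
print(translate_cds(FIRST_LINE))
```

For the first 60 bases this prints MHAEHHEPRKQPSSVTPAPT, which matches the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` is one way to check the listed GC content.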