Gene Francci3_3245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3245 
Symbol 
ID3904416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3840155 
End bp3842788 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content73% 
IMG OID637880570 
Productcarbonate dehydratase 
Protein accessionYP_482331 
Protein GI86741931 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0288] Carbonic anhydrase
[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.457937 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTT CGGTTTCCCC GCCGTCACCG TCCCGGCACA GCCATTCGCC GCCCATGTCC 
GGGTCCACCG CCAAGAACGT GCGCCGGACC CGGGGTGACC ACAACCCCCT GCGCCCCGAT
CTCCAGGCCT CCCTGGTCGT GTTCGTCGTG GCGTTGCCCC TGTCGCTGGG CATCGCCGTC
GCCTCGGGGG CGCCGGTCGC AGCGGGTCTC CTCGCCGCGG TGATCGGCGG CGTCGTCGCC
GGCGCCCTCG GCGGTGCGCC GTTGCAGGTG AGCGGGCCGG CCGCCGGGCT GACCGTCATC
GTCGCCGACC TCGTCCACAC CTACGGGTGG CGCGTTACCT GTCTGATCAC CGCAGGGGCC
GGGATTCTGC AGATCCTGCT CGGGCTCTGC CGGGTCGCCC GCGCCTCCCT GGCCGTCTCG
CCGGCCATCG TCCACGGGAT GCTCGGCGGC ATCGGGCTGA CGATCGTCCT CGGCCAGTTC
CACGTGGTGC TGGGCGGACA GGCCGAGAGT CACGCCTGGG AGAACGTCGC GGCAGTGCCC
GAGGCGATCA TCGATCCGCA CGGTTTCGCG ACCCTCGTCG GCTTCGTCAC CATCGCCACG
ATCCTGCTCT GGCCCCGGTT GCCGAAGAGG CTGCCGAACG CCGTCCGAGC CATCCCCGCC
TATCTCGCGG CGGTCGTGAC GGGCACCCTG CTCGCTGCCG TCGCGGGCTT CGACCTGCCC
AGGGTCGACC TGCCCTCGTC CCTGTTCGAC GCGGTCGCCC TGCCGGGCCT CCCACACGGC
GACTGGTCCG GCATCGCGCT GGGCGTCGTC ACCGTGGCCA TCGTCGCCAG CGTCGAGTCC
CTGCTCTCGG CCGTCGCGGT GGACCAGCTT GCCACCACCC GAGGCTGGCG GGGGCCGCGC
GTCGACCTCG ACCGCGAGCT GCTCGGGCAG GGCTCGGCGA ACCTCGTCTC GGGCCTGGCC
GGCGGGTTGC CGATCACCGG GGTGATCGTG CGCAGCTCCA CCAACGTCGC CTCCGGCGGG
CGTACCCGGA AGTCTGCCGT CCTGCACGGC GTCTGGGTGC TGCTGTTCGC CGTGTTCCTG
GGCCGCATGG TGGAGTGGGT CCCGCTGTCC GCCCTCGCCG GCCTGCTCGT CGTCGTCGGT
GCGCGGCTGG TCAACGTCGC GCATCTGCGC CACGTCCGCC GGCACGGGGA GCTGCCCGTC
TACCTGGTGA CAATCGTCGG TGTCGTCGCC CTCGACCTCC TGCAGGGCGT CGCCCTGGGT
CTGCTCACCG CGCTCGCCCT GGTCCTGCGC CGGGTGCTGT GGTCGAGTGT CCGGCTGGTC
CGCACCGGCG AGCTCTGGCA GGTGGAGGTC GAGGGGACGC TGAGCTTCCT GTCCCTGCCC
CGCCTCGCGC GGATCCTGGG CCGTATCCCC GCCGGTGCCC CCGTCTCGAT CGAACTCATC
GTCGACTACC TTGATCACGC CGCCTATGAG CATCTGCGTG GCTGGTGCGC GAACCACGAG
GCGTCCGGCG GGACGGTGAG CGTCGACGAG ATCGGCCAGG TCTGGTTCCG CCGGCCGGCC
GAGGTCGCGG AGCAACGGCG CCGCACCGTC GTCCCGCACC TCCCGCGCTG GTTCGCGCCC
TGGTCGCAGT GGCAGGAACT GGAACTCGCC TCGTCATCGT CTGATTCCCC TTCGTCTGAT
TCCCCTTCGT CTGATTCCCC TTCGTCGGGA TCGCGGTCTT CGGATTCCCA GCCCGCCGGC
GTCACTCCGC CGGCGATCCA CCCAGTCCAG GGCCCGGACC CGGCGGCGTC GGTGCCGGCG
CAGGCCTCCC CCGACCGGAC ATTCGACCAC GTCAGCACGA TGCTGTTCGG GGTGAACGAG
TTCCACCGGC GGGCCGCCCC GCTCCTGCGG GGCACCTTCG ACGCGCTCGC CGGGGGACAG
CAGCCCGGCG CCCTGTTCCT CACCTGCGCC GACTCCCGCA TTGTTCCCAA CATCATCACC
AGCAGCGGTC CCGGCGACCT GTTCACGATC CGTAATGTCG GGAACATCGT CCCGGTGGAC
GACCCCGCCG GATCCGACCC GGACGCGCCC TTGCGGCGCA GTGGTGACCT GTCGGTGACC
GCCGCGCTGG ACTACGCCGT CGACGTCCTG CGGGTGCCCT CGCTCGTGGT GTGCGGCCAC
TCCGGCTGTG GCGCCATGCA GGCGTTGCTG TCCGGCACGC TCGACGGCGC ACCCGACTCG
GCCCTGGCGG GCTGGCTGTC GCACGCCGCC GCCTCGCTTG AGCGGACGCC CCCCGCCGGC
ACGGAGGATC TGCCCCCGGT GGAAAGGCTG GGCCGCGCCA ACGTGGCGCA GCAGTTGGAG
AACCTGCGCG CGCATCCGGC CGTGCGCCGT GCCCTGGCAC GCGGCACGCT GGAACTCGTC
GGCCTGTACT TCGACATCGC CGACGCGCGC ATCTGGGTGC TCGAGGAGTC GACCGGCCGG
TTCGTCGACC CGATGGACGC GTTGCCGACG GCGGTCATCG GGACCGGCTC GCGTCGCCGG
TGGGACGCAC CGGTGCGGGT GCCCGCACCC ATGGTCGCGG TCGGCGCGGG TCCTGCCGAC
GAGCAACCGC CGCCCGCCGG CCGGCGGGCC CGGTGGCTCG GCCTGCCGCG GTAG
 
Protein sequence
MTASVSPPSP SRHSHSPPMS GSTAKNVRRT RGDHNPLRPD LQASLVVFVV ALPLSLGIAV 
ASGAPVAAGL LAAVIGGVVA GALGGAPLQV SGPAAGLTVI VADLVHTYGW RVTCLITAGA
GILQILLGLC RVARASLAVS PAIVHGMLGG IGLTIVLGQF HVVLGGQAES HAWENVAAVP
EAIIDPHGFA TLVGFVTIAT ILLWPRLPKR LPNAVRAIPA YLAAVVTGTL LAAVAGFDLP
RVDLPSSLFD AVALPGLPHG DWSGIALGVV TVAIVASVES LLSAVAVDQL ATTRGWRGPR
VDLDRELLGQ GSANLVSGLA GGLPITGVIV RSSTNVASGG RTRKSAVLHG VWVLLFAVFL
GRMVEWVPLS ALAGLLVVVG ARLVNVAHLR HVRRHGELPV YLVTIVGVVA LDLLQGVALG
LLTALALVLR RVLWSSVRLV RTGELWQVEV EGTLSFLSLP RLARILGRIP AGAPVSIELI
VDYLDHAAYE HLRGWCANHE ASGGTVSVDE IGQVWFRRPA EVAEQRRRTV VPHLPRWFAP
WSQWQELELA SSSSDSPSSD SPSSDSPSSG SRSSDSQPAG VTPPAIHPVQ GPDPAASVPA
QASPDRTFDH VSTMLFGVNE FHRRAAPLLR GTFDALAGGQ QPGALFLTCA DSRIVPNIIT
SSGPGDLFTI RNVGNIVPVD DPAGSDPDAP LRRSGDLSVT AALDYAVDVL RVPSLVVCGH
SGCGAMQALL SGTLDGAPDS ALAGWLSHAA ASLERTPPAG TEDLPPVERL GRANVAQQLE
NLRAHPAVRR ALARGTLELV GLYFDIADAR IWVLEESTGR FVDPMDALPT AVIGTGSRRR
WDAPVRVPAP MVAVGAGPAD EQPPPAGRRA RWLGLPR