Gene Francci3_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2501 
Symbol 
ID3904879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2952051 
End bp2953931 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content72% 
IMG OID637879831 
ProductCl- channel, voltage gated 
Protein accessionYP_481597 
Protein GI86741197 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0038] Chloride channel protein EriC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.169723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTGA CGTCACGCGG GATCTCCACC CGATGGGCAG CCCGTCCGGA GATCTACCAG 
CACCGAGCCG GCGCCGTCCT CGCGGCCCTG GCCGTCGTGA TCGGCGCGGG GGTCGGGCTC
GGCGCGGTCG CCTTCCGCGG GCTGATCAAC CTCTTCACAG AGGTGTTCTG CGGCCGGGCG
GACTGCTCGG TCGGCGGACG GCTGCCCAAC CCCCACATCC CGGACCTGGG TTTCTGGTTC
CTGCTGGCCG TCCCGGTGAT CGGAGGCTTG GTCTACGGAC CGCTCATCCA CCGGTTCGCC
CGGGAGGCCC GCGGCCACGG AGTCCCCGAA GTGATGTCCG CCGTCGCCGA ACGCGACGGC
CGCATCTCCC CCCGCGTCTC GGCGGTGAAG TCACTGGCCT CGGCGTTGTG CATCGGTGCC
GGAGGGTCGG TCGGCCGGGA AGGCCCGATC GTGCAGATCG GCGCGTCCCT CGGGTCGGCT
CTCGGCCAGC TCCTGCGCGT GCCGGGCCGC CGGCTGCCGA TCCTCGTCGC CTGCGGCGCC
GCCGGCGGCA TCGCCGCCAC CTTCAACGCC CCCGTCGCCG GCGTCCTGTT CGCCCTGGAG
GTGATCCTGC GGACCTTCAC CGCCGAGGCC TTCGGCGTCG TCGTGCTGGC CGCCGTCACC
GCCAGCGTCA TCGGGCGCGC CGCCTTCGGT GACACTCCCT TTCTCAGCCT GCCCACCTTC
GCCCTTCACA GCCAGGGCGA ATACCCCCTG TTCATCCTGC TCGGCGTGGT GGCCGGCCTA
ACCGGCGTGC TGTTCACCCG CCTGCTCTAT CTCATCGAGG ACCTCTGCGA CTGGGCCTGG
CGCGGCCCCG AATGGCTACG CCCCGCTGTC GGCGGCCTGC TCCTCGGCAC CGTCCTGCTC
GCCCTGCCCC AGATGTATGG CGTCGGCTAC CCGGTCCTCG AACACACCGT TCACGGCGGG
TACGCCCTGT GGTTCCTGCT CGTCCTCATC GGCGGGAAGA TCGTGGCCAC GAGTCTCACC
ATCGGCATCG GCGGCTCCGG CGGCGTCTTC GCCCCGTCCC TGTTCATCGG CGCGTCCACC
GGCGCCGCCT TCGGCACCCT CGCCCACCAT ATCGCCCCGG GCACCATCGC CCCTGTCGGG
GCCTACGCCC TGGTCGGCAT GGGTGCCGTC TTCGCCGGCG CGGCCCGCGC GCCCATCACC
GCCGTGCTCA TCCTGTTCGA GCTCACCGGC GAGTACACGA TCATCCTCCC CCTGATGACC
GCCGTCGTCG TCGCCACCCT GACCAGCCGG CTCCTGAGCA CCGACACCAT CTACACCCTG
AAGCTGACCC GCCGTGGCGT CGACCTCGAC GCCTCCCACG ACCTGCGCCG CCTGCGGGCC
ATCCCGGCCA CCGCCGCGAT GCGGTCACCG CCCCCGCCGG TCCCGGCCGG CGCGCTCCTC
TCCGAGGTCG CCGCCCTGCT CGCCGGCTCG CCGTTCCCCG CCCGCCCCGT CACCGACGGA
CACGGGCACT ACCAGGGGAT CATCACCACA CCGGCCGTCA CCCACGCCCT CGAGACCGAC
GCCCGGGCCG AGCAGCGCGC CGCTGGCGAC CTCGCTGTCC GCCCTCCCGC CCTCACCGTC
GACGACAGTG TCGCCACCGC CCTGCATGCG CTCACCGACG ACCCCGGCGC CCCGGGCCTG
CCCGTGCTCA CCAGCGACGG CCATACCGTC GCGGGATGGG TCACCCACCA GTCGGTACTT
GCCGCCGTCT ACCCGCCGCC GGCCGAAACG GGCGGGACGC GCACCGAACC GGTCCAACGC
GTCGACGAAC ACCGCCTTCA GGGTCAGGTC ACTTTTCCCG GTGAGATCAC TCATCGCACA
CGCTCCAGGA CACTCGCATA G
 
Protein sequence
MPVTSRGIST RWAARPEIYQ HRAGAVLAAL AVVIGAGVGL GAVAFRGLIN LFTEVFCGRA 
DCSVGGRLPN PHIPDLGFWF LLAVPVIGGL VYGPLIHRFA REARGHGVPE VMSAVAERDG
RISPRVSAVK SLASALCIGA GGSVGREGPI VQIGASLGSA LGQLLRVPGR RLPILVACGA
AGGIAATFNA PVAGVLFALE VILRTFTAEA FGVVVLAAVT ASVIGRAAFG DTPFLSLPTF
ALHSQGEYPL FILLGVVAGL TGVLFTRLLY LIEDLCDWAW RGPEWLRPAV GGLLLGTVLL
ALPQMYGVGY PVLEHTVHGG YALWFLLVLI GGKIVATSLT IGIGGSGGVF APSLFIGAST
GAAFGTLAHH IAPGTIAPVG AYALVGMGAV FAGAARAPIT AVLILFELTG EYTIILPLMT
AVVVATLTSR LLSTDTIYTL KLTRRGVDLD ASHDLRRLRA IPATAAMRSP PPPVPAGALL
SEVAALLAGS PFPARPVTDG HGHYQGIITT PAVTHALETD ARAEQRAAGD LAVRPPALTV
DDSVATALHA LTDDPGAPGL PVLTSDGHTV AGWVTHQSVL AAVYPPPAET GGTRTEPVQR
VDEHRLQGQV TFPGEITHRT RSRTLA