Gene Francci3_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0349 
Symbol 
ID3905194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp405078 
End bp409643 
Gene Length4566 bp 
Protein Length1521 aa 
Translation table11 
GC content75% 
IMG OID637877678 
ProductATPase, E1-E2 type 
Protein accessionYP_479465 
Protein GI86739065 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0474] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGC CCGCAGCCCG GACCGTCCTG CGCGTGCCGA GCGCCCTCCT GGGGGCGGTG 
GCCCAGCCGG TTCTCGCCCA TACGGTTCTT GCCCATACCC GGACCGGGTG GAGTCCGGAT
TCAAAGCCCG GTTCCCTCGC ACCCGGGCCC CCCACGCCCC GCGGGGTCGA TCGCGACGGC
GCCGATCGCG ACGGCGCCGA TCGCGAGATC GCCGGCTCCA CGGCGGCCGG CGAACCGCGG
GGGAGGGCCG ACGCCCACCA CCTTCTCAGT CTCGGGATCC GGGGGATCTC CGGGCCTGCC
GGCGAATCCC GTCGCCGGCA GCTGGAACGA GTACTGGAGA ATCAGCCCGG GGTGGCGTGG
GCCCGGGTCA ACGAGCCGTT GCAATGTGTC CTGATCGGGC TCGCCGAACC GCCCCCCACC
CTCGACGACC TGGTCGCGGC CCTCGGCCGG GTGGAGGGCG TCGCGCTCGA ACCGAAGCCG
GGAGCGGTGG CGAGGTTCGC GGCGCCGGTC CGCCGGACGG CCGCGGCACT GGCGGCCGAC
GGGGTCGCGT TCGCGCTGGC GTCGGCGGGC CGGCTCGCCC GGGCGGTGCC ACTGCCCATC
GAGATCGCAT CGCTGGTCAC CCTCGTCGAC ACCCAACCCC GGCTGCGCCG GTGGGTGGAG
CAGATGCTCG GCGCGGGCCG GGCCGACGTC GTCCTCGGGC TGGTCAACGC CGGTGCCCTC
GCCCTGGCGC AGGGGAATCT CGGTCTCGCG GTCGACGCCG GCTACCGGAT GGTCGCGCTG
GGCGAGGCGC GTGCGCGCCG GGACGCCTTC ATGACGGCTC GGGACGGGCT GCTCGGTCAT
CCGGACTGGG CCGCGGCCGA GCCGGTGGTG ACCGAACGGC CGCGACCGCT GCCGGACGGT
CCGGTGGAGC GCTACGCCGA CCTCGCCGTG CTCGGCGGGC TCGGCGGGTT CGCGGCCGCC
ACCGCCGCCA CCGGGAATCC GCGCCGTGCC GTGGCGCTGG CCGCCACCTC GCTGCCACGC
CCGGCCCGGG TCGGGCGGGA GGGCTTCGCC GCCCAGTTGA CCCGCGTCCT GGCGCGCCGC
GGCGTCCACG TCATGGATTC GGCGGCCCTG CGGGTGCTCG ACCGCGTCGA CACCATCGTC
ATCGACGCCG ACGTCCTGCG CGGAGAGGAA CGCATAATCG CCGACGTCGT GCCGCTGGCC
GGTGCGAACC GCAGCGACGT CGCGGTCCGC GCGCACGCCC TGTTCCGTCC GGCCGGTATC
AGCGCCGTCG CCACCGCGGA GGGCTGGGCC CTCGGGCCGG TGGAACAGCT CGGCCTGCGC
GGCCGGACCG GTGTCCGGGA GCGCCGCGAA CTGGCCCGGT CCGGGGCGGA CACGGTGCTC
GGTCTGGCTC AGGGCAGCAG GCTGATGGCC ATCGTGTCGG TCGAACCCGG GCGAGCGGAG
GGGAGCCACC ACCTGCTGGC CGCCTGCCGC CGTTCCGGGC GGGCGGTGTT CATCGCCGGT
GACCCTCCCG CCGGCGCCGG CGGAACAGTG GAGGACACCA ACAACCTTCC CGGGGTACCC
GGGGGGGACC GGTTGGTCGG GACGATCCGC GGTCTGCAGG CCGAGCGCGG CGGCGTCCTG
GTCGTGTCCC GCCGACGGGC CGCGGTCGGC ACCGCGGACT GCGGCGTCGG CGTGAACGGC
CACGACGGCT CGCCGGCCTG GGGGGCGCAC ATCCTGGTTG GGAATGACCT GGCCGCCGCC
AGCCTCGTGA TCGAGGCGGT CGCCGCGGCC GCCCAGGTGA GCCGGCACGC GGTGCGGCTC
GCCCAGGTGG GCTCGGGTGC CGGTGCGTTG GCCATGCTCA CCGGGGCCGA CCCGCGGCTG
GTCTCCCGGG CCATGACAAT GGTCACCGCG GCGGTCACCG CCGCCCTCGG CGAGGGGATC
TGGGCGGCGC GGGAACTGGG TCGCCGCCCC CCGCCGCCCG CAGTCCCCCG TACCCCGTGG
CACATCCTGC CGGCCGAGGA CTGCCTGGCT CTGCTGGACA ACAGCCGCCC GGGTGGGCTG
TCCACCGAGG AGGCGGCGCG ACGCCGCCAG GACGGCGCGG TGTCGATGCA GGCGACCGCG
CCCAGCCTCG TCCGGGCGTT CGCGGCCGAA CTGGCCAACC CGCTCACCCC CGTCCTCCTC
GGCGGCGCCG CGCTGTCGGC CGCGACCGGT TCCGTGCTGG ACGCCGGTCT GGTGCTCAGC
GTCGCGATCG GCTCGGCGTT CGCCGGCGCG GTCCAGCAGA TACGGGCGGA CCGGGCACTC
GCCCGACTGT TCGCCGTCTC GGCGGTTCCG GCCCGGGTGC TGCGGGACGG CGAGGAGACC
AAGCTGCCGG CGGACGACCT GGTCCCGGGC GACATCATCA TGGTCGGGGC CGGGGATGTC
GTCCCCGCGG ACTGCCGGCT CCTGTCGACG ACGGGGCTCG ACGTCGACGA GTCGTCGCTG
ACCGGCGAGT CCATGCCGGT CACGAAATCG CCCGGACCGG TGGCAGCCGC CAACCTGGCA
GACCGCTCCT CGATGATCTA CGAAGGAACG ACCGTCGCCG GCGGGCGGGG CGCCGGCGTG
GTGGTGGCCA CCGGCTCGGC CACCGAGGCG GGGCGCAGCA TGGCCGCGAC CGCCGGTCCC
GCCCGGCTCA GCGGCGTCGA GGCGCGACTG TCCACAATCA CCGATCTCAC CATTCCGGTG
GCGCTCGCCG CCGCCGGGGC ACTGTTGGCC TCCGGGTTGA TCCGTGGCCT GCCGATCCGG
GACACGCTCA GCGCCGGGGT GGCGCTCGCC GTCGCGGCGG TGCCCGAAGG GCTGCCCTTC
CTCGCGACCG CCGCCCAGCT TTCCGCCGCC CGACGGCTCT CGGCACGGGG CGCGCTGGTA
CGCAACCCCC GCACCATCGA GACCCTCGGC CGGGTGGACG TGCTGGGTTT CGACAAGACC
GGAACGCTCA CCGAGGGCAG GATCCACCTG CACGCCGTCT CGGACGGCAC GCACACCGCC
GCGGTGACCG AGTTCGGAGC GACGCATCGG CGGGTGCTGG CCGCCGGGCT ACGCGCCACG
CCGCGCGGCA AAGGAAAGAA GAAGTTGCCG CATCCGACCG ATCGCGCGGT GCAGAAGGGA
GCGGCCGCGG CGGCGGTGAC CCGCGAGTAC GGTCTCGCCG TGTGGTCGCC GACGGCCTCG
CTTCCCTTCG AGCCGGGCCG CGGCTACCAC GCGGCAGTGG GGGATGCCGG CACGACCACG
GTGCTGAGCG TCAAAGGGGC GCCCGAGGTG CTGCTGCCAC GGTGCGCCCG CATCCGGACC
GCCGACGGAA CGGCACCGCT GAACGACCGC CGGCGGGCGA GGCTCATCCA GGAGCACAGC
CGGCTCGCGG GCGCGGGCTA CCGCGTCCTC GCGGTGGCCG ATCGCGACCT GGGGTCCGCG
CCCCGACCGG CGGACGAGGA ACTCACCGAT GACAGCGTGG CCGAGTTGGC CTTCCTCGGC
TTCCTCGTCC TGTCCGACCC GGTCCGGACC ACGGCCGGGG CGTCGCTGGA GGCGCTGCGC
GCCGCGGGCG TGCAGGTGCT GATGATGACC GGAGACCATC CGGCCACCGC CAGGACGATC
GCCACCGAAC TGGGAGTGCT CACCGACGAC CAGATCATGC TGACGGGTGT CGAGCTCGAA
GCCATGGACG ACGAGGCCCT GGACGCGGTC CTGCCCAGGG TGGCCGTGAT CGCCCGGAGC
ACTCCGCTGC ATAAGGTGCG GGTCGTGGAG GCCTACCAGC GGCTGGGCAA GACGGTGGCG
ATGACCGGCG ACGGCGCCAA CGACGCGCCC GCCATCCGGC TCGCCGACGT CGGCCTCGCG
CTGGGCCGGC GTGGCACGCC GGCGGCCCGG GCCGCCGCCG ACGTGATCGT GACGGACGAC
CAGCTGAACA CGATCATCGA CGTGCTCGTC GAGGGCCGTT CGATGTGGGC CTCGGTCCGC
CAGGCCCTGG GCATCTTCGT GGGCGGGAAC CTCGGCGAGA TCGCGTTCAC TCTCCTCGGC
TCCATGGCCA CCGGGCGGTC GCCACTGTCC GCGCGGCAAC TGCTGCTCGT CAACCTGCTC
ACCGATCTCG CCCCGGCCCT GGCCGTCGCG CTGCGCGAGC CGGATCCGGA GGCGACCGGA
CAGCTCCTCA GCGAGGGCCC GGAGCGCTCC CTCGGTACCG CCCTGAACCG GGAGATCGCG
GTGCGGGCGG TCGCCACGAC CCTCGCCGCC ACGGGTGCCT GGATCATCGC CCGCCTCACC
GGGCGCCGAC GGCACGCGAA CACGGTCGGC CTCGCCGCGC TCGTCGGCTC GCAACTGGGC
CAGACGCTGC TCGTCGGCGG CCGCAGCCGG ACGGTCCTGC TGAGCATCGC CGCCTCGGCG
GTCGTCCTGG CCGCCATCGT GCAGATGCCA GGCGTCAGCC AGTTCTTCGG TTGCACCCCG
CTCGGTCCGG CGGGCTGGTC GATCGCGATC GGCGCGTCGC TGGCGGGGAC GTTGCTCTCC
TTCCTCCTCC AAGCCTCCGC CGGGCTGCAG TCCGCCGCAG GACACCTTTG GGTACGTGCG
AGGTGA
 
Protein sequence
MRLPAARTVL RVPSALLGAV AQPVLAHTVL AHTRTGWSPD SKPGSLAPGP PTPRGVDRDG 
ADRDGADREI AGSTAAGEPR GRADAHHLLS LGIRGISGPA GESRRRQLER VLENQPGVAW
ARVNEPLQCV LIGLAEPPPT LDDLVAALGR VEGVALEPKP GAVARFAAPV RRTAAALAAD
GVAFALASAG RLARAVPLPI EIASLVTLVD TQPRLRRWVE QMLGAGRADV VLGLVNAGAL
ALAQGNLGLA VDAGYRMVAL GEARARRDAF MTARDGLLGH PDWAAAEPVV TERPRPLPDG
PVERYADLAV LGGLGGFAAA TAATGNPRRA VALAATSLPR PARVGREGFA AQLTRVLARR
GVHVMDSAAL RVLDRVDTIV IDADVLRGEE RIIADVVPLA GANRSDVAVR AHALFRPAGI
SAVATAEGWA LGPVEQLGLR GRTGVRERRE LARSGADTVL GLAQGSRLMA IVSVEPGRAE
GSHHLLAACR RSGRAVFIAG DPPAGAGGTV EDTNNLPGVP GGDRLVGTIR GLQAERGGVL
VVSRRRAAVG TADCGVGVNG HDGSPAWGAH ILVGNDLAAA SLVIEAVAAA AQVSRHAVRL
AQVGSGAGAL AMLTGADPRL VSRAMTMVTA AVTAALGEGI WAARELGRRP PPPAVPRTPW
HILPAEDCLA LLDNSRPGGL STEEAARRRQ DGAVSMQATA PSLVRAFAAE LANPLTPVLL
GGAALSAATG SVLDAGLVLS VAIGSAFAGA VQQIRADRAL ARLFAVSAVP ARVLRDGEET
KLPADDLVPG DIIMVGAGDV VPADCRLLST TGLDVDESSL TGESMPVTKS PGPVAAANLA
DRSSMIYEGT TVAGGRGAGV VVATGSATEA GRSMAATAGP ARLSGVEARL STITDLTIPV
ALAAAGALLA SGLIRGLPIR DTLSAGVALA VAAVPEGLPF LATAAQLSAA RRLSARGALV
RNPRTIETLG RVDVLGFDKT GTLTEGRIHL HAVSDGTHTA AVTEFGATHR RVLAAGLRAT
PRGKGKKKLP HPTDRAVQKG AAAAAVTREY GLAVWSPTAS LPFEPGRGYH AAVGDAGTTT
VLSVKGAPEV LLPRCARIRT ADGTAPLNDR RRARLIQEHS RLAGAGYRVL AVADRDLGSA
PRPADEELTD DSVAELAFLG FLVLSDPVRT TAGASLEALR AAGVQVLMMT GDHPATARTI
ATELGVLTDD QIMLTGVELE AMDDEALDAV LPRVAVIARS TPLHKVRVVE AYQRLGKTVA
MTGDGANDAP AIRLADVGLA LGRRGTPAAR AAADVIVTDD QLNTIIDVLV EGRSMWASVR
QALGIFVGGN LGEIAFTLLG SMATGRSPLS ARQLLLVNLL TDLAPALAVA LREPDPEATG
QLLSEGPERS LGTALNREIA VRAVATTLAA TGAWIIARLT GRRRHANTVG LAALVGSQLG
QTLLVGGRSR TVLLSIAASA VVLAAIVQMP GVSQFFGCTP LGPAGWSIAI GASLAGTLLS
FLLQASAGLQ SAAGHLWVRA R