Gene Franean1_3975 details

Gene Information

Locus tag: Franean1_3975
Symbol:
ID: 5672336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4757148
End bp: 4761383
Gene Length: 4236 bp
Protein Length: 1411 aa
Translation table: 11
GC content: 75%
IMG OID: 641242854
Product: hypothetical protein
Protein accession: YP_001508271
Protein GI: 158315763
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG1196] Chromosome segregation ATPases
TIGRFAM ID: [TIGR02680] conserved hypothetical protein TIGR02680
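
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4757148, 4761383)
print(length)                  # 4236
print(protein_length(length))  # 1411
```

Both results agree with the Gene Length and Protein Length fields listed for this record.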


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.225862
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGC CGACGACCGA ACGCTGGCAG CCGCTGCGCG CCGGGCTGGT CGACGTCTTC 
CTCTACGACG AGGAGGAATT CCACTTCCAC GAGGGCAACC TGCTGCTGCG CGGCAACAAC
GGCACCGGTA AGTCCAAGGT CCTTGCGATG CTGCTGCCGT TCCTGCTCGA CGGTGAGGTG
TCGGCCCACC GCGTCGAACC CGACGGCGAC CCGAAGAAGC GGATGGACTG GAACCTGCTC
ATGGGCGGGC GCCATCCCTA CCCGGAGCGC CTCGGCTACA GCTGGCTGGA GTTCGGCCGG
CTCGACACGT CCGGCGCCGC CGAGTACCGC ACGATCGGGG CCGGGCTGAA GGCGGTCGCC
GGCCGCGGGT CGGTCACCAC CTGGTTCTTC ATCGCCGACG CCCGTGTCGG CGTCGATCTC
GACCTGGTCG GCCCGACCCG GACCGCGCTG ACGAAGGAAC GGCTGCGCGA TGCCCTCGGC
CCGCGCGGCA ACGTCTACGA ACAGGCGTCG CGCTACCGGC GCGCCCTGGA CGAGGCGTTC
TTCGGCCTCG GCGAGGAACG GTACGACGCG CTGGTCAATC TGCTCATCGC GCTCCGCCAG
CCGCAGCTGT CGAAAAAGCC GGACGAGCGG GCCCTGTCGG CGGCCCTGAC CGAGGCGCTG
ACCCCGCTGG ACCAGGCGGT GATCGCGGAC GCGGCCGAGG CGTTCCGGGG TCTCGAGCAG
GAACAGTCGG ACCTGCAGGG GTTCGAGGAG GCGCACGAGG CCGCCACCGC CTTCCTCGGA
CACTATCGCC GCTACGCCCG CATCAGCGCG CGGCGCCGGG CCCGCGAGCC CCGCGCCACC
CAGGCCGCCT ACGAGTCGGC GTCCCGCCAG CTGAACGACG TCCGGGCGGA GCGGGAGCGG
GTCGAGGCCG AGGCGATCCG GGTGGACGGG GAGGACAAGG AGAACGAGGA GCGCATCACC
GCGCTCACCG CCGAACGGGA CACCCTCGCC GCCCGCCCGG AGATGGACCA GGCCCGCGAA
CTCGTCCGGC TGCAGCAGCG GATGTCCGAC CTGGAGCGCG TGGCCAACCG GGCCGGGGCC
GAACTTGCGG CCGTGACGAC CCGGCGCGAC GGCCTCGCCC GCCGGACCGC CGACGCCCGC
TGGCGGGTCG AGGCGACCGT AGCCGCGCTG GAGCGGGCGG CCGATGCCGC CGGCACCGGT
GCCGAGCATG CTGGCGTCGG CGCCGAGCAC CAGGCGCTAC TCGCCCCCCT GCACATACCC
GACGGTCCTC CCGTTGAGCG GGTTGAGGAG GCCTCCTCGG CCGCCCGGCG CGCGGCCGAG
GATGTCACCC GACGCCGCGA GGACGCGGTC AAGCTGGTCG CCGGCCTGAT AACGGCCGAC
CGCCGAGCCC AGGAGCGGGC GGCCGACCGG CGGCGGGAGC GAGACGAGGC CCGGTCGGTC
CTCGATGGGG CGACCGAGGC CGTAGCCGGC GCCGAGGCGG CCCGGGAGAC CGCCGCCGCC
GCGTTCGTCG ACGCGGCGCG TCGCCACCTC GAGGGCGCCG AACAACTCCG CCTGACCACC
CTCGACGACG TCCTCGCCGC GGTGGCCTAC TGGCTGGACG GCTTCGACGG GGACAGTCCG
CTGGTCGCCG CGGTCACGGG GCGCGCCCGC GAGCTGGAGA ACGAGTTCGG CGCCGCCGAC
CAGCGGCTGC GTACCGCCGT CGGCGAGGCA CAGGCGCGGG TGGGCATCCT GACCACCGAG
CGCCGCGCCC TGGCCGAGGG TGCACGGCTG TCCCCGCCGG CCCCGCCGAC CCGCGCTGCC
GACCGGGACG GACGGCCGGG AGCGCCGCTG TGGCAGCTCA TCGAGCCGCG CGAGGCAGGT
CCGACGCCCG CGCCGGCCCT GTCAGCTTCC CCGCCGCTGG GTCCGGCGGA GATCGCCGGG
CTGGAGGCGG CGCTGGAGGC GTCTGGGCTG CTGGACGCCT GGCTCTGCCC CGACGGGCGG
CTGCTCGCCC CCGGCACACA CGACGTGGCC ATCGTCGCCG ACGGCGCCGG TGCGGGCCGT
GTCGGCGACC ACCATCTCGG CGCGGTGTTG TGTCCAGCGG TCGACCGCTC CGACCCGGCC
GCGGCTGCCG TCCCCGACGC GATCGTGGCC GCCGTCCTAG CCCGCGTCGG GTACGTCCCT
GCGGATACGT CCGGCGTGGT GGATCCGGAT GCCGTCGATC GTGTCGGCGA CACCTGGGTC
GCCGCGGACG GGCGGTTCCG CGTCGGGCGG CTGCGCGGCT CCTGGACCAA ACCGGCGGCC
TGCTACCTCG GGCACGCGGC GCGCGAGCTG GCCCGCCGCA ACCGCCTCAC CGACATCGAC
ACCGAGCTGG CCGCCCTCGA CCGCGAGATC GCCGAGCACG GCGCGGCCCG CACGGCCCTG
CGCCAGCGCC GGGCCACGCT CAACGACGAG GTGGCCGGCC TCCCGGCGGA CACCCCGGCC
CGCCACACGT TCGCCACCGC CCGCCAGGCC GACCGCGACC GACGCATGGC CGACGCCTCC
CACCAGACGG CCGTGCGGGC CGCCGAGCAG GCCGAGGACG CGTTCCGGCA GGCCCGGGCC
GAACGCGACG AGACCGCCGC CCAGCTCGGC CTGCCCGTCG CGACCGACGA GCTCGAGGAG
ACCCGCGAGG CGCTCGTCGC CTACCGCGTC GCCCTGTCCG CGCTGTGGCC CGCCGTGCGG
GAACGCATCA GCGCCGCGGC CCAGCTCGCG CAGACCGAGC AGGACCTGGC CGACGCCCAG
GAGCAGGTCG AGCAGCGCGA GATCCAGCAC TCCGGCGCCC GCAGCGAGTA CCTGTCGGCC
CGCACGGCGT TCGAGACGAT GGAACGCACG GTCGGCACCG CCGTCGCCGA GGTGCAGCGC
CAGCTCGCCG CCGCCCAGCG CGGTCTGGAC TCCGCCACCG TCCGCCGCCG GGAACTCGGC
CAGCTCCGGC TGACGATCGC CGAGGCGGTC GGCCAGGCCC GCGGGCAGGA GAGCGGCCTG
ACCAGCCGGG TCGCGGACCT CGGCGAGCAG CGCCGGGACG CGGCCGGCCG GCTGCGCCGC
TTCGCCGCGG GCGGACTGCT GGCCATCGCC CTGCCCGAGC TGACGTTCCC CGACACCGGC
CGGGAATGGG CGCCCGACCC GGCAGTGCGC CTCGCCCGCC AGATCGAGGC GGCGGTCGGT
GAGGTCGACG ACTCCGACGC CGGCTGGGAC CGGGTGCAGA AGCTCATCGC GGCCGAGCAC
GGCACCCTGC GGGAGGTGAT GAGCCGCCAC GGCCACCACG CCGCCGCGAC GCTCACCGAC
GACGGCTGGG TGGTCGAGGC GACGTTCCGC ACCCGCGTGA TGAGCCCCGC GGAGCTGGCC
GACGCGCTCG GCGTCGAGGT CACCGACCGG CGTCGCCTGT TGACGGAACG GGAACGGGCC
ATCCTGGAGA ACCACCTCGT CAGCGAGGTC GCCAGCCATC TCCAGGACCT GATCAGCGAC
GCCGAGCACC AGGTCGAGCG GATGAACGCC GAGCTCGACG AGCGGCCCAA CAGCACCGGC
ATGAAACTGC GCTTCCAGTG GCGCCCCGGC CCGGACGCAC CACCCGGCCT GGGCCCCGCC
CGGGACCGCC TGCTGCGCCA GGTCTCGGAC GCCTGGTCAC CCGCCGACCG GGAAGCCGTC
GGCGCGTTCC TGCAGGCGCA GATCAAGGCC GAGCGCGACC AGGACGCCGG CGGGACGTGG
CTGGAACACC TCGGCCGTGC CCTCGACTAT CGCCGCTGGC ACGTGTTCGC CGTGCAGCGC
TTCTCCGACG GCCAGTGGCG CTCGGCGTCC GGGCCGGCCT CGGGCGGCGA ACGGGCCCTG
GCCGTCACCA TTCCGCTGTT CGCGGCCGCC TCCGGGCACT ACCGCTCGGC CGGCAACCCA
CACGCGCCCC GCCTCGTCAT GCTCGACGAG GCGTTCGCCG GCGTCGACGA CGACGCCCGC
GCGTCCTGCC TTGGCCTGCT CGCCGTCTTC GACATGGACG TCGTCATGAC CAGCGAGCGG
GAATGGGGCT GTTACCCGAC CGTCCCGGGC CTCGCCATCA GCCAGCTGTC CCGGCGGGAC
GGCATCGACG CGGTCCTGGT CACCCGCTGG CGCTGGGACG GCCGCCAGCG CCTCCCCGCC
GGCGTCAAGC GGCCCGCCGG TGTTGCCACT GTTGCCGGGG CGAGGGAACC CGGCGCCACC
GCACCTGCCG AGAACACTCT GTGGGACGAG CCGTGA
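
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a minimal sketch of that step; the helper names `normalize` and `gc_percent` are illustrative assumptions, and only the first line of the listing is used as sample input.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied as displayed.
first_line = "ATGACGGAGC CGACGACCGA ACGCTGGCAG CCGCTGCGCG CCGGGCTGGT CGACGTCTTC"
seq = normalize(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `normalize` and then to `gc_percent` would give a value to compare with the GC content field of the record.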
 
Protein sequence
MTEPTTERWQ PLRAGLVDVF LYDEEEFHFH EGNLLLRGNN GTGKSKVLAM LLPFLLDGEV 
SAHRVEPDGD PKKRMDWNLL MGGRHPYPER LGYSWLEFGR LDTSGAAEYR TIGAGLKAVA
GRGSVTTWFF IADARVGVDL DLVGPTRTAL TKERLRDALG PRGNVYEQAS RYRRALDEAF
FGLGEERYDA LVNLLIALRQ PQLSKKPDER ALSAALTEAL TPLDQAVIAD AAEAFRGLEQ
EQSDLQGFEE AHEAATAFLG HYRRYARISA RRRAREPRAT QAAYESASRQ LNDVRAERER
VEAEAIRVDG EDKENEERIT ALTAERDTLA ARPEMDQARE LVRLQQRMSD LERVANRAGA
ELAAVTTRRD GLARRTADAR WRVEATVAAL ERAADAAGTG AEHAGVGAEH QALLAPLHIP
DGPPVERVEE ASSAARRAAE DVTRRREDAV KLVAGLITAD RRAQERAADR RRERDEARSV
LDGATEAVAG AEAARETAAA AFVDAARRHL EGAEQLRLTT LDDVLAAVAY WLDGFDGDSP
LVAAVTGRAR ELENEFGAAD QRLRTAVGEA QARVGILTTE RRALAEGARL SPPAPPTRAA
DRDGRPGAPL WQLIEPREAG PTPAPALSAS PPLGPAEIAG LEAALEASGL LDAWLCPDGR
LLAPGTHDVA IVADGAGAGR VGDHHLGAVL CPAVDRSDPA AAAVPDAIVA AVLARVGYVP
ADTSGVVDPD AVDRVGDTWV AADGRFRVGR LRGSWTKPAA CYLGHAAREL ARRNRLTDID
TELAALDREI AEHGAARTAL RQRRATLNDE VAGLPADTPA RHTFATARQA DRDRRMADAS
HQTAVRAAEQ AEDAFRQARA ERDETAAQLG LPVATDELEE TREALVAYRV ALSALWPAVR
ERISAAAQLA QTEQDLADAQ EQVEQREIQH SGARSEYLSA RTAFETMERT VGTAVAEVQR
QLAAAQRGLD SATVRRRELG QLRLTIAEAV GQARGQESGL TSRVADLGEQ RRDAAGRLRR
FAAGGLLAIA LPELTFPDTG REWAPDPAVR LARQIEAAVG EVDDSDAGWD RVQKLIAAEH
GTLREVMSRH GHHAAATLTD DGWVVEATFR TRVMSPAELA DALGVEVTDR RRLLTERERA
ILENHLVSEV ASHLQDLISD AEHQVERMNA ELDERPNSTG MKLRFQWRPG PDAPPGLGPA
RDRLLRQVSD AWSPADREAV GAFLQAQIKA ERDQDAGGTW LEHLGRALDY RRWHVFAVQR
FSDGQWRSAS GPASGGERAL AVTIPLFAAA SGHYRSAGNP HAPRLVMLDE AFAGVDDDAR
ASCLGLLAVF DMDVVMTSER EWGCYPTVPG LAISQLSRRD GIDAVLVTRW RWDGRQRLPA
GVKRPAGVAT VAGAREPGAT APAENTLWDE P
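
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the amino acid assignments of table 11 match those of the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order used in the NCBI genetic code listings: T, C, A, G at each position,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a (possibly blocked) CDS, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listing.
print(translate("ATGACGGAGC CGACGACCGA ACGCTGGCAG"))  # MTEPTTERWQ
```

The output matches the first block of the protein sequence listing.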