Gene Franean1_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1203 
Symbol 
ID5669616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1436454 
End bp1439261 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content73% 
IMG OID641240135 
Productcell divisionFtsK/SpoIIIE 
Protein accessionYP_001505563 
Protein GI158313055 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1674] DNA segregation ATPase FtsK/SpoIIIE and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.144076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.470302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACAC GCACGTCCAA CGCGCGCACG CAGGCGCAAC CCAAGCCGCG GACGGCCACG 
GCGCGATCCG GCGGCTCGGG CGGCTCCGGT GGTACCGGCC GGGCTCCGGC CAAGGGCCGC
GCCGGGTCGC CCACCCGAGG CGGTGCCGCC CGTGGGGCGA GCACCCGCCG GCGGACGACC
GCGCCGCGGG CGCCCGTCCA GCTGCTGATC GCCCAGGCGG TGATCAAAGC TGTCTACCAC
GGCTGGATGG CCGCCGCCGG CATGCTCGGC GGTCTGCTGC GCCGTTTCGG CCGCGGGACG
CGGGATCTGC GCATCGACTC CGGGCACCGG CGGGACGGGC TCGGCCTGGC GGCGTTCGGC
GCCGCGATCC TGCTCGGTGC CGCGTTGTGG GGCGGGGCCG GGGGCCCGAT CGGCGCCTTC
GTCGCCGCGG GGGTGGAACT GTTCGTCGGC GTCGGCGCGC TGGCCGTCCC GCTGTTCTGC
CTTGGTGCCT CCTGGCGGCT GGTGCGCTCG CCGTCCGATC CGGACAGCCG TGGCCGGGTC
GTGATCGGCT GGGCCGCGGT CGCCGTCGGG ATCCTCGGCG TGCTGCACAT CAGCCGTGGC
CTGCCGGACT TCGCCGACGG CACGGGCAGG CTGCGCGACG GCGGCGGCAT GCTCGGGCTG
CTCGGGAGCG CGCCGCTGGT CAGCGCCGTG ACCCCCTACG TGGCCGTCCC GCTGCTGCTG
CTCATCGCCT TCTTCGGGCT GCTGGTGGTG ACGGCGACGC CGGTCCGCCA GGTGCCGGAG
AAGCTGCGTG AGCTCGCCGA CCGGCTCACC GTCCCGATCC GGGCGCCCGG CGATTCCTAC
GACGAGTTTG ACGACGAGTA CGACGATCTC GACGCCGACA GCGATGCCGG CGACGGTGTC
ATCGACGAGC TGCTCGAGGA CGAGGCGCCG CCGCGGGGCC GGCGCGGACG CGGTCGCGCC
GGACGCTCCC GGTCCGGCGA GGGCGTCTTC GACCTCGACG CCGAGCTGGC GGGCGCCGGT
CCGGCAGGCG CGCAGGACGA GCCGGCCGGC GCCGGGCGTT CCCGGTCGGG CCGGTCCCGC
CGGGCCGCGG TGCCGGACGC CGAGGCCCTG GACCCTTCGG ATCTGTTCGC CGACGTCGCC
GGGCTGACCG ATGGCGAGGA GGAGGCGGAC GGCGTCGCCC CGGTCGTCGG TGCGGGCGGT
GCGGGCGGCG CGGGCGCCGG TTCGCGGCGG GCGCGTGGCC GGGCGGGCGC CGCCGCGGGC
GACGCGGCCG CGGGGTCCGT CCCGTCCCCG GCCCAGCCGC CCGGCCCGGA TGACATCGAG
TACCTCGTCC CGCCGACGGA CGGTGCCTAC CGGCTGCCGT CGCCGACGCT GCTGCGCTCC
GGCACCCCGC CGAAGGTGCG CTCGGCGGCC ACCGACGGTG TCATCGCCTC CCTGACGGAC
GTCTTCACGC AGTTCAAGGT CGATGCCAAG GTCACCGGGT TCACCCGTGG CCCGACGGTG
ACCCGGTACG AGGTGGAGCT GGGCTCGGCG GTGAAGGTCG AGCGCATCAC CCAGCTCGCG
AAGAACATCG CCTACGCGGT CAAGAGCCCC GACGTGCGGA TCATCAGCCC GATCCCGGGC
AAGAGCGCGG TGGGCATCGA GATCCCGAAC ACCGACCGCG AGCTGGTGTC CCTGGGTGAC
GTCCTGCGCA GCGGTGAGGC GACCGGGAAC CCGCACCCGC TGGTCGTGGC CCTCGGCAAG
GACATCGAGG GCGGCTACGT CCTGGCGAAC CTGGCGAAGA TGCCGCACAT CCTCATCGCG
GGCGCCACTG GGGCGGGTAA GTCGACCTGC ATCAACACGC TCATCACCAG CGTGCTGGCC
CGTGCCACCC CGGACCAGGT CCGGATGGTC CTCGTCGACC CGAAACGGGT CGAGCTGACG
AGCTACCAGG GCATTCCCCA TCTCATCACG CCGATCATCA CCAATCCCAA GAAGGCGGCG
GACGCGCTGC AGTGGGTGGT GAAGGAGATG GAGAACCGCT ACGAGGACCT CGCGGCCTGC
GGTGTTCGCC ATGTCGACGA CTTCAACCGG AAGGTGCGGG CGGGCGAGAT CGTCGCGCCG
CCCGGTTCGG AACGGGTCTA CACCCCGTAT CCGTACATTC TTGCCATCGT CGACGAGCTG
GCGGACCTCA TGATGGTGGC CCCGCGCGAC GTCGAGGACG CGATCTGCCG GATCACGGCC
ATGGCCCGGG CGGTGGGGAT CCACCTGGTG CTGGCGACAC AGCGGCCCTC CGTGGACGTC
GTCACCGGTC TGATCAAGGC GAACGTGCCG TCCCGGCTGG CCTTCGCCAC GGCGTCGCTG
GCCGACAGCC GGACGATCCT GGACCAGGCC GGCGCCGAGA AGCTTGTCGG CCTCGGGGAC
GCGCTGTTCC TGCCGATGGG GGCGAGCAAG CCGGCCCGCA TCCAGGGCGC GTTCGTCTCC
GAGGACGAGA TCGCCGCGAT TGTCGACCAC ACTAAGGAGC AGGCGCAGCC CACCTTCGTG
GTTGACGTCT TCGAGGGCGG CGGCGAGGCG CGCAAGGACA TCGACGAGGA GATCGGCGAC
GACATGGCGC TGTTCCTCCA GGCCGTCGAG CTGGTGGTGA GCACCCAGTT CGGGTCGACG
TCCATGCTGC AGCGCAAGTT GCGGGTCGGG TTCGCGAAGG CCGGCCGCCT GATGGACCTG
ATGGAAAGCC GCGGGATAGT CGGGCCCAGC GAGGGCTCCA AGGCCCGCGA CGTACTCGTG
ATGCCCGACG AGCTGGAAGG ACTTCTCACG ACACTCCGCA GCGGATAA
 
Protein sequence
MATRTSNART QAQPKPRTAT ARSGGSGGSG GTGRAPAKGR AGSPTRGGAA RGASTRRRTT 
APRAPVQLLI AQAVIKAVYH GWMAAAGMLG GLLRRFGRGT RDLRIDSGHR RDGLGLAAFG
AAILLGAALW GGAGGPIGAF VAAGVELFVG VGALAVPLFC LGASWRLVRS PSDPDSRGRV
VIGWAAVAVG ILGVLHISRG LPDFADGTGR LRDGGGMLGL LGSAPLVSAV TPYVAVPLLL
LIAFFGLLVV TATPVRQVPE KLRELADRLT VPIRAPGDSY DEFDDEYDDL DADSDAGDGV
IDELLEDEAP PRGRRGRGRA GRSRSGEGVF DLDAELAGAG PAGAQDEPAG AGRSRSGRSR
RAAVPDAEAL DPSDLFADVA GLTDGEEEAD GVAPVVGAGG AGGAGAGSRR ARGRAGAAAG
DAAAGSVPSP AQPPGPDDIE YLVPPTDGAY RLPSPTLLRS GTPPKVRSAA TDGVIASLTD
VFTQFKVDAK VTGFTRGPTV TRYEVELGSA VKVERITQLA KNIAYAVKSP DVRIISPIPG
KSAVGIEIPN TDRELVSLGD VLRSGEATGN PHPLVVALGK DIEGGYVLAN LAKMPHILIA
GATGAGKSTC INTLITSVLA RATPDQVRMV LVDPKRVELT SYQGIPHLIT PIITNPKKAA
DALQWVVKEM ENRYEDLAAC GVRHVDDFNR KVRAGEIVAP PGSERVYTPY PYILAIVDEL
ADLMMVAPRD VEDAICRITA MARAVGIHLV LATQRPSVDV VTGLIKANVP SRLAFATASL
ADSRTILDQA GAEKLVGLGD ALFLPMGASK PARIQGAFVS EDEIAAIVDH TKEQAQPTFV
VDVFEGGGEA RKDIDEEIGD DMALFLQAVE LVVSTQFGST SMLQRKLRVG FAKAGRLMDL
MESRGIVGPS EGSKARDVLV MPDELEGLLT TLRSG