Gene Francci3_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1424 
SymbolileS 
ID3903155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1714744 
End bp1717953 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content72% 
IMG OID637878761 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_480530 
Protein GI86740130 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.292782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0637422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC CTCTGCGGCA TCCCACCTTC GCTCCGCTGC CCGCCCAGGT GGACCTGCCC 
GCCCTGGAGC GGGAGACGTT GGCCCGCTGG CGCGACACCA AGGTGTTCCA CCGTTCGCTG
GAGGCCACCG CGGACCGCCC GCTGTGGGTT TTCTACGAGG GCCCACCCAC CGCCAACGGC
AGGCCGGGGG CGCACCACGT GGAGGCCAGG GTCTTCAAGG ACCTGTTCCC CCGCTACCGG
ACGATGAAGG GCTACCACGT TCCCCGGCGG GCGGGCTGGG ACTGCCACGG GCTGCCCGTC
GAGCTCGCGG TCGAGAAGGA GCTCGGCTTC ACCAGCAAAA ACGACATCGA GGCGTTCGGC
ATCGCCGAGT TCAACGCCCG GTGCCGCGAG TCGGTGCTGC GCCATGTCGC GGACTTCTCC
GCGATGACCG AGCGGATGGG CTATTGGGTC GACCTCGATG GCGCCTACCG CACCATGGAC
ACCAGCTACG TCGAGAGCGT CTGGTGGTCG CTCAAGCAGA TCTTCGACCA GGGCCTGCTG
GTCGAGGACT TCCGGGTCAC CCCGTACTGC CCGCGCGACG AAACGCCGCT TAGTGACCAC
GAGGTGTCCC AGGGCTACTC GGACGTCGAC GACCCCTCGG TCTACGTCCG TTTCCCGCTC
GTCGCCGACG CCCTGGGCCT CGCCGGGCAG GGTGCGCAGC TGCTGGTATG GACGACGACC
CCGTGGACGC TGGTGTCGAA CACGGCGGTG GCCGTTCATC CGGAGGTGGA GTACGTCCTG
GCCCGCGCGG GGGACGGGGA GTTGTTCGTC GTCGCCGAAC CACTGGTCAC CGCGGCCCTC
GGCGAGGACG CCGAGATCGT CGAGCGGTTC CGCGGGGCGG AGCTGGCCGG TGCCCGCTAC
ACCCGCCCGT TCGAGCTGCT GGCGGCCGAG CGGTTCGCGG CCGGCACCGG CGTCCCGCAC
TCGGTCGTGC TCGCCGACTA CGTGACGACC ACGGATGGCA CGGGCCTGGT CCACCAGGCG
CCAGCGTTCG GCGCGGAGGA CCTCGCAGTC TGCCGGGCGA GCGGGCTGCC GGTGGTGAAC
CCGATCGGGA CGGACGGTCG CTTCCTCGCC GACGTCCCCC TGGTCGGCGG GATGTTCTTC
AAGGATGCCG ACGCGCCGCT GACCGCCGAC CTGCGCGAGC GGGGCCGGCT GTGGCGGGCG
TCGACGTATA CGCACAGTTA TCCGCACTGC TGGCGCTGTC ACACACCGTT GATCTACTAT
CCGCTGCCGT CCTGGTACAT CCGGACCACC GCCATCCGCG ACGAGCTGCT GGCGCAGAAC
GAGCGGACGA CCTGGCACCC CGAGCGGATC AAGACCGGTC GGTACGGCGA GTGGCTGCGC
GGCAACGTCG ACTGGGCGCT GTCCCGCAAC CGATACTGGG GGACACCGCT GCCGGTCTGG
CGCTGCGACG ACGACCCCAC CCACCTGGTG TGCGTCGGGT CACTCGCGGA GCTCTCCGAG
CTCGCCGGGC GGAACCTGGC CGACCTCGAC CCGCACCGCC CGTTCGTCGA CGAGGTCACC
GGGACCTGCC CGACCTGTGG CGGCGCCTCG CACCGGGTGC CCGAGGTGAT CGACGTCTGG
TATGACAGCG GCGCGATGCC GTTCGCCCAG TGGGGCGCCC CGCACCACAA CCTCGCGGCG
TTCACCCGGC AGTACCCGGC GCAGTACATT TGCGAGGCGA TCGACCAGAC CCGCGGCTGG
TTCTACACGA TGATGGCGGT CGGCACGCTG GTGTTCGGCC GCTCCTCCTA CGAGACGGTG
CTCTGTCTCG GCCTGCTCCT GGACGCCGAC GGCCGCAAGA TGAGCAAGCA TCTCGGCAAC
GTGCTCGATC CCTTCGAGCT GTTCGAGCGG CACGGCGCGG ACGCGGTCCG CTGGCTGATG
CTCGCCGGCG GCTCGCCGTG GGCGGACCGC CGGGTGAGTC ACGAGGCGAT CGAGGACATC
GTCCGCAAGG TCCTGCTCAC CTACTGGAAC ACCTCGTCCT TCTTCGCTCT CTATGCCGGG
GCAGCCGGCT GGCGCCCAGG CGCGGACCCG GCCGCGGACC CGCGCGCGAC ACCGCCGGCC
CGACGGCACG TGCTGGACCG CTGGGCGCTG TCCGAGCTCG CGGCCACCGT CGCCGAGGTG
GACGATGCGC TGGAGAACTT CGACTCGCTG CGGGCCGGGC GGCGGATCGC CCGGTTCGTC
GACGACCTGT CCAACTGGTA CGTCCGCCGG TCCCGCCGCC GGTTCTGGGC CGGCGACGCC
GACGCCCTGA GCACCCTGCA CACCTGCCTG GACGCGCTGA CCCGGGTGAT GGCGCCGTTC
ACGCCGTTCC TCACCGACTG GCTGTGGTCA CGGCTGTTCG CCGACGCATC CCCGCGGACC
CCCGACTCGG TGCACCTGGC CGCCTGGCCC GAGCTCCCGG CGGGGCTGCA CACGCCGGAG
CTCTCGGAGC AGATGGATCT CGTCCGGCGG ATCGTGGAAC TCGGCCGCGC CGCCCGGGCC
GCCAGCGGGG TGCGCACCCG CCAGCCGTTG CCGCGGGCGG TCGTCGGCGC GAGTGCCTTT
GACGAGCTCT CCCCCGAGCT GATCGCGCAG ATCACCGAGG AGCTCAACGT GACCACGGTG
GAGCCGGCGA CCTCGGAGGT CGTCGACATC TCGGTGAAGC CGAACTTCCG GGCGCTGGGG
CGGCGCTTCG GCAGGAACAC CAAGGCGGCC GCCGCGGCCA TCGCGGCCGC CGGTCCTCCC
GTCAACGGAC GGCTCACCGT CACCGTTGAC GGGGAGGACG TCGAGCTGTC CGGGGACGAG
CTGATCATCA CGGAGACGCC GCGGCAGGGC TGGGCGGTCA CCGCCGAGTC CGGGCTCTCC
GTCGCCCTCG ACCTGGAGAT CTCCCCGCAG CTCGCCCGCG CCGGGCTCGC CCGCGACGTC
GTCCGGGTGC TCCAGGACGC GCGCAAGGCG GCGGGGCTGG AGATCACCGA CCGGGTGGAC
GTCTCCTGGG CGGCGACGCG CGAGGAGACC GCGCTCGCTC TGCGTACCCA CGGTCAGACG
GTGGCCGAGG AGGTGCTGGC GGTCTCCTTC ACCGAGGCGG CCCGCACGGA GCTACCTGCG
GCGCAGCCGC GCGAGACGGC AGCCCGCTCG GCGGCCGAGG AGCTGGGCCT GGCGTTCACG
CTCACCCGGC ACGAGACGAC CGGCGGCTGA
 
Protein sequence
MSTPLRHPTF APLPAQVDLP ALERETLARW RDTKVFHRSL EATADRPLWV FYEGPPTANG 
RPGAHHVEAR VFKDLFPRYR TMKGYHVPRR AGWDCHGLPV ELAVEKELGF TSKNDIEAFG
IAEFNARCRE SVLRHVADFS AMTERMGYWV DLDGAYRTMD TSYVESVWWS LKQIFDQGLL
VEDFRVTPYC PRDETPLSDH EVSQGYSDVD DPSVYVRFPL VADALGLAGQ GAQLLVWTTT
PWTLVSNTAV AVHPEVEYVL ARAGDGELFV VAEPLVTAAL GEDAEIVERF RGAELAGARY
TRPFELLAAE RFAAGTGVPH SVVLADYVTT TDGTGLVHQA PAFGAEDLAV CRASGLPVVN
PIGTDGRFLA DVPLVGGMFF KDADAPLTAD LRERGRLWRA STYTHSYPHC WRCHTPLIYY
PLPSWYIRTT AIRDELLAQN ERTTWHPERI KTGRYGEWLR GNVDWALSRN RYWGTPLPVW
RCDDDPTHLV CVGSLAELSE LAGRNLADLD PHRPFVDEVT GTCPTCGGAS HRVPEVIDVW
YDSGAMPFAQ WGAPHHNLAA FTRQYPAQYI CEAIDQTRGW FYTMMAVGTL VFGRSSYETV
LCLGLLLDAD GRKMSKHLGN VLDPFELFER HGADAVRWLM LAGGSPWADR RVSHEAIEDI
VRKVLLTYWN TSSFFALYAG AAGWRPGADP AADPRATPPA RRHVLDRWAL SELAATVAEV
DDALENFDSL RAGRRIARFV DDLSNWYVRR SRRRFWAGDA DALSTLHTCL DALTRVMAPF
TPFLTDWLWS RLFADASPRT PDSVHLAAWP ELPAGLHTPE LSEQMDLVRR IVELGRAARA
ASGVRTRQPL PRAVVGASAF DELSPELIAQ ITEELNVTTV EPATSEVVDI SVKPNFRALG
RRFGRNTKAA AAAIAAAGPP VNGRLTVTVD GEDVELSGDE LIITETPRQG WAVTAESGLS
VALDLEISPQ LARAGLARDV VRVLQDARKA AGLEITDRVD VSWAATREET ALALRTHGQT
VAEEVLAVSF TEAARTELPA AQPRETAARS AAEELGLAFT LTRHETTGG