Gene Cagg_0718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0718 
Symbol 
ID7266970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp890092 
End bp893292 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content61% 
IMG OID643565569 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_002462078 
Protein GI219847645 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAGC CGGTTGATCC GAATGTCAAA TTTCCCCAAC TCGAAGAAGA AGTGCTGGCG 
TGGTGGGATG CCAATGACGT GGTGGCGAAG TCGTTGGCCG CTGGCGAAAA GCCGTTTGTC
TTCTATGAGG GGCCGCCGAC CGCGAATGGC CGTCCCGGTC TTCACCATAC CATCTCGCGC
AGCTTCAAAG ATGTGATTTT GCGCTATCGC TCGATGCAGG GCTATCGTAT CATCGGTCGC
CGCGAAGGCT GGGACACCCA CGGCCTACCG GTCGAAATTG AGATCGAGAA GAAACTCGGT
TTTAGCGGTA AACCTGACAT CGAGCGGTTT GGTATCGCGG AATTCAATCG CCTCTGCCGC
GAGAGCGTCT GGGAGTACAT CCAGGAGTGG AAGGCGTTTA CCAAGCGGAT CGCGTTCTGG
CTGAGCGAAG ATGCGTATAT CACCTACGAG AACGACTATA TCGAGTCGAC ATGGTGGATC
TTCCGTCAGT TGTGGGATCG CGGCCTGCTC TTCCGCGACT ACAAGGTCAC GATGCACTGC
CCGCGCTGCG GCACCAGCCT CAGCGACCAC GAGGTCAGCC TTGGCGCGCG CGATGATGTC
GATGATCCGA GCGTCTATAT CAAGTTCCGC GTGAAAGGCA CGACGTTACC ACCACCCGCA
GTTGATGGTA CACTCGAAGG GGCTTTTCTG GTGGCGTGGA CGACGACGCC ATGGACGTTG
CCGGCCAACG TGGCGCTGGC CGTCAAGCAC GACGCCGAGT ATGTCGAGGT TGAGCATAAC
GGTGAGCGAT TGGTGATGGC GGCGACACTG ATCAATCAGG TTTTGCCCGC CGAGAGCTTC
ACTGTGCTGC GTCGGTTCCG CGGTAACGAT CTGGTCGGCC TCCGCTACGA GCCGCTCTTC
CGCGGCGTGC CCGGCGCCGG CGATACCGTC GATTGGGAGA CGGCCTACCG CGTGATCGCC
GATGAGATCG TCAGCCTTGA CGATGGTACC GGCATTGTCC ATATCGCACC GGCCTACGGC
GACCTCGAAG TCGGGCGCAA GCACGGCTTA CCTACCCTGT TTTCGGTTGG TCTCGACGGT
CGGGTATTGC CGGAGTTTGC CGATCTCGGT TTTGCCGGTA AGTTCTTCAA AGAGGCCGAT
CCCGATATTA CCCGCAACCT CAAAGCGCGT GGCCTGCTCT TGCGCAGCGG GCGCGTGCGC
CACTACTACC CGTTCTGCTG GCGCTGCGGC ACGCCACTCC TGTACTACGC CAAGCGTTCG
TGGTACATCC GCACCACTGC CTTCAAAGCC GATCTGGTCG CCAACAACCA GCAGATTCAC
TGGGTGCCCG AACACATCCG CGACGGCCGG TTCGGCAACT GGCTGGAAAA TAACATCGAC
TGGGCGATCA GCCGCGAGCG GTATTGGGGC ACGCCAATCC CGATCTGGAC GAACGCTGAC
GGCTCGCACA TGGTCTGCAT CGGTTCGCTG GCCGAGCTGG AAGAAAAGGT GGGCCGTTCG
CTGCGCGATC TCGACCTGCA CCGGCCCTAC ATCGACGAGG TCGTCTGGGA AGACCCCGAC
CATGGCTTGA TGCGCCGGAT TCCCGACGTT GCCGACTGCT GGTTCGACAG CGGTTCGATG
CCGGTGGCCC AGTGGCACTA CCCGTTTGAG AATCGCGACG TCTTCGAGAT GTCCCATCCC
GCCGACTATA TCTGTGAGGC GGTAGACCAG ACCCGCGGCT GGTTCTACAC CTTGCACGCG
GTCAGTACCC TGCTCTTCGA CCGGCCCGCG TTCAAGAACG TGATCTGTCT CGGCCACATT
CTCGACAAAG ACGGCCAAAA GATGAGCAAG AGCCGGGGCA ATGTGATCGA ACCGCAAGAA
GTGATCAACG CTTACGGCGT TGATGCCCTG CGCTGGTACC TGTTCACCGC TGCGCCTCCC
GGTAATGCCC GCCGCTTCAG CATGGATCTG GTCAGCGAGA GCATGCGCAA GTTCCTGCTC
ACGCTCTGGA ACACGTATGC CTTCTTCGTC ACCTACGCCA ACCTCGACCG GTGGCAACCT
AACAGCGGGC GCACCGCCGA ATTGCAGCCA ATCGACCGCT GGGCGCTGGC AGCGCTCAAC
CAGCTTGTGC AAACGGCAAC CGCTGCCTTC GAGGAGTACG ATGTCTACTC GGCGGCGAAC
GCAATCGAGC ACTTCGTCGA TGAGTTGTCG AACTGGTACG TGCGTCGCAA CCGTCGTCGC
TTCTGGAAGA GCGAAGGCGA CGCCGATAAG GAAGCGGCCT ACCAGACCCT GTATACCTGT
CTGGTCACCG TTGCTAAGCT GGCAGCACCG TTTATCCCGT TTGTCAGCGA GGAGATCTAT
CGCAATCTGG TCGCCGAACG GGACGCTTCC GCTCCCGAAA GTGTTCATCT CGCTCGTTGG
CCAGAAGTCG ATCAAGCACT GCTCGATGAC CAACTGGTGG CCGATACCGA AGCGTTGCTG
ACGGCAGTAT CACTCGGACG AGCAGCACGT AAACAGGCGA ATATCAAGGT TCGTCAGCCC
CTCAGTGAGC TGTGGCTGCG AGCTAGCACC CCGGCGCTGC TCAACGGTGT CCGCCGCTTC
GAGGCTGAGC TGCGCGATGA GCTGAACGTC AAAGTGGTGC GCTACCTCGA CGCTAACAGC
GCCGTGGTCG AGTACCGCCT GAAGCCAAAC CTACGCCTCG TTGGCAAGAA GTTCGGTAAA
TTGGTGCCGG CGATCACCAC CGCACTGCGC GATCTCACCG GTGATGACGC GCGAGCCGCA
GCGCAGGCCG TTGAAGCCGG TCAGCCGGTA CACCTATCGG TTGACGGCCA AACGATTGAG
CTGCTGGCCG AAGAGGTGCT GGTCGAGAGC AGTGCGCCCG CAGGGTACGC GGTGGCCGAA
GCCGACGGGA TGCTGGTCGC GTTGAACACG ACGGTGACGG AAGAACTACG GTTGGAAGGC
GCGGCCCGCG ATCTGGTCCG CTACGTGCAA GATGCGCGTA AGAGTGCTGG ACTGGCAATC
AGCGACCGCA TCCGGCTCTT CCTCAGCAGC ACCGACGAAG CCGCACTGCT GGCGGCCACC
CTTGCACAGC ACGGTGCGTA CATCCAAAAC GAGACCCTCG CCGTCGAGCT GACGGTCAGC
GCACCGCCGG CCGGTGCGCA CGTCGAGACC GATGAGTTCG GTGACGGCGA GATAACGATC
GGGGTTGTGA AGGCCGGCTG A
 
Protein sequence
MFKPVDPNVK FPQLEEEVLA WWDANDVVAK SLAAGEKPFV FYEGPPTANG RPGLHHTISR 
SFKDVILRYR SMQGYRIIGR REGWDTHGLP VEIEIEKKLG FSGKPDIERF GIAEFNRLCR
ESVWEYIQEW KAFTKRIAFW LSEDAYITYE NDYIESTWWI FRQLWDRGLL FRDYKVTMHC
PRCGTSLSDH EVSLGARDDV DDPSVYIKFR VKGTTLPPPA VDGTLEGAFL VAWTTTPWTL
PANVALAVKH DAEYVEVEHN GERLVMAATL INQVLPAESF TVLRRFRGND LVGLRYEPLF
RGVPGAGDTV DWETAYRVIA DEIVSLDDGT GIVHIAPAYG DLEVGRKHGL PTLFSVGLDG
RVLPEFADLG FAGKFFKEAD PDITRNLKAR GLLLRSGRVR HYYPFCWRCG TPLLYYAKRS
WYIRTTAFKA DLVANNQQIH WVPEHIRDGR FGNWLENNID WAISRERYWG TPIPIWTNAD
GSHMVCIGSL AELEEKVGRS LRDLDLHRPY IDEVVWEDPD HGLMRRIPDV ADCWFDSGSM
PVAQWHYPFE NRDVFEMSHP ADYICEAVDQ TRGWFYTLHA VSTLLFDRPA FKNVICLGHI
LDKDGQKMSK SRGNVIEPQE VINAYGVDAL RWYLFTAAPP GNARRFSMDL VSESMRKFLL
TLWNTYAFFV TYANLDRWQP NSGRTAELQP IDRWALAALN QLVQTATAAF EEYDVYSAAN
AIEHFVDELS NWYVRRNRRR FWKSEGDADK EAAYQTLYTC LVTVAKLAAP FIPFVSEEIY
RNLVAERDAS APESVHLARW PEVDQALLDD QLVADTEALL TAVSLGRAAR KQANIKVRQP
LSELWLRAST PALLNGVRRF EAELRDELNV KVVRYLDANS AVVEYRLKPN LRLVGKKFGK
LVPAITTALR DLTGDDARAA AQAVEAGQPV HLSVDGQTIE LLAEEVLVES SAPAGYAVAE
ADGMLVALNT TVTEELRLEG AARDLVRYVQ DARKSAGLAI SDRIRLFLSS TDEAALLAAT
LAQHGAYIQN ETLAVELTVS APPAGAHVET DEFGDGEITI GVVKAG