Gene Cag_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0072 
Symbol 
ID3746406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp79762 
End bp80826 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content49% 
IMG OID637772598 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_378394 
Protein GI78188056 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAG CACGAATTTT AAGCGGCATG CGACCTACCG GCAAGCTCCA TCTTGGACAC 
TACACCGGTG CCCTTGAAAA TTGGGTTGCC CAACAAAATC AATGCTCTGC GGATGGCAAC
CGCGCTTACG ACACCTATTT TCTGATTGCC GATTACCATA CCTTAACCAC CTCGCTTTCA
ACCGATGACG TGTATGCTCA TTCGCTTGAT ATGCTGGTGG ATTGGCTTGC CGCTGGCATT
GATCCCGAAA AAAGTCCTAT GTTTCGCCAA TCGCAGGTAA AGCAACATGC CGAGCTTTTT
TTGCTTTTCT CTATGCTTAT TACCTCCGCA CGCTTGGAGC GCAATCCAAC GTTAAAAGAG
CAAGTGCGCG ACCTTCATAT GGATTCAATG AGCTACGGGC ATCTTGGCTA TCCTGTTTTG
CAATCAGCAG ATATTTTGCT CTACAAGGCA AACGTGGTGC CTGTTGGTGA GGATCAAATT
CCCCATGTGG AAATTACCCG CGAAATTGCT CGCAAGTTTA ACAACCACTT TCCTCATCCG
CTTTACGGCA ACGTCTTTGC TGAACCTGAA CCAAAAATCA CCAAATTTGC ACGCCTTGCA
GGGCTTGACG GAAAAGCAAA AATGTCGAAA TCACTCGGCA ACACCATTTT CCTCTCCGAT
CCACCCGACG AAGTGCTCCG CAAAATGCGC ACGGCGGTTA CCGATACCCA AAAAGTGCGC
AAAAACGATG CAGGACGCCC CGAAGTGTGC ACCGTTTTTA GTTACCACAA ACGCTTTTCC
ACGCCTGAGC AGTGCGAAGA AATTGCGGCT GGCTGCCAAA GCGGAGCGCT TGGTTGCGTT
GATTGTAAAA AGCAGTGTGC CGCAAACATT TCTGCTGAAC TTGCACCGCT CTTAGAACGC
CGCACATACT ACGAAGCTCG CATGGATGAG GTGAAAAATA TTTTATTTGA GGGAGAAGCA
AAAGCGCGCA CCGTTGCCGA ACAGACCATG CAAGAGGTAC GCACCGCAAT GAAGCTTGGT
GAAGCAAATT GCAGCGCCAC TTTTTTCAAC ACTTCATGTT CATAG
 
Protein sequence
MATARILSGM RPTGKLHLGH YTGALENWVA QQNQCSADGN RAYDTYFLIA DYHTLTTSLS 
TDDVYAHSLD MLVDWLAAGI DPEKSPMFRQ SQVKQHAELF LLFSMLITSA RLERNPTLKE
QVRDLHMDSM SYGHLGYPVL QSADILLYKA NVVPVGEDQI PHVEITREIA RKFNNHFPHP
LYGNVFAEPE PKITKFARLA GLDGKAKMSK SLGNTIFLSD PPDEVLRKMR TAVTDTQKVR
KNDAGRPEVC TVFSYHKRFS TPEQCEEIAA GCQSGALGCV DCKKQCAANI SAELAPLLER
RTYYEARMDE VKNILFEGEA KARTVAEQTM QEVRTAMKLG EANCSATFFN TSCS