Gene Meso_4035 details


Gene Information

Locus tag: Meso_4035
Symbol:
ID: 4182833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chelativorans sp. BNC1
Kingdom: Bacteria
Replicon accession: NC_008254
Strand:
Start bp: 4344265
End bp: 4345332
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 59%
IMG OID: 638069931
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_676567
Protein GI: 110636359
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
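
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API:

```python
# Values copied from the record above.
START_BP = 4344265
END_BP = 4345332


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1068 355
```

Both results agree with the Gene Length and Protein Length fields of the record.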


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAT TCAAGCAGCT CGTTTTTTCC GGAGTGCAGC CCACCGGCAA TCTCCATCTC 
GGCAATTATC TAGGCGCATT GCAAAAGTTC GTCGCCCTGC AGGATAGCTA CGAATGCATC
TACTGCGTTG TGGATATGCA TTCACTGACG GCCACGCTGG TTCACGACGA TCTCATCGAC
CAAACGCGGG GGATCGCCGC GGCCTATCTC GCCTGCGGAC TCGATCCCAA GAAAAACATC
ATCTTCAACC AAGCCCGCGT GCCGCAACAT GCGGAGCTTG CCTGGATATT CAACTGCGTC
GCCCGCATTG GCTGGATGAA CCGCATGACG CAGTTCAAGG ACAAGGCAGG CAAGGACCGC
GAGAATGCCT CGCTCGGCCT TCTCGCCTAT CCGAGCCTGA TGGCCGCTGA CATACTTCTT
TACCGCGCCA CCCATGTGCC CGTCGGCGAG GACCAGAAGC AGCATCTGGA ATTGACCCGC
GACATCGCCC AGAAGTTCAA TAACGACTTC TCCGAGAAAA TCGCAAATCT CGGCTACGGC
GTCGAAATGA CGGTGGGCGA GGAGAAGGTG AACGGCTTTT TCCCGCTGAC GGAGCCGCTT
ATCGAGGGGC CTGCGCCGCG CGTGATGAGC CTGCGCGACG GCTCCAAGAA GATGTCTAAA
TCCGATCCAT CGGACCTCTC GCGCATCAAT CTCCTGGACG ATGCTGACAC GATCGCGCGC
AAGATCAGGA AGGCAAAGAC TGATCCGGAA CCGTTGCCGG GCGATGTCGA AGGTTTCGCC
GGACGTCCGG AGGCCGATAA TCTGGTGGGC ATCTACGCCG CACTTGCCGG CATGCCGCGG
GAAAACGTAA TCGCCGAGTT TGGCGGACGC CAGTTCTCCG ATTTCAAACC CGCACTTGCG
GATCTTGCCG TGGAGAAGCT CGCGCCTATC GGCGGGGAGA TGCGACGCCT CAAGGCCGAT
CCGGCCTACA TCGACAATGT TCTCAGGGAT GGCGGCGAGC GTGCGTCCGT CAAGGCTGAG
GCGACCATGA AGCATGTGCA CGAAATTATC GGTCTGCTGG TGAACTGA
 
Protein sequence
MSEFKQLVFS GVQPTGNLHL GNYLGALQKF VALQDSYECI YCVVDMHSLT ATLVHDDLID 
QTRGIAAAYL ACGLDPKKNI IFNQARVPQH AELAWIFNCV ARIGWMNRMT QFKDKAGKDR
ENASLGLLAY PSLMAADILL YRATHVPVGE DQKQHLELTR DIAQKFNNDF SEKIANLGYG
VEMTVGEEKV NGFFPLTEPL IEGPAPRVMS LRDGSKKMSK SDPSDLSRIN LLDDADTIAR
KIRKAKTDPE PLPGDVEGFA GRPEADNLVG IYAALAGMPR ENVIAEFGGR QFSDFKPALA
DLAVEKLAPI GGEMRRLKAD PAYIDNVLRD GGERASVKAE ATMKHVHEII GLLVN
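
The two sequences above can be checked against each other by translating the gene sequence. The sketch below is illustrative only: it assumes the space-separated 10-character blocks are purely a display convention, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers, and only the first and last rows of the gene sequence are included to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; shared by table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First and last rows of the gene sequence, copied from the record.
# Both rows start at offsets divisible by 3 (0 and 1020), so they are in frame.
FIRST_ROW = "ATGAGTGAAT TCAAGCAGCT CGTTTTTTCC GGAGTGCAGC CCACCGGCAA TCTCCATCTC"
LAST_ROW = "GCGACCATGA AGCATGTGCA CGAAATTATC GGTCTGCTGG TGAACTGA"


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate(FIRST_ROW))  # prints: MSEFKQLVFSGVQPTGNLHL
print(translate(LAST_ROW))   # prints: ATMKHVHEIIGLLVN
```

The two outputs match the first 20 and last 15 residues of the protein sequence listed above. Applying `gc_percent` to the full 1068 bp gene sequence, rather than to a single row, is what would be comparable with the GC content field of the record.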