Gene Saro_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1939 
Symbol 
ID3917162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2053996 
End bp2055561 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content59% 
IMG OID640444685 
Productmethionyl-tRNA synthetase 
Protein accessionYP_497213 
Protein GI87199956 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0143] Methionyl-tRNA synthetase 
TIGRFAM ID[TIGR00398] methionyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAAC CCTTCTACAT CACCACTGCC ATCTCCTACC CCAACGGCAA GCCGCATATC 
GGCCATGCCT ACGAGGCAAT TGCCGCCGAT GTTATCGCGC GCTTCCAACG GGCCATGGGA
CGTGATGTGC GTTTCCAGAC GGGCACGGAC GAACACGGGT TGAAGATGGC GCAGAAGGCC
CGTGAACTCG GCATAACGCC ACGCGAGCTT TCCGACGAAA TGTCATCTTA TTTCATTAAG
ATGTGCGACG AACTTAATGT TTCGTACGAC GTTTTCATTC GCACCACCGA GGAGCGGCAT
CACGCCTCGA CGCAGGAACT CTGGCGTCGC ATGGAGGCCA ATGGTGATCT CTATCTCGAC
CGCTACGAAG GCTGGTATTC GGTCCGGGAC GAGGCTTTTT ACGATGAGAG CGAACTGGTA
GCGGGCGAGG GCGGGGAGAA GCTGTCGCCC CAGGGTACCC CGGTGGATTG GACAGTCGAG
GAAAGCTGGT TCTTCCGGCT TTCGAAATAT GCCGAACCGC TACTGAAGCT CTACGAGGAG
AATCCCGGGT TCATCCAGCC CGACAGCCGC CGCAATGAAG TGATGCGCTT CGTCGAGGGT
GGACTGCGTG ACCTTTCGGT TTCTCGCACC AGCTTCGACT GGGGTGTGAA GGTTCCGGGG
TGCGATGGCC ATGTAATGTA CGTTTGGGTC GATGCTCTCA CCAACTATAT CACTGGGCTC
GGTTTTCCGG ACGAAAACGG CGACTTTGCA AAGTATTGGC CGGCGAACCT GCACCTGATC
GGCAAGGATA TCGTCCGCTT CCACACTGTC TACTGGCCGG CCTTCCTGAT GAGCGCGGAC
ATCGCGCTGC CGCGGCAAGT CTTCGGGCAC GGATTCCTGC TCAACCGTGG CCAGAAGGAA
TCGAAGTCGC TCGGTAACGT CACCGATCCA CTCGACCTTG CCGACCGGTT CGGGGTAGAT
CCGCTTCGTT ACTTCCTGAT GCGGGAAGTA GCCTTCGGTC AGGACGGATC CTATTCGGCC
GAGGCCATTG TGACGCGATG CAATGCAGAG CTAGCAAACA GCTACGGCAA TCTCGTTCAG
CGCACACTAT CCATGATTTT CAAAAACATG GGCGGCAATC TTGAGACATT TCATAGCAAT
GTGGGGGACG ACGAACTGCT GGCTACGGTG TTCAATGCGT GCCGTGAGGA ACTGCCGCGC
GAGTTTTCCG CGCTGAACTT CTCGGCCGGG ATCGAAGCTT GGATGCGTGC GGTCTTTGCC
TGCAACGCCT ATGTCGACGA ACAGGCGCCG TGGGCGCTGC GCAAGACCGA TCCCGAGCGC
ATGAAGGCCG TGCTGCTGAC GCTGTTCATA GCGATCCGCG ACCTGACCGT GGCGATTTCA
CCCGTCGTTC CGGCTGCCGC AGCCAAGGTG CTGGACCAGC TCGGCATTCC AAGGGAAGCG
CGGGGTTTCG ATGCGTTGAC TGATGCGGAC TGGTACATGG CACGCGTGGC AACCGGAGAG
AGGCTTGCGC AGCCCATGCC TGCATTCCCC CGCCTTGAAC TGCCGGAGGA GGAAGGATCG
GCATGA
 
Protein sequence
MGEPFYITTA ISYPNGKPHI GHAYEAIAAD VIARFQRAMG RDVRFQTGTD EHGLKMAQKA 
RELGITPREL SDEMSSYFIK MCDELNVSYD VFIRTTEERH HASTQELWRR MEANGDLYLD
RYEGWYSVRD EAFYDESELV AGEGGEKLSP QGTPVDWTVE ESWFFRLSKY AEPLLKLYEE
NPGFIQPDSR RNEVMRFVEG GLRDLSVSRT SFDWGVKVPG CDGHVMYVWV DALTNYITGL
GFPDENGDFA KYWPANLHLI GKDIVRFHTV YWPAFLMSAD IALPRQVFGH GFLLNRGQKE
SKSLGNVTDP LDLADRFGVD PLRYFLMREV AFGQDGSYSA EAIVTRCNAE LANSYGNLVQ
RTLSMIFKNM GGNLETFHSN VGDDELLATV FNACREELPR EFSALNFSAG IEAWMRAVFA
CNAYVDEQAP WALRKTDPER MKAVLLTLFI AIRDLTVAIS PVVPAAAAKV LDQLGIPREA
RGFDALTDAD WYMARVATGE RLAQPMPAFP RLELPEEEGS A