Gene Saro_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2121 
Symbol 
ID3918784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2260265 
End bp2261365 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID640444874 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_497394 
Protein GI87200137 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.328757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTTTC GTTGCGGGAT CGTCGGTCTT CCCAATGTCG GCAAGTCGAC GCTGTTCAAT 
GCGCTGACCG AAACGCAGGC GGCGCAGGCC GCCAACTATC CGTTCTGCAC GATCGAGCCC
AACGTCGGCA ACGTCGGCGT CCCTGATCCC CGGCTCGACA AGCTGGCCGA GATCGCTGGC
AGCCAGAAGA TCATCCCCAC CCAGCTCGGC TTCGTCGACA TCGCCGGCCT CGTGCGCGGG
GCATCGAAGG GCGAAGGCCT CGGCAACCAG TTCCTCGGCA ACATCCGCGA AGTGGACGCC
ATCGTCCACG TCCTGCGCTG TTTCGAGAAC GACGACATCC AGCACGTCGA CAACAAGGTC
GATCCTATCT CCGACGCCGA GACGGTCGAG ACCGAACTGA TGCTGTCGGA CCTCGAAAGC
CTCGAGAAGC GCGTTCCCGC CGCCGAAAAG AAGGCCAAGG CGGGCGACAA GGAATCGAAG
ATCATCGCCT CGGTCCTCGG CCAGGCGCTC GAACTTCTGC GCGACGGCAA GCCCGCTCGC
CTCACCCAGC CGAAGGATGA CGAGGAAGCG CGCGTCTTCA AGCAGGCCCA GCTCCTCACC
GCCAAGCCCG TTCTCTACGT CTGCAACGTC GAGGAAGAAA GCGCGGCGAA CGGCAACGCC
TTCTCCGCCC GCGTCTTCGA AAAGGCCAAG GCCGAAGGCG CCAACGCGGT GATCGTTTCG
GCCGCGATCG AATCCGAACT CGTCGGCATG GACCCCGAGG AACGCTCCGT TTTCCTCGAG
GAAATGGGCC TGCACGAAAC CGGCCTCGCC CGCGTGATCC GCGCCGGCTA CGAGCTGCTT
CACCTCATCA CCTTCTTCAC CGTCGGCCCC AAGGAAGCGC GTGCATGGAC CGTGCACCTT
GGCGCAAAGG CGCCCGAAGC CGCCGGTGAG ATCCACTCCG ACATGCAGCG CGGCTTCATC
CGCGCCGAAA CCATCGCCTA CGACGATTTC GTCAGCCTCG GCGGCGAAAG CGCCGCGCGC
GATGCCGGCA AGCTGCGCCA GGAAGGCAAG GAGTACGTGG TGAAGGACGG CGACGTCCTC
CACTTCAAGT TCAACGTCTG A
 
Protein sequence
MGFRCGIVGL PNVGKSTLFN ALTETQAAQA ANYPFCTIEP NVGNVGVPDP RLDKLAEIAG 
SQKIIPTQLG FVDIAGLVRG ASKGEGLGNQ FLGNIREVDA IVHVLRCFEN DDIQHVDNKV
DPISDAETVE TELMLSDLES LEKRVPAAEK KAKAGDKESK IIASVLGQAL ELLRDGKPAR
LTQPKDDEEA RVFKQAQLLT AKPVLYVCNV EEESAANGNA FSARVFEKAK AEGANAVIVS
AAIESELVGM DPEERSVFLE EMGLHETGLA RVIRAGYELL HLITFFTVGP KEARAWTVHL
GAKAPEAAGE IHSDMQRGFI RAETIAYDDF VSLGGESAAR DAGKLRQEGK EYVVKDGDVL
HFKFNV