Gene Saro_1021 details

Gene Information

Locus tag: Saro_1021
Symbol:
ID: 3915803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1059940
End bp: 1061142
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 68%
IMG OID: 640443755
Product: polynucleotide adenylyltransferase region
Protein accession: YP_496300
Protein GI: 87199043
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID:
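
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive (the listed gene length is consistent with that reading) and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1059940
end_bp = 1061142
gene_length_bp = 1203
protein_length_aa = 400

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
codons = gene_length_bp // 3
coords_consistent = (span == gene_length_bp) and (gene_length_bp % 3 == 0)
protein_consistent = (codons - 1 == protein_length_aa)

print(span, codons, coords_consistent, protein_consistent)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.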


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGACC GGTTGCCCAA GGCCGACTGG ACCGCCCGGG ACGACCTCGC GGCCCTCGTA 
GCTGCGCTCG ATCCAGACTC GCAGGGCAAC TGCCGCTGGG TGGGCGGCGT CGTGCGAGAC
ACGATCCTGG GCCTGCCCGC CAAGGACATC GACATGGCAA CGACGTTGCC GCCGGACGAG
ACTGCCGCGC GACTTTCCAC CGCAGGCATC AAATCCGTGC CGACCGGGAT CGCCCACGGC
ACGGTGACCG CCGTGCTGCC GGGCGGACCG GTGGAGATCA CCACGTTGCG CCGCGATGTC
AGCACCGACG GACGCCATGC CACGGTGAGC TTTTCGACCG ACTGGCGCGA TGACGCCGCG
CGCCGCGATT TCACGATCAA CGCGCTCTAT GCCGACCCGC GCACTCTGGA GGTGTTCGAC
TACCATGGCG GCCTGGCCGA TCTTGCCGCG CGGCGCGTGC GCTTCATCGG CGATGCCCGC
CAGCGCATCC GCGAAGACTA CCTGCGCATC CTTCGCTACT TCCGCTTCCA GGCACGGTTC
GGGTCGATAC CCGCCGATAC CGAGGCGGAA AGCGCCGTTT CGGAACTGGC TGCGGGCCTG
AAGGGGCTTT CCCGCGAACG CGTGGGCTGG GAGCTGATGA ACCTTCTCGG CCTGCCCGAC
CCGGCACCAA CCATGCGCCG CATGGCCGAA CTGGGCGTTT TGGCGCAGGT GCTGCCGGAG
ACTGCCGCAG ATGGCCTTGA TGCGCTTGAG GCGCTGATCG GCAATGAGCA GTCCGCCAGA
GCAGATGCGG TGGCGATCCG GCGGCTAGCA GCGCTGCTTC CCGCCGACCG GCATTTGGCG
GAACTGGTCG CGGCGCGATT GAGGCTTTCG GCCGCACAGC GCAAGCGCCT CGCAACGGTC
GCGGCGCCTT CCGGTGCGGA CGGCGATGCG CGGGCCCTTG CCTACCGCAT CGGGATCGAG
GAAGCACGCG ACCGACTGCT GATTGCCGGA AGGCCCGTTG CCGCACTTGA TGGATGGCAG
GCCCCCGCCT TGCCACTGAA GGGCGGAGAG ATCGTGGCCC GCGGGGTCAC GGCCGGGCCG
GAGGTTGCGC GAACGCTGCG TGCGGTCGAA GACCAGTGGA TCAGCGAAGG CTTCCCTGAC
GGCGCAAGAG TTTCCGCAAT CCTCGATCAA GTCCTGGGAC ATACCAGCGC ACCACGCGAC
TGA
 
Protein sequence
MTDRLPKADW TARDDLAALV AALDPDSQGN CRWVGGVVRD TILGLPAKDI DMATTLPPDE 
TAARLSTAGI KSVPTGIAHG TVTAVLPGGP VEITTLRRDV STDGRHATVS FSTDWRDDAA
RRDFTINALY ADPRTLEVFD YHGGLADLAA RRVRFIGDAR QRIREDYLRI LRYFRFQARF
GSIPADTEAE SAVSELAAGL KGLSRERVGW ELMNLLGLPD PAPTMRRMAE LGVLAQVLPE
TAADGLDALE ALIGNEQSAR ADAVAIRRLA ALLPADRHLA ELVAARLRLS AAQRKRLATV
AAPSGADGDA RALAYRIGIE EARDRLLIAG RPVAALDGWQ APALPLKGGE IVARGVTAGP
EVARTLRAVE DQWISEGFPD GARVSAILDQ VLGHTSAPRD
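
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 nucleotides listed above. This is a minimal sketch in plain Python, not the database's own pipeline. The names `CODON_TABLE` and `translate` are hypothetical helpers. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons.

```python
# Build the standard codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, len(dna) - 2, 3)
    )

# First line of the gene sequence as listed in the record (60 nt).
first_60 = "ATGACTGACC GGTTGCCCAA GGCCGACTGG ACCGCCCGGG ACGACCTCGC GGCCCTCGTA"

print(translate(first_60))
```

The result can be compared with the first 20 residues of the protein sequence, MTDRLPKADW TARDDLAALV. The final codon of the gene, TGA, is a stop codon and contributes no residue.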