Gene Saro_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3304 
Symbol 
ID3915951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3522575 
End bp3523831 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content66% 
IMG OID640446089 
Productaspartate kinase 
Protein accessionYP_498573 
Protein GI87201316 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0250326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCCGCA TCGTGATGAA ATTCGGCGGC ACTTCGATGG CCGGCACCGA GCGCATCCGC 
CGCGTGGCGC GCATCGTGCA GCGCCAGCAG GCGGCAGGGC ACGAGGTGGC GGTGGTCGTC
TCTGCCATGG CGGGCGAGAC CGACCGCCTC GTCAACTTCT GCCGCGAGGC GAACCCGCTC
TACGATCCGG CCGAATACGA CGTGGTCGTG GCCAGCGGCG AGCAAGTGAC GTCGGGTCTC
CTGGCGATGC ATCTCCAGGC GCTGGGCTGC AAGGCGCGCT CGTGGCTGGG ATGGCAGCTG
CCGATCCACA CCGACGACGC GCATTCGAAG GCGCGCATCG AAGGCATCGA TTCGGAAGCG
CTGCTTGCCA GCATGGGCGC GGGCGAGATC GCGGTGATCC CGGGATTCCA GGGCCTTACC
GCCGACAACC GCGTGACCAC CCTGGGCCGT GGCGGTTCCG ACACTTCGGC CGTGGCAGTG
GCGGCGGCGG TCAAGGCCGA CCGTTGCGAC ATCTACACCG ACGTGGACGG GGTCTACACC
ACCGATCCGC GCATCGTGGC CAAGGCCCGC AAGCTCAAGA ACGTGACCTA CGAGGAAATG
CTCGAACTGG CCTCGGTCGG CTCGAAGGTC CTGCAGACCC GCTCGGTCAG CCTTGCCATG
AAGGAAGGCG TGCGCGTGCA GGTGCTTTCC TCATTCATCG ACGACGACGC CCCGGCGGCG
GACACGATCC CCGGCACGAT GATCGTTTCC GACGAGGAAC TTGAAGGATT GGATATGGAA
CGCCAGCTGA TCACCGGCAT CGCCGCCGAC AAGAACGAGG CGAAAGTTAC CCTGACCCGC
ATCGCGGACC GCCCCGGCGC GGTCGCGGCG ATCTTCGGCC CGCTGGCCGC GGCGAACATC
AACGTCGACA TGATCATCCA GAACATCGCC AAGGACAAGG GCGAGACCGA CGTCACCTTC
ACGGTTCCGA TCTCGGACCT CGCCCGTACC CAGGCGCTGC TTGAAGAGCG CAAGGACACG
ATCGGCTACT ACCGCATGCT GGCCAACAGC AAGGTCGCCA AGATCAGCGT CGTCGGCGTC
GGCATGCGCA GCCACGCCGG CGTCGCCAGC ACCATGTTCC GCGCCCTGGC CGACCGCGGC
ATCAATATCC AGGCGATCAC CACCAGCGAG ATCAAGGTCT CGGTGCTGAT CGACGAGGAC
GAGACCGAAC TCGCGGTGCG CGTGCTGCAC ACCGCCTACG GCCTCGACGG CGAGTAA
 
Protein sequence
MARIVMKFGG TSMAGTERIR RVARIVQRQQ AAGHEVAVVV SAMAGETDRL VNFCREANPL 
YDPAEYDVVV ASGEQVTSGL LAMHLQALGC KARSWLGWQL PIHTDDAHSK ARIEGIDSEA
LLASMGAGEI AVIPGFQGLT ADNRVTTLGR GGSDTSAVAV AAAVKADRCD IYTDVDGVYT
TDPRIVAKAR KLKNVTYEEM LELASVGSKV LQTRSVSLAM KEGVRVQVLS SFIDDDAPAA
DTIPGTMIVS DEELEGLDME RQLITGIAAD KNEAKVTLTR IADRPGAVAA IFGPLAAANI
NVDMIIQNIA KDKGETDVTF TVPISDLART QALLEERKDT IGYYRMLANS KVAKISVVGV
GMRSHAGVAS TMFRALADRG INIQAITTSE IKVSVLIDED ETELAVRVLH TAYGLDGE